Gogula Aryalingam on 17 Jan 2015 03:15:10
I have come across so many public data repositories that hold data in PDF format. Other websites have tables within documents such as annual reports, also in PDF format. A data source for PDFs, or for tables from PDFs, would be awesome!
Administrator on 13 Apr 2019 00:56:11
The PDF connector is now generally available in the April release of Power BI Desktop. Learn more here: https://powerbi.microsoft.com/en-us/blog/power-bi-desktop-april-2019-feature-summary/#pdf
- Comments (284)
RE: Tables in PDF files
I also vote for PDF
RE: Tables in PDF files
This would be super for government data sources. Example: http://www.dfw.state.or.us/MRP/salmon/Historical_Data/docs/TrollEffTable.pdf
RE: Tables in PDF files
I'll add a third vote for this. As Gogula indicates, PDFs are the rule for a lot of public domain data on the Web, especially from the US Gov. Personally, I hate PDFs and my choice would be to simply make them illegal :), but if we have to live with them, we're going to need a way to mine the data from that hideous file format.
RE: Tables in PDF files
This is huge for CFO and CMO teams. Parsing financial reports is an essential task for any competitive analysis and strategic planning.