Add the ability to capture tabular PDF data regardless of how formatted in the PDF
The ability to identify and automatically parse the values of multi page tabular data without non columnar data (headings, page breaks and numbers, page headings, page column headings) on the second and subsequent pages corrupting the excel columns of the tabled data regardless of how formatted in the PDF.
PDFs include tabular data in various formats:
• Multiple pages/single page
• As a PDF table with headings on every page and typically page headers, page numbers, and other data that destroy any parsing I know how to use
• Tabulated data formatted as simple text that one can only capture as raw text
• Null values (spaces) that shift every subsequent cell in the table to the right