Details
Description
PDFBOX-1769 introduced a "self healing" mechanism to repair corrupt XRef offsets. But that one was just a starter and there remain a lot of issues to be solved. I'm planing to solve at least some of them.
All fixes and improvements are targeting the non-sequential parser and I won't port those changes to the old parser.
Attachments
Attachments
Issue Links
- duplicates
-
PDFBOX-2331 Expected a long type at offset 22112, instead got '_AJ1T'
- Closed
- is related to
-
PDFBOX-1918 PDF with incorrect startxref
- Closed
-
PDFBOX-2441 Improve XRef self healing mechanism when more than one xref table
- Closed
-
TIKA-1300 Switch default PDFBox parser to NonSequentialParser
- Resolved
- relates to
-
PDFBOX-1738 PDF with parsing IOException
- Closed