Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-3054

Getting Unicode mapping error, file was Ok in 1.8

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Duplicate
    • 2.0.0
    • None
    • Text extraction

    Description

      Text extraction on attached file is getting many errors like:

      WARNING: No Unicode mapping for c (131) in font C0HR11_T1GI0361

      and then returning gibberish for all but the first 4 strings.

      In 1.8 all the text characters were correct. Fine in Acrobat, can copy/paste from there also.

      This has type 3 fonts.

      Tested against trunk build 20151024.140757-1624

      Attachments

        1. unicode mapping error.pdf
          66 kB
          Fred Andrews

        Issue Links

          Activity

            People

              tilman Tilman Hausherr
              fred_andrews Fred Andrews
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: