  1. PDFBox
  2. PDFBOX-861

German umlauts are not recognized

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Not A Problem
    • Affects Version/s: 1.3.1
    • Fix Version/s: None
    • Component/s: Text extraction
    • Labels: None
    • Environment: tika-0.8

    Description

      German umlauts are not recognized in this document:
      http://www.computing.dcu.ie/~irehbein/SS08/uebung1/stts-guide.pdf

      Guidelines f
      
      ur das Tagging deutscher Textcorpora

      Attachments

        1. stts-guide.pdf
          386 kB
          Jukka Zitting

            People

              Assignee: Unassigned
              Reporter: Reinhard Pötz
              Votes: 1
              Watchers: 2
