  1. Tika
  2. TIKA-950

Wrong Office Open XML detection in ZipContainerDetector


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Won't Fix
    • Affects Version/s: 1.1
    • Fix Version/s: None
    • Component/s: mime

    Description

      The detectOfficeOpenXML() method in the ZipContainerDetector class does not correctly detect an OOXML file, because POI's OPCPackage.open(..) throws an exception. When its open method is called with a file path string (as Tika does), OPCPackage uses ZipFile, and that call path raises the exception; passing a FileInputStream instead lets POI detect Office Open XML formats correctly.
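
      To illustrate the stream-based approach described above, here is a minimal sketch that uses only the JDK, not POI, and is not the attached patch. It assumes that finding a top-level "[Content_Types].xml" entry is enough to call a zip container OOXML-like; the class and method names (OoxmlStreamSniffer, looksLikeOoxml, zipWithEntries) are hypothetical. The point is only that the container is read through an InputStream rather than opened by path with ZipFile.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

public class OoxmlStreamSniffer {

    /**
     * Assumption for this sketch: a zip whose entries include a top-level
     * "[Content_Types].xml" is treated as an OOXML-like package.
     * The container is read from a stream, never opened by file path.
     */
    static boolean looksLikeOoxml(InputStream in) throws IOException {
        try (ZipInputStream zip = new ZipInputStream(in)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if ("[Content_Types].xml".equals(entry.getName())) {
                    return true;
                }
            }
        }
        return false;
    }

    /** Hypothetical helper: builds a small in-memory zip with the given entry names. */
    static byte[] zipWithEntries(String... names) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            for (String name : names) {
                zip.putNextEntry(new ZipEntry(name));
                zip.write("<x/>".getBytes(StandardCharsets.UTF_8));
                zip.closeEntry();
            }
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        byte[] docxLike = zipWithEntries("[Content_Types].xml", "word/document.xml");
        byte[] plainZip = zipWithEntries("readme.txt");
        System.out.println(looksLikeOoxml(new ByteArrayInputStream(docxLike)));
        System.out.println(looksLikeOoxml(new ByteArrayInputStream(plainZip)));
    }
}
```

      In a real detector the stream would come from the file being inspected (for example a FileInputStream), and the package would be handed to POI rather than checked by entry name alone.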

      Attachments

        1. ZipContainerDetector.diff
          1 kB
          Marco Quaranta

        Activity

          People

            Assignee: Unassigned
            Reporter: Marco Quaranta
            Votes: 0
            Watchers: 3
