Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-3088

java.lang.NullPointerException when converting Open Office presentation (.odp) to html

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.23
    • 1.24.1
    • app
    • None

    Description

      The attempt to convert an odp file to html format ends with this:

      D:\>java -jar tika-app-1.23.jar --html D:\testdata\Presentations\11.odp

      Exception in thread "main" org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.odf.OpenDocumentParser@710f4dc7        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:282)        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)        at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:209)        at org.apache.tika.cli.TikaCLI.process(TikaCLI.java:496)        at org.apache.tika.cli.TikaCLI.main(TikaCLI.java:149)

      Caused by: java.lang.NullPointerException        at com.sun.org.apache.xml.internal.serializer.ToHTMLStream.endElement(Unknown Source)        at com.sun.org.apache.xalan.internal.xsltc.trax.TransformerHandlerImpl.endElement(Unknown Source)        at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)        at org.apache.tika.sax.ExpandedTitleContentHandler.endElement(ExpandedTitleContentHandler.java:70)        at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)        at org.apache.tika.sax.SecureContentHandler.endElement(SecureContentHandler.java:256)        at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)        at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)        at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)        at org.apache.tika.sax.SafeContentHandler.endElement(SafeContentHandler.java:274)        at org.apache.tika.sax.XHTMLContentHandler.endElement(XHTMLContentHandler.java:271)        at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)        at org.apache.tika.parser.odf.OpenDocumentContentParser$OpenDocumentElementMappingContentHandler.endElement(OpenDocumentContentParser.java:425)        at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)        at org.apache.tika.parser.odf.NSNormalizerContentHandler.endElement(NSNormalizerContentHandler.java:75)        at org.apache.tika.sax.ContentHandlerDecorator.endElement(ContentHandlerDecorator.java:136)        at org.apache.xerces.parsers.AbstractSAXParser.endElement(Unknown Source)        at org.apache.xerces.impl.XMLNSDocumentScannerImpl.scanEndElement(Unknown Source)        at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source)        at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source)        at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)        at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source)        at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)        at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)        at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source)        at org.apache.xerces.jaxp.SAXParserImpl.parse(Unknown Source)        at javax.xml.parsers.SAXParser.parse(Unknown Source)        at org.apache.tika.utils.XMLReaderUtils.parseSAX(XMLReaderUtils.java:491)        at org.apache.tika.parser.odf.OpenDocumentContentParser.parseInternal(OpenDocumentContentParser.java:599)        at org.apache.tika.parser.odf.OpenDocumentParser.handleZipEntry(OpenDocumentParser.java:220)        at org.apache.tika.parser.odf.OpenDocumentParser.handleZipFile(OpenDocumentParser.java:204)        at org.apache.tika.parser.odf.OpenDocumentParser.parse(OpenDocumentParser.java:157)        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)        ... 5 more

       

       

      Attachments

        1. 11.odp
          1.86 MB
          Vladimir Kotik
        2. 7.odp
          492 kB
          Vladimir Kotik
        3. 1.odp
          421 kB
          Vladimir Kotik

        Activity

          People

            Unassigned Unassigned
            kotik Vladimir Kotik
            Votes:
            1 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: