  1. Apache Any23 (Retired)
  2. ANY23-318

ExtractionException handling in BaseRDFExtractor.java kills entire extraction

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 2.1
    • Fix Version/s: 2.2
    • Component/s: core, extractors
    • Labels: None

    Description

      Currently, the following catch block in BaseRDFExtractor.java rethrows every RDFParseException as an ExtractionException, which aborts the entire extraction. I propose logging the error instead and continuing with the extraction.

               } catch (RDFParseException ex) {
      -            throw new ExtractionException("Error while parsing RDF document.", ex, extractionResult);
      +            LOG.error("Error while parsing RDF document.", ex, extractionResult);
               }
           }
      

      The parsing strictness is inherited from the underlying Semargl parsers, which expect syntactically perfect input. Data in the wild rarely meets that standard.
      The solution is to log the exception and any related issues, then carry on with the extraction.
      Patch coming up.
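
The log-and-continue behaviour proposed above can be sketched in isolation. This is a minimal illustration, not Any23 code: the class LenientExtractionSketch, the exception ParseFailure, and the methods parseStrict and extractAll are hypothetical stand-ins, and java.util.logging is used only to keep the example self-contained. It assumes several inputs are processed in one run and that one malformed input should not discard the results of the others.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.logging.Level;
import java.util.logging.Logger;

public class LenientExtractionSketch {

    private static final Logger LOG =
            Logger.getLogger(LenientExtractionSketch.class.getName());

    /** Hypothetical stand-in for a strict parser's checked syntax exception. */
    static class ParseFailure extends Exception {
        ParseFailure(String message) {
            super(message);
        }
    }

    /** Hypothetical strict parser: rejects any statement not terminated by '.'. */
    static String parseStrict(String statement) throws ParseFailure {
        String trimmed = statement.trim();
        if (!trimmed.endsWith(".")) {
            throw new ParseFailure("Missing terminating '.' in: " + statement);
        }
        return trimmed;
    }

    /** Parses every input; a failure is logged and skipped rather than rethrown. */
    public static List<String> extractAll(List<String> statements) {
        List<String> results = new ArrayList<>();
        for (String statement : statements) {
            try {
                results.add(parseStrict(statement));
            } catch (ParseFailure ex) {
                // Pass the exception as the dedicated throwable argument so the
                // logger records its message and stack trace, then move on.
                LOG.log(Level.SEVERE, "Error while parsing RDF document.", ex);
            }
        }
        return results;
    }

    public static void main(String[] args) {
        List<String> inputs = Arrays.asList("<a> <b> <c> .", "<a> <b", "<d> <e> <f> .");
        // The malformed second input is logged; the other two are still returned.
        System.out.println(extractAll(inputs).size());
    }
}
```

The trade-off in this sketch is that callers no longer learn about failures through an exception, so the log becomes the only record of skipped input.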

            People

              Assignee: Lewis John McGibbney (lewismc)
              Reporter: Lewis John McGibbney (lewismc)
              Votes: 0
              Watchers: 3
