OpenNLP / OPENNLP-1446

Investigate why LeskEvaluatorTest and MFSEvaluatorTest fail while parsing 'EnglishLS.train'

Details

    Description

      The LeskEvaluatorTest and MFSEvaluatorTest in the opennlp-wsd sandbox component both fail while parsing the 'EnglishLS.train' file. The data file is kept unmodified, as downloaded from https://web.eecs.umich.edu/~mihalcea/senseval/senseval3/data.html

      Aims:

      • Investigate what causes the XML parsing to fail.
      • Fix it and make both existing tests pass.
      • Optional: Make the existing test code stricter.
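
      For the first aim, a minimal sketch of how the failing position could be located with the JDK's built-in SAX parser. The class name XmlWellFormednessProbe and the method firstError are hypothetical and not part of opennlp-wsd. The sample string with an unescaped '&' is only an illustration of a well-formedness error, not a confirmed finding about 'EnglishLS.train'.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.SAXParseException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not part of opennlp-wsd: reports where XML parsing first fails.
public class XmlWellFormednessProbe {

    /** Returns null if the input parses cleanly, otherwise "line:column: message". */
    public static String firstError(InputStream in) {
        try {
            // DefaultHandler rethrows fatal errors, so the first one ends the parse.
            SAXParserFactory.newInstance().newSAXParser().parse(in, new DefaultHandler());
            return null;
        } catch (SAXParseException e) {
            return e.getLineNumber() + ":" + e.getColumnNumber() + ": " + e.getMessage();
        } catch (Exception e) {
            // Configuration or I/O problems are not well-formedness errors.
            throw new IllegalStateException(e);
        }
    }

    /** Convenience overload for in-memory snippets. */
    public static String firstError(String xml) {
        return firstError(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
    }

    public static void main(String[] args) throws Exception {
        if (args.length > 0) {
            // Pass the path to the data file under investigation as the first argument.
            try (InputStream in = Files.newInputStream(Path.of(args[0]))) {
                System.out.println(firstError(in));
            }
        } else {
            // Illustrative input only: a bare '&' is not allowed in XML character data.
            System.out.println(firstError("<a>R&D</a>"));
        }
    }
}
```

      Running the class with the path to the data file as its only argument prints the line and column of the first fatal parse error, or null if the file is well-formed. That position is a starting point for inspecting the raw data.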

      Note:

      The test setup needed to reproduce this is on a branch and will be merged into the main branch.

          People

            mawiesne Martin Wiesner
            Votes: 0
            Watchers: 2

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated: Not Specified
                Remaining: 0h
                Logged: 5h