Details

    • Type: Sub-task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.12.0
    • Component/s: Enhancer

    Description

      Stanbol defines a new specification for the Text Annotations that are returned as part of the results of an enhancement analysis. The specification is published on the official web site [1].

      The aim is to add head/tail and prefix/suffix information to a Text Annotation. This would greatly benefit dependent services that need to clean up textual content before sending it for analysis and that still need to link the identified entities back to the related Text Annotations. Once the text has been modified, the start/end offsets no longer point at the right characters and cannot be relied on for that linking.

      To jump-start support for the head/tail and prefix/suffix model, we created a TextAnnotations-NewModel engine that converts start/end information to head/tail/prefix/suffix information before the analysis results are returned to the client. The engine was previously announced on the dev mailing list [2].
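
      As a rough illustration of the conversion described above, here is a minimal Python sketch. It is not the engine's code: the Selection class, the to_selection and locate helpers, and the context_len and edge_len defaults are all hypothetical names and values chosen for this example. It derives prefix/head/tail/suffix strings from start/end offsets, then uses them to find the same mention in a client-side copy of the text whose offsets have shifted.

```python
from dataclasses import dataclass


@dataclass
class Selection:
    """Offset-free description of a selection (hypothetical structure)."""
    prefix: str  # characters immediately before the selection
    head: str    # first characters of the selection
    tail: str    # last characters of the selection
    suffix: str  # characters immediately after the selection


def to_selection(text, start, end, context_len=10, edge_len=3):
    """Convert start/end offsets into prefix/head/tail/suffix strings.

    context_len and edge_len are assumed values, not taken from the spec.
    """
    if not 0 <= start < end <= len(text):
        raise ValueError("start/end must delimit a non-empty span of text")
    selected = text[start:end]
    return Selection(
        prefix=text[max(0, start - context_len):start],
        head=selected[:edge_len],
        tail=selected[-edge_len:],
        suffix=text[end:end + context_len],
    )


def locate(text, sel):
    """Find the selection in a possibly shifted text.

    Returns (start, end) offsets into text, or None if no match is found.
    """
    pos = text.find(sel.prefix + sel.head)
    if pos == -1:
        return None
    start = pos + len(sel.prefix)
    closing = text.find(sel.tail + sel.suffix, start)
    if closing == -1:
        return None
    return start, closing + len(sel.tail)


# The analysed text and the offsets reported for the mention "Paris".
analysed = "Stanbol was presented in Paris last year."
sel = to_selection(analysed, 25, 30)

# The client's own copy carries extra leading content, so 25/30 are wrong there.
client_copy = "NEWS: " + analysed
span = locate(client_copy, sel)
if span is not None:
    print(client_copy[span[0]:span[1]])  # prints "Paris"
```

      The sketch only handles the case where the text around the mention is unchanged. If the clean-up step alters the characters right next to the selection, an exact string match like this one will fail and a more tolerant comparison would be needed.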

      It would be nice to have the engine [3] merged into trunk.

      [1]
      http://stanbol.apache.org/docs/trunk/components/enhancer/enhancementstructure.html#fisetextannotation

      [2]
      http://mail-archives.apache.org/mod_mbox/stanbol-dev/201211.mbox/%3CCAG94HGi2MiSWgtvYU7-bNqgQVmGRc0w7vL1CZEzV-Fc4XNSjrg@mail.gmail.com%3E

      [3]
      https://github.com/insideout10/wordlift-stanbol/tree/master/textannotations-futuremodel

      Attachments

        1. textannotations-futuremodel.zip
          171 kB
          David Riccitelli

          People

            Assignee: Rupert Westenthaler (rwesten)
            Reporter: David Riccitelli (davidr)