Uploaded image for project: 'Stanbol (Retired)'
  1. Stanbol (Retired)
  2. STANBOL-1285

FST Linking Engine / Linkable Token Filter should consider Chunks

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 0.12.0
    • 1.0.0, 0.12.1
    • Enhancement Engines
    • None

    Description

      The LinkableTokenFilter a Solr TokenFilter is used by the FST linking engine to add the TaggingAttribute (supported by the Solr Text Tagger library) to tokens that should be looked up in the FST - the vocabulary.

      This implementation can be improved by taking chunks into consideration that are

      • chunks representing named entities
      • processable (typically Noun phrases but no Verb phrases ...) AND
      • have a linkable token in the chunk OR
      • have two or more matchable tokens in the chunk

      All tokens in such chunks should be classified as tagable by setting the TaggingAttribute to true.

      Attachments

        Activity

          People

            rwesten Rupert Westenthaler
            rwesten Rupert Westenthaler
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: