Description
Stanbol defines new specifications for the Text Annotations definitions as part of the result of an enhancement analysis. These specifications are published on the official web site [1].
Their aim is to add the head/tail and prefix/suffix information to a Text Annotation. This would greatly benefit dependent services that somehow need to "clean-up" the textual contents before sending them for analysis, while receiving meaningful information about linking the identified entities with the related Text Annotations (without using the thus unreliable start/end information).
In order to jump-start support for the head/tail and prefix/suffix model, we create a TextAnnotations-NewModel engine which is converting start/end information to head/tail/prefix/suffix information before the analysis results are returned to the client. This engine was previously announced to the dev mailing list [2].
It would be nice to have the engine [3] merged in the trunk.
[3]
https://github.com/insideout10/wordlift-stanbol/tree/master/textannotations-futuremodel