Details
Bug
Status: Closed
Major
Resolution: Duplicate
3.1
None
Description
The TikaEntityProcessor does not select a parser on its own, so it extracts no data. The attached DIH config file works only if the Tika parser is specified explicitly with:
parser="org.apache.tika.parser.html.HtmlParser".
Remove that line and Tika contributes nothing to the document.
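
For readers without access to the attachment, a minimal sketch of a DIH config of this shape might look like the following. It is not the attached file: the entity name, the file path, and the field mappings are hypothetical placeholders. Only the processor name and the parser attribute come from the report above.

```xml
<dataConfig>
  <!-- Binary data source so Tika receives a raw stream (assumed setup) -->
  <dataSource type="BinFileDataSource"/>
  <document>
    <!-- With the explicit parser attribute, extraction works as reported.
         Deleting parser="..." reproduces the problem: no content is extracted.
         The url value is a hypothetical placeholder. -->
    <entity name="tika-test"
            processor="TikaEntityProcessor"
            url="/path/to/sample.html"
            format="text"
            parser="org.apache.tika.parser.html.HtmlParser">
      <!-- Hypothetical field mappings; adjust to the target schema -->
      <field column="title" name="title" meta="true"/>
      <field column="text" name="text"/>
    </entity>
  </document>
</dataConfig>
```

To check the behavior, one could run a full import with and without the parser attribute and compare whether the mapped fields are populated in the indexed document.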