Details
Bug
Status: Open
Major
Resolution: Unresolved
1.15
None
Windows, Linux; Apache Tika 1.15 used with Apache Solr 6.6.0
Important
Description
Hello,
I am using Tika 1.15 with Solr 6.6.0 for indexing and searching. This setup fails to index text that sits in a table inside a textbox in a Word document.
An MS Word document contains two words:
1. Germany - present in a table inside a textbox
2. Africa - present in a textbox
Germany is not indexed, while Africa is indexed successfully. It looks like Tika fails to extract content from a table inside a textbox.
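To narrow down whether the word is lost during text extraction or later in Solr, the extracted plain text can be checked on its own. The following is only a minimal sketch, not part of Tika or Solr: the class name `ExtractionCheck` and the method `missingWords` are hypothetical, and it assumes the extracted text is supplied on standard input.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.stream.Collectors;

// Hypothetical helper for this report; not part of Tika or Solr.
public class ExtractionCheck {

    // Returns the expected words that do not occur in the extracted text
    // (case-insensitive substring match).
    static List<String> missingWords(String extractedText, List<String> expected) {
        String haystack = extractedText.toLowerCase(Locale.ROOT);
        List<String> missing = new ArrayList<>();
        for (String word : expected) {
            if (!haystack.contains(word.toLowerCase(Locale.ROOT))) {
                missing.add(word);
            }
        }
        return missing;
    }

    public static void main(String[] args) throws IOException {
        // Read the extractor's plain-text output from standard input.
        String text;
        try (BufferedReader in = new BufferedReader(
                new InputStreamReader(System.in, StandardCharsets.UTF_8))) {
            text = in.lines().collect(Collectors.joining("\n"));
        }
        // The two words contained in the sample document.
        List<String> missing = missingWords(text, Arrays.asList("Germany", "Africa"));
        System.out.println(missing.isEmpty()
                ? "all words extracted"
                : "missing from extracted text: " + missing);
    }
}
```

One way to feed it is to pipe in the plain-text output of the Tika command-line app, for example `java -jar tika-app-1.15.jar --text <document> | java ExtractionCheck`, where the jar name and the document path are placeholders. If Germany is reported as missing at this stage, the problem lies in extraction rather than in the Solr index.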
Please have a look.
Thanks,
Amit Humnabadkar
doc001.zip