Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
Description
From 0.5 version on Tika has a LanguageIdentifier to extract document language (using NGrams) so it would be good to add that capability too.