Details
-
Improvement
-
Status: Open
-
Minor
-
Resolution: Unresolved
-
4.8.1
-
None
-
All
-
New
Description
To promote code reuse, the customizability of the analyzers included with Lucene (e.g. EnglishAnalyzer) ought to be further improved.
To illustrate, it is currently difficult to specify general stemming behavior without having to modify each and every analyzer class. In our case, we had to change the constructors of every analyzer class to accept an AnalyzerOption argument.
The AnalyzerOption class has a getStemStrategy() method. StemStrategy is defined as follows:
public enum StemStrategy
{ AGGRESSIVE, LIGHT, NONE };
We needed to modify over 20 or so Lucene classes. This is obviously not ideal from a code reuse and maintainability standpoint.