Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
New
Description
SimpleNaiveBayesClassifier users term frequencies with add one smoothing to calculate likelihood and just tf for prior. Given Lucene has switched to BM25 it would be better to have a different impl which uses BM25
scoring as a probability measure of both prior and likelihood.