Details
Type: New Feature
Status: Open
Priority: Minor
Resolution: Unresolved
ManifoldCF 1.1
Description
The Hydra Processing Framework was recently released into the wild.
Hydra aims to provide the missing piece for building great consolidated search solutions.
What is Hydra?
When working with free-text search using, for example, Apache Solr, the quality of the data in the index is a key factor in the quality of the results delivered. Hydra is designed to give the search solution the tools needed to modify the data that is to be indexed in an efficient and flexible way. It does this by providing a scalable pipeline that documents must pass through before being indexed into the search engine.
Architecturally, Hydra sits between the source integration and the search engine. A common use case would be to use Apache ManifoldCF to crawl a folder on a filesystem and send the documents to Hydra, which in turn processes them and dispatches the processed documents to Solr for indexing.
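To make the crawl, process, index flow concrete, here is a minimal sketch of the idea in Python. It is not Hydra's API: the stage functions, the `run_pipeline` helper, and the sample document are all hypothetical, and the returned list merely stands in for what would be dispatched to Solr.

```python
from typing import Callable, Dict, Iterable, List

# Assumption: a document is modeled as a flat dict of string fields.
Document = Dict[str, str]
Stage = Callable[[Document], Document]


def strip_whitespace(doc: Document) -> Document:
    """Hypothetical stage: trim surrounding whitespace from every field."""
    return {key: value.strip() for key, value in doc.items()}


def add_lowercase_title(doc: Document) -> Document:
    """Hypothetical stage: add a normalized copy of the title field."""
    enriched = dict(doc)
    enriched["title_lower"] = doc.get("title", "").lower()
    return enriched


def run_pipeline(docs: Iterable[Document], stages: List[Stage]) -> List[Document]:
    """Pass each document through every stage in order and collect the results."""
    processed = []
    for doc in docs:
        for stage in stages:
            doc = stage(doc)
        processed.append(doc)
    return processed


# Stand-in for documents handed over by the crawler (e.g. ManifoldCF).
crawled = [{"title": "  Quarterly Report ", "body": " Revenue grew. "}]

# Stand-in for the documents that would be sent on to the search engine.
indexed = run_pipeline(crawled, [strip_whitespace, add_lowercase_title])
```

In this sketch, a ManifoldCF integration would play the role of producing `crawled`, so the connector only needs to hand documents over and leave all processing to the pipeline.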
More information and the framework's code are available at:
https://github.com/Findwise/Hydra