Uploaded image for project: 'ManifoldCF'
  1. ManifoldCF
  2. CONNECTORS-475

A Hydra Output Connector

    XMLWordPrintableJSON

Details

    Description

      Hydra Processing Framework was recently released into the wild.

      Hydra offers to solve the the missing piece into creating great consolidated search solutions.

      What is Hydra?
      When working with free text search using for example Apache Solr the quality of the data in the index is a key factor of the quality of the results delivered. Hydra is designed to give the search solution the tools necessary to modify the data that is to be indexed in an efficient and flexible way. This is done by providing a scalable and efficient pipeline which the documents will have to pass through before being indexed into the search engine.

      Architecturally Hydra sits in between the search engine and the source integration. A common use-case would be to use Apache Manifold CF to crawl a folder on a filesystem and send the documents to hydra which in turn will process and dispatch processed documents to Solr for indexing.

      More information and code to the framework on
      https://github.com/Findwise/Hydra

      Attachments

        Activity

          People

            erlendfg Erlend GarĂ¥sen
            ebbesson Magnus Ebbesson
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: