Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-16675

Introduce the possibility to rerank topK results with vector similarity functions using DenseVectorField

    XMLWordPrintableJSON

Details

    • Task
    • Status: Closed
    • Blocker
    • Resolution: Done
    • None
    • 9.3
    • None

    Description

      When using knnQParser in reranking pay attention to the top-K parameter.

      The second pass score(deriving from KNN search) is calculated only if the document d from the first pass is within the K nearest neighbors(in the whole index) of the target vector to search.

      This is a current limitation.

      The final ranked list of results will have the first pass score(main query q) combined with the second pass score(the approximated similarity function distance to the target vector to search).

      Ideally, it should be possible to:

      • Rerank top K results with vector similarity. We should compute the vector similarity function using the DenseVectorField value of all the documents in top K results without the need of running a KNN query.
      • Use only the second pass score as the final score

      Attachments

        Issue Links

          Activity

            People

              abenedetti Alessandro Benedetti
              eliaporciani Elia Porciani
              Votes:
              1 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 0.5h
                  0.5h