Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-12266

[Rust][DataFusion] Fix null handling hash join

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Invalid
    • None
    • None
    • Rust - DataFusion
    • None

    Description

      Improve null handling of 

      SELECT id1, id2 FROM (SELECT null AS id1) t1
      INNER JOIN (SELECT 0 AS id2) t2 ON id1 = id2

      > NULL, NULL

      (should be empty result set)

      We should filter beforehand to make this result correct. Also this can make things more efficient as the non-null filter can be pushed down which can lead to efficiency gains (making data-set smaller, not having to deal with nullable data, or even entire files could be skipped when they only contain nulls).

      Attachments

        Activity

          People

            Dandandan Daniël Heres
            Dandandan Daniël Heres
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: