Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-4830

We are reading from sub-directory cache when we have a view

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Not A Bug
    • 1.8.0
    • None
    • None

    Description

      git.commit.id.abbrev=f476eb5

      The below plan suggests we are not reading from the sub-directory cache file

      create or replace view l1 as select dir0 num, substr(dir1, 1, 2) let, extract(day from dir2) `day`, l_orderkey, l_partkey, l_suppkey, l_linenumber, l_quantity, l_extendedorice, l_discount, l_tax from l_3level;
      +-------+----------------------------------------------------------------------+
      |  ok   |                               summary                                |
      +-------+----------------------------------------------------------------------+
      | true  | View 'l1' replaced successfully in 'dfs.metadata_caching_pp' schema  |
      +-------+----------------------------------------------------------------------+
      1 row selected (0.355 seconds)
      
      explain plan for select num, let, `day`, l_orderkey from l2 where num=2 and let='tw' and `day` = 12;
      +------+------+
      | text | json |
      +------+------+
      | 00-00    Screen
      00-01      Project(num=[$0], let=[$1], day=[$2], l_orderkey=[$3])
      00-02        Project(num=[$0], let=[SUBSTR($1, 1, 2)], day=[EXTRACT(FLAG(DAY), $2)], l_orderkey=[$3])
      00-03          SelectionVectorRemover
      00-04            Filter(condition=[AND(=($0, 2), =(SUBSTR($1, 1, 2), 'tw'), =(EXTRACT(FLAG(DAY), $2), 12))])
      00-05              Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=/drill/testdata/metadata_caching_pp/l_3level/2/two/2015-8-12/40.parquet], ReadEntryWithPath [path=/drill/testdata/metadata_caching_pp/l_3level/2/two/2015-9-12/50.parquet], ReadEntryWithPath [path=/drill/testdata/metadata_caching_pp/l_3level/2/two/2015-7-12/30.parquet]], selectionRoot=/drill/testdata/metadata_caching_pp/l_3level, numFiles=3, usedMetadataFile=true, cacheFileRoot=/drill/testdata/metadata_caching_pp/l_3level/2/two, columns=[`dir0`, `dir1`, `dir2`, `l_orderkey`]]])
      

      I attached the data set required.

      Attachments

        1. l_3level.tgz
          13 kB
          Rahul Kumar Challapalli

        Activity

          People

            Unassigned Unassigned
            rkins Rahul Kumar Challapalli
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: