Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
Impala 2.2.4
Description
Parquet stores min/max stats which can be used to skip reading blocks if they don't qualify a certain predicate
The query below ends up scanning all rows, which is not needed.
select count(*) from tpch_parquet.lineitem where l_orderkey = -1;
Attachments
Issue Links
- relates to
-
IMPALA-3909 Parquet file writer should populate the min/max statistics per block per column to be used by the reader
- Resolved