Details
-
Task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
Kudu_Impala
Description
Impala relies on bloom filters to reduce number of rows from coming out of the scan node for selective joins.
Queries get up to 20x speedup, not having bloom filter support in Kudu will create a big performance gap between Parquet and Kudu.
https://github.com/cloudera/Impala/blob/cdh5-trunk/be/src/util/bloom-filter.h
Attachments
Issue Links
- causes
-
IMPALA-13250 Document ENABLED_RUNTIME_FILTER_TYPES query option
- Resolved
- is related to
-
IMPALA-9691 Support Kudu Timestamp and Date Bloom Filter
- Resolved
-
IMPALA-4252 Add RuntimeFilters for min/max at KuduScanNode
- Resolved
- relates to
-
KUDU-2483 Scan tablets with bloom filter
- Resolved
-
IMPALA-9789 Disable ineffective bloom filters for Kudu scan
- Resolved