[PHOENIX-7319] Leverage Bloom Filters to improve performance on write path - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 5.2.0
Fix Version/s: 5.2.1, 5.3.0
Component/s: None
Labels:
None

Description

On the write path if the write is an atomic upsert or if the table has one or more indexes Phoenix first does a read. All these reads on the data table are point lookups. Bloom Filters can help optimize the performance of these lookups.

For new rows (inserts), the point lookup will not return any result. This negative lookup is ideal for bloom filters as our read can return by just checking the bloom filter block.
For updates, since new updates get accumulated into memstore and then flushed into new store files. A region can have multiple store files and when doing a read we have to read multiple store files. Bloom filter can help eliminate which store files should be read.

Attachments

Issue Links

links to

GitHub Pull Request #1897

Activity

People

Assignee:: Tanuj Khurana

Reporter:: Tanuj Khurana

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 27/May/24 20:02

Updated:: 30/May/24 18:07

Resolved:: 30/May/24 18:07