[FLINK-25397] support grouped_execution - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 1.15.0
Fix Version/s: None
Component/s: Table SQL / Legacy Planner, Table SQL / Planner, Table SQL / Runtime
Labels:
None

Description

Performing data bucketing execution: two tables (orders, orders_item), divided into buckets (bucketing) based on the same fields (orderid) and the same number of buckets. In join by order id, join and aggregation calculations can be performed independently, because the same order ids of both tables are divided into buckets with the same ids.
This has several advantages：
1. Whenever a bucket of data is computed, the memory occupied by this bucket can be released immediately, so memory consumption can be limited by controlling the number of buckets processed in parallel.
2. reduces a lot of shuffling

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: ZhuoYu Chen

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 21/Dec/21 04:08

Updated:: 21/Dec/21 04:09