Description
This umbrella JIRA tracks the progress of implementing the Storage Partitioned Join (SPJ) feature for Spark.
Issue Links
- depends upon
  - SPARK-35703 Relax constraint for Spark bucket join and remove HashClusteredDistribution (Resolved)
- is a parent of
  - SPARK-48613 Umbrella: Storage Partition Join Improvements (Reopened)
- relates to
  - SPARK-40295 Allow v2 functions with literal args in write distribution and ordering (Resolved)
  - SPARK-40508 Treat unknown partitioning as UnknownPartitioning (Resolved)
  - SPARK-48030 InternalRowComparableWrapper should cache rowOrdering to improve performance (Resolved)
1. SPJ: Introduce a new DataSource V2 interface SupportsPushDownClusterKeys | In Progress | Unassigned
2. SPJ: Include keyGroupedPartitioning in StoragePartitionJoinParams equality check | Open | Unassigned
3. Improve picking the side of partially clustered distribution according to partition size | Open | Unassigned