Details
- Sub-task
- Status: Resolved
- Major
- Resolution: Incomplete
- None
- None
Description
For a two-way shuffle join, if one or more groups are skewed in one table (say, the left table) but have a relatively small number of rows in the other table (say, the right table), we can use a broadcast join for the skewed groups and a shuffle join for the remaining groups.
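The idea can be sketched in plain Python, outside of Spark. This is only an illustration under stated assumptions, not the implementation proposed in this ticket: the function name `hybrid_skew_join`, the `skew_threshold` and `num_partitions` parameters, and the rule "a key is skewed when its left-side row count exceeds the threshold" are all hypothetical. Tables are modeled as lists of `(key, value)` pairs, and the join is an inner join.

```python
from collections import Counter, defaultdict


def hybrid_skew_join(left, right, skew_threshold, num_partitions=4):
    """Inner-join two lists of (key, value) pairs.

    Keys with more than `skew_threshold` rows on the left are treated as
    skewed (a hypothetical rule) and joined via a broadcast-style lookup;
    all other keys go through a simulated shuffle (hash-partitioned) join.
    Returns (joined_rows, skewed_keys).
    """
    left_counts = Counter(key for key, _ in left)
    skewed = {key for key, n in left_counts.items() if n > skew_threshold}

    joined = []

    # Broadcast path: the right-side rows for skewed keys are assumed to be
    # few, so they are collected into one lookup table that every left-side
    # row can probe without moving the large left side.
    broadcast = defaultdict(list)
    for key, value in right:
        if key in skewed:
            broadcast[key].append(value)
    for key, left_value in left:
        if key in skewed:
            for right_value in broadcast.get(key, ()):
                joined.append((key, left_value, right_value))

    # Shuffle path: both sides are hash-partitioned by key, then each
    # partition is joined independently. The skewed keys are excluded, so no
    # single partition receives the oversized groups.
    left_parts = defaultdict(list)
    right_parts = defaultdict(lambda: defaultdict(list))
    for key, value in left:
        if key not in skewed:
            left_parts[hash(key) % num_partitions].append((key, value))
    for key, value in right:
        if key not in skewed:
            right_parts[hash(key) % num_partitions][key].append(value)
    for part, rows in left_parts.items():
        for key, left_value in rows:
            for right_value in right_parts[part].get(key, ()):
                joined.append((key, left_value, right_value))

    return joined, skewed


# Example: key "a" is skewed on the left but has a single row on the right.
left_rows = [("a", i) for i in range(5)] + [("b", 10), ("c", 11)]
right_rows = [("a", "x"), ("b", "y"), ("b", "z")]
result, skewed_keys = hybrid_skew_join(left_rows, right_rows, skew_threshold=3)
```

The union of the two paths yields the same rows as an ordinary inner join; only the execution strategy per key differs. Row order in `result` is not guaranteed, so sort it before comparing.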
Attachments
Issue Links
- is duplicated by
  - SPARK-16753 Spark SQL doesn't handle skewed dataset joins properly (Resolved)
  - SPARK-17788 RangePartitioner results in few very large tasks and many small to empty tasks (Resolved)
  - SPARK-24799 A solution of dealing with data skew in left,right,inner join (Resolved)
- links to