Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
SystemML 0.11
-
None
-
None
Description
The implementation of our spark frame right indexing instruction unnecessarily tries to create aligned partial blocks, which causes unnecessary shuffle and merge issues (on merging these partial blocks) as frame can have variable block sizes. We should avoid merge/shuffle altogether and simply slice out the relevant sub blocks.
This issue results in errors such as:
org.apache.sysml.runtime.DMLRuntimeException: Mismatched frame block sizes for: 15 65539 11 65539 at org.apache.sysml.runtime.instructions.spark.utils.FrameRDDAggregateUtils$MergeBlocksFunction.call(FrameRDDAggregateUtils.java:82) at org.apache.sysml.runtime.instructions.spark.utils.FrameRDDAggregateUtils$MergeBlocksFunction.call(FrameRDDAggregateUtils.java:64)