Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
Currently the framework sorts the input splits by size before the job is submitted. This makes it very difficult to run map only jobs that transform the input because the assignment of input names to output names isn't obvious. We fixed this once in HADOOP-1440, but the fix was broken so it was rolled back.
Attachments
Issue Links
- is blocked by
-
MAPREDUCE-207 Computing Input Splits on the MR Cluster
- Open