Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
0.3.2
-
None
-
None
Description
Because of the logic in the JobTracker's pollForNewTask, second jobs will rarely start running maps until the first job finishes completely. The JobTracker leaves room to re-run failed maps from the first job and it reserves the total number of maps for the first job. Thus, if you have more maps in the first job than your cluster capacity, none of the second job maps will ever run.
I propose setting the reserve to 1% of the first job's maps.
Attachments
Attachments
Issue Links
- duplicates
-
HADOOP-295 jobs don't get executed in parallel
- Closed