Details
-
Sub-task
-
Status: Closed
-
Major
-
Resolution: Fixed
-
3.0.0-alpha1
-
None
-
Reviewed
Description
For long lived containers we don't want the AM to be a SPOF.
When the RM restarts a (failed) AM, it should be given the list of containers it had already been allocated. the AM should then be able to contact the NMs to get details on them. NMs would also need to do any binding of the containers needed to handle a moved/restarted AM.
Attachments
Attachments
Issue Links
- blocks
-
YARN-896 Roll up for long-lived services in YARN
- Open
- breaks
-
MAPREDUCE-5726 TestRMContainerAllocator#testCompletedTasksRecalculateSchedule fails
- Resolved
- is related to
-
YARN-1588 Rebind NM tokens for previous attempt's running containers to the new attempt
- Closed
-
MAPREDUCE-5743 TestRMContainerAllocator is failing
- Closed