Details
- Type: Improvement
- Status: Open
- Priority: Major
- Resolution: Unresolved
Description
If multiple applications cannot be recovered, the admin is forced to repeatedly attempt to start the RM, check the logs, and purge the offending app. Worse, the admin has no information about the app that was purged, other than the ID.
This JIRA proposes adding a -force-recovery option to the resourcemanager CLI that automatically purges any app that fails to recover, after first dumping its full application profile to a log. It would be nice if the option prompted the user with an "are you sure?" confirmation before continuing.
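The proposed behavior could be sketched roughly as follows. This is only an illustration under stated assumptions: ForceRecoverySketch, recoverAll, the map-based state store, and the canRecover predicate are all hypothetical names invented for this sketch, not existing YARN classes or APIs, and the "are you sure?" prompt is left out.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Predicate;

/** Hypothetical sketch; none of these names are actual YARN classes. */
public class ForceRecoverySketch {

    /**
     * Tries to recover every stored app. With forceRecovery set, an app that
     * fails to recover has its full stored profile written to the log and is
     * then purged from the store. Without it, the first failure aborts
     * startup, which is the situation this issue describes.
     *
     * @param store      app ID to stored application profile (stand-in for the state store)
     * @param canRecover stand-in for the real recovery attempt
     * @return the IDs of the apps that were purged
     */
    public static List<String> recoverAll(Map<String, String> store,
                                          Predicate<String> canRecover,
                                          boolean forceRecovery,
                                          List<String> log) {
        List<String> purged = new ArrayList<>();
        // Iterate over a copy of the keys so entries can be removed safely.
        for (String appId : new ArrayList<>(store.keySet())) {
            if (canRecover.test(appId)) {
                continue;
            }
            if (!forceRecovery) {
                throw new IllegalStateException("Failed to recover " + appId);
            }
            // Dump the full profile before purging, so the admin keeps more than the ID.
            log.add("Purging unrecoverable app " + appId + ": " + store.get(appId));
            store.remove(appId);
            purged.add(appId);
        }
        return purged;
    }

    public static void main(String[] args) {
        Map<String, String> store = new LinkedHashMap<>();
        store.put("app_1", "profile of app_1");
        store.put("app_2", "profile of app_2");
        List<String> log = new ArrayList<>();
        // Pretend app_2 is the one that cannot be recovered.
        List<String> purged = recoverAll(store, id -> !id.equals("app_2"), true, log);
        System.out.println("Purged: " + purged);
        log.forEach(System.out::println);
    }
}
```

The point of the sketch is the ordering: the profile is logged before the entry is removed, and all failing apps are handled in a single startup pass rather than one RM restart per offending app.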
Issue Links
- is related to YARN-6031 Application recovery has failed when node label feature is turned off during RM recovery (Resolved)