[AURORA-1181] optimize host_drain to speed up maintenance - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Task
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.8.0
Component/s: Maintenance
Labels:
None

Description

Aurora's maintenance primitives, whilst great, can be frustrating to use when dealing with large clusters, primarily due to the speed of draining hosts. The host_drain feature does accept a grouping function that can be used to drain hosts in batches, but for large clusters we typically don't want to arbitrarily divide the cluster into groups/batches and would prefer instead to drain everything that was requested, where possible, without violating the SLA.

eg, 100 hosts in need of maintenance, with each host running 1 task (of many) from 100 different jobs – all 100 hosts could be drained simultaneously without violating the SLA.

Attachments

Activity

People

Assignee:: David Robinson

Reporter:: Daniel Robinson

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 10/Mar/15 00:45

Updated:: 04/May/15 13:12

Resolved:: 12/Mar/15 23:50