[CASSANDRA-5272] Hinted Handoff Throttle based on cluster size - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Low
Resolution: Fixed
Fix Version/s: 1.2.6
Component/s: None
Labels:
- lhf

Description

For a 12-node EC2 m1.xlarge cluster, restarting a node causes it to get completely overloaded with the default 2-thread, 1024KB setting in 1.2.x. This seemed to be a smaller problem when it was 6-nodes, but still required us to abort handoffs. The old defaults in 1.1.x were WAY more conservative. I've dropped this way down to 128KB on our production cluster which is really conservative, but appears to have solved it. The default seems way too high on any cluster that is non-trivial in size.

After putting some thought to this, it seems that this should really be based on cluster size, making the throttle a "target" for how much write load a single node can swallow. As the cluster grows, the amount of hints that can be delivered by each other node in the cluster goes down, so the throttle should self-adjust to take that into account.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

5272.txt
22/May/13 20:11
2 kB
Jonathan Ellis

Issue Links

causes

CASSANDRA-15859 Avoid per-host hinted-handoff throttle being rounded to 0 in large cluster

Resolved

Activity

People

Assignee:: Jonathan Ellis

Reporter:: Rick Branson

Authors:: Jonathan Ellis

Reviewers:: Rick Branson

Tester:: Daniel Meyer

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 19/Feb/13 22:37

Updated:: 10/Jul/20 13:49

Resolved:: 29/May/13 20:00