[CASSANDRA-14281] Improve LatencyMetrics performance by reducing write path processing - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Normal
Resolution: Fixed
Fix Version/s: 4.0-alpha1, 4.0
Component/s: Feature/Lightweight Transactions, Legacy/Core
Labels:
- LWT
- perfomance

Description

Currently, every write, read, range query, and CAS operation that touches the CFS records a latency metric, and this takes a lot of processing time (up to 66% of the total processing time if the update was empty).

Latencies are recorded using both a Dropwizard "Timer" and a "Counter". The latter is used for totalLatency, and the former is a decaying metric used for rates and certain percentile metrics. We then replicate all of these CFS writes to the KeyspaceMetrics and globalWriteLatencies.

Instead of doing this replication on the write path, we should merge the metrics when they are read. Metric reads are much less common than writes, so this saves a lot of CPU time in total and also speeds up the write path.
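
As an illustration of the merge-on-read idea, here is a minimal sketch, not the code from the actual patch. The class name `MergedLatency` and its methods are hypothetical, and it tracks only a count and a total rather than a full Dropwizard Timer. Each table-level recorder registers itself with its keyspace-level and global recorders once, at construction time; the write path then touches only the table-level counters, and a parent sums its children when it is read.

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.atomic.LongAdder;

/** Hypothetical sketch: latency totals that are aggregated only when read. */
final class MergedLatency {
    private final LongAdder count = new LongAdder();
    private final LongAdder totalNanos = new LongAdder();
    // Recorders whose values are folded into this one at read time.
    private final List<MergedLatency> children = new CopyOnWriteArrayList<>();

    /** Registers this recorder with each parent once, instead of on every update. */
    MergedLatency(MergedLatency... parents) {
        for (MergedLatency parent : parents) {
            parent.children.add(this);
        }
    }

    /** Write path: updates only this recorder's own counters. */
    void addNano(long nanos) {
        count.increment();
        totalNanos.add(nanos);
    }

    /** Read path: own count plus the counts of all registered children. */
    long getCount() {
        long sum = count.sum();
        for (MergedLatency child : children) {
            sum += child.getCount();
        }
        return sum;
    }

    /** Read path: own total plus the totals of all registered children. */
    long getTotalNanos() {
        long sum = totalNanos.sum();
        for (MergedLatency child : children) {
            sum += child.getTotalNanos();
        }
        return sum;
    }
}
```

In this sketch a table recorder created as `new MergedLatency(keyspace, global)` costs two counter updates per operation regardless of how many parents it has. The trade-off is that reading a parent becomes proportional to the number of children.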

Currently, the DecayingEstimatedHistogramReservoir acquires a lock for each update operation, which causes contention when more than one thread is updating the histogram. This impacts scalability on larger machines. We should make it as lock-free as possible and also prevent a single CAS update from blocking all the concurrent threads from making an update.
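
To show what a lock-free update can look like, here is a simplified sketch, not the actual DecayingEstimatedHistogramReservoir. The class `LockFreeHistogram` is hypothetical and leaves out decay and rescaling entirely. It keeps the bucket boundaries immutable and stores the counts in an `AtomicLongArray`, so an update is a binary search followed by one atomic increment on a single bucket, and threads hitting different buckets do not wait on each other.

```java
import java.util.Arrays;
import java.util.concurrent.atomic.AtomicLongArray;

/** Hypothetical sketch: fixed-bucket histogram updated without a lock (no decay). */
final class LockFreeHistogram {
    private final long[] upperBounds;      // sorted, inclusive upper bound per bucket
    private final AtomicLongArray buckets; // one extra slot at the end for overflow

    LockFreeHistogram(long[] sortedUpperBounds) {
        this.upperBounds = sortedUpperBounds.clone();
        this.buckets = new AtomicLongArray(sortedUpperBounds.length + 1);
    }

    /** Records one value with a single atomic increment; takes no lock. */
    void update(long value) {
        int idx = Arrays.binarySearch(upperBounds, value);
        if (idx < 0) {
            // Not an exact bound: use the insertion point, i.e. the first
            // bound greater than value, or the overflow slot if none is.
            idx = -idx - 1;
        }
        buckets.incrementAndGet(idx);
    }

    /** Returns the current count in one bucket. */
    long bucketCount(int index) {
        return buckets.get(index);
    }

    /** Sums all buckets; not an atomic snapshot under concurrent updates. */
    long count() {
        long sum = 0;
        for (int i = 0; i < buckets.length(); i++) {
            sum += buckets.get(i);
        }
        return sum;
    }
}
```

Atomic increments on a heavily used bucket can still contend with each other. Striped counters such as `java.util.concurrent.atomic.LongAdder` are one common way to reduce that, at the cost of more memory per bucket.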

Attachments

- bench.png, 06/Apr/18 03:15, 6.85 MB, Chris Lohfink
- bench2.png, 06/Apr/18 04:14, 239 kB, Michael Shuler
- benchmark.html, 18/Apr/18 04:14, 366 kB, Chris Lohfink
- benchmark2.png, 18/Apr/18 04:12, 283 kB, Chris Lohfink

Issue Links

is related to

CASSANDRA-15213 DecayingEstimatedHistogramReservoir Inefficiencies

Resolved

links to

GitHub Pull Request #217

People

Assignee: Michael Burman

Reporter: Michael Burman

Authors: Michael Burman

Reviewers: Chris Lohfink

Votes: 0

Watchers: 13

Dates

Created: 28/Feb/18 12:50

Updated: 16/Mar/22 11:37

Resolved: 25/Apr/18 04:01

Time Tracking

Estimated: Not Specified

Remaining:

Logged: 10m