Kudu / KUDU-820

Add metrics for diagnosing recent cluster issues

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • 1.2.0
    • None
    • metrics, supportability
    • None

    Description

      We had a lot of difficulty troubleshooting some recent issues on bolt80. To make similar problems easier to diagnose, we need the following metrics added:

      • number of ops in the PREPARE queue
      • number of ops in the APPLY queue
      • number of milliseconds of spinlock contention (histogram?)
      • consensus error rate seen by leader
      • consensus RTT seen by leader
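
      The metrics listed above fall into three familiar shapes: gauges (queue depths), histograms (contention time, RTT), and a ratio of counters (error rate). The following is a minimal sketch of those shapes only. It is not Kudu's metrics API; the class names, metric names, and bucket bounds are all hypothetical and chosen purely for illustration.

      ```python
      import bisect
      import threading


      class Gauge:
          """A value that moves up and down, e.g. ops currently waiting in a queue."""

          def __init__(self):
              self._lock = threading.Lock()
              self._value = 0

          def increment(self, n=1):
              with self._lock:
                  self._value += n

          def decrement(self, n=1):
              with self._lock:
                  self._value -= n

          def value(self):
              with self._lock:
                  return self._value


      class Histogram:
          """Fixed-bucket histogram for durations in milliseconds."""

          def __init__(self, upper_bounds_ms):
              self._bounds = sorted(upper_bounds_ms)
              # One extra bucket catches samples above the largest bound.
              self._counts = [0] * (len(self._bounds) + 1)
              self._lock = threading.Lock()

          def record(self, millis):
              # Index of the first bucket whose upper bound is >= millis.
              idx = bisect.bisect_left(self._bounds, millis)
              with self._lock:
                  self._counts[idx] += 1

          def counts(self):
              with self._lock:
                  return list(self._counts)


      class ErrorRate:
          """Fraction of recorded requests that failed."""

          def __init__(self):
              self._lock = threading.Lock()
              self._total = 0
              self._errors = 0

          def record(self, ok):
              with self._lock:
                  self._total += 1
                  if not ok:
                      self._errors += 1

          def rate(self):
              with self._lock:
                  return self._errors / self._total if self._total else 0.0


      # Hypothetical instances mirroring the five requested metrics.
      prepare_queue_depth = Gauge()
      apply_queue_depth = Gauge()
      spinlock_contention_ms = Histogram([1, 10, 100, 1000])
      consensus_rtt_ms = Histogram([1, 10, 100, 1000])
      consensus_error_rate = ErrorRate()
      ```

      In this sketch a queue would call `increment()` on enqueue and `decrement()` on dequeue, and a leader would call `consensus_rtt_ms.record(elapsed_ms)` and `consensus_error_rate.record(ok)` once per request to a peer. A histogram rather than a single total is used for contention time so that a few long stalls are distinguishable from many short ones.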

            People

              Assignee: tlipcon Todd Lipcon
              Reporter: tlipcon Todd Lipcon
              Votes: 0
              Watchers: 2
