Uploaded image for project: 'Kudu'
  1. Kudu
  2. KUDU-821

Tight loop trying to send requests to dead server after logs GCed

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • Private Beta
    • Public beta
    • consensus
    • None

    Description

      We saw this issue take down a server on bolt after one of its followers died. Eventually, we GCed that follower's logs, and then we started logging about 25 times a second that we couldn't GC its logs.

      This also caused a lot of lock contention trying to write to other peers, etc, and make consensus more or less grind to halt.

      Attachments

        Issue Links

          Activity

            People

              tlipcon Todd Lipcon
              tlipcon Todd Lipcon
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: