Uploaded image for project: 'Ratis'
  1. Ratis
  2. RATIS-2129

Low replication performance because of lock contention on RaftLog

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Blocker
    • Resolution: Fixed
    • 3.1.0
    • 3.2.0
    • server

    Description

      Today, the GrpcLogAppender thread makes a lot of calls that need RaftLog's readLock. In an active environment, RaftLog is always busy appending transactions from clients, thus writeLock is frequently busy. This makes the replication performance slow. 

      See the dn_echo_leader_profile.html, or in the picture below, the purple is the time taken to acquire readLock from RaftLog.

      A summary of LockContention in Ratis. 

       

      Today, RaftLog consistency is protected by a global ReadWriteLock. (global means RaftLog has a single ReadWriteLock and the lock is acquired at the scope of the RaftLog instance, or a RaftGroup).

      In a RaftGroup, the following actors race to obtain this global ReadWriteLock in the leader node:

      • The writer, which is the GRPC Client Service, accepts transaction submissions from Raft clients and appends transactions (or log entries) to RaftLog. Each append operation needs to acquire the writeLock from RaftLog to put the transaction to RaftLog's memory queue. Although each of these append operations is quick, Ratis is designed to maximize transactions append and so the writeLock should be always busy.
      • StateMachineUpdater. For each transaction, when it is acknowledged by enough followers, this single thread actor will read the log from RaftLog and call StateMachine to apply the transaction. This actor acquires readLock from RaftLog for each log entry read. 
      • GrpcLogAppender: for each follower, there's a thread of GrpcLogAppender that constantly reads log entries from RaftLog and replicates them to the follower. This thread acquires readLock from RaftLog every time it reads a log entry.

      All writer, StateMachineUpdater, and GrpcLogAppender are all designed in a way to maximize their throughput. For instance,  StateMachineUpdater invokes StateMachine's applyTransaction as asynchronous calls. The same is the way GrpcLogAppender replicates log entries to the follower. 

      The global ReadWriteLock creates a tough contention between the RaftLog writers and readers. And that's what limit the ratis throughput down. The faster the writers and readers are, the more they block each other.

       

      Attachments

        1. ratis_ratfLog_lock_contention.png
          42 kB
          Duong
        2. Screenshot 2024-07-22 at 4.40.07 PM-1.png
          309 kB
          Duong
        3. Screenshot 2024-07-22 at 4.40.07 PM.png
          308 kB
          Duong
        4. image-2024-07-22-15-25-46-155.png
          283 kB
          Duong
        5. dn_echo_leader_profile.html
          467 kB
          Duong

        Issue Links

          Activity

            People

              szetszwo Tsz-wo Sze
              duongnguyen Duong
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 4h 10m
                  4h 10m