  1. Jackrabbit Oak
  2. OAK-1158 MVCC old revision cleanup
  3. OAK-1341

DocumentNodeStore: Implement revision garbage collection

Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.20
    • Component/s: mongomk
    • Labels: None

    Description

      For the MongoMK (as well as for other storage engines that are based on it), garbage collection is most easily implemented by iterating over all documents and removing unused entries (either whole documents, or data within the document).

      Iteration can be done in parallel (for example one process per shard), and it can be done in any order.
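
The full scan with one worker per shard can be sketched as follows. This is only an illustration under stated assumptions, not the DocumentNodeStore implementation: the class ParallelGcSketch, its methods, and the in-memory maps are hypothetical stand-ins, threads stand in for separate processes, and an entry counts as "unused" when its flag is false.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.Map;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

/** Hypothetical in-memory sketch; none of these names are Oak or MongoDB APIs. */
public class ParallelGcSketch {

    /**
     * Scans one shard in whatever order the map yields its documents.
     * A shard maps document id -> (entry key -> in-use flag); the flag is an
     * assumption standing in for the real "is this entry still needed" check.
     * Returns the number of entries removed.
     */
    static int collectShard(Map<String, Map<String, Boolean>> shard) {
        int removed = 0;
        Iterator<Map.Entry<String, Map<String, Boolean>>> docs = shard.entrySet().iterator();
        while (docs.hasNext()) {
            Map<String, Boolean> entries = docs.next().getValue();
            int before = entries.size();
            // Remove unused data within the document.
            entries.values().removeIf(inUse -> !inUse);
            removed += before - entries.size();
            // Remove the whole document once nothing is left in it.
            if (entries.isEmpty()) {
                docs.remove();
            }
        }
        return removed;
    }

    /** Runs one worker per shard and returns the total number of removed entries. */
    static int collectAll(List<Map<String, Map<String, Boolean>>> shards) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(Math.max(1, shards.size()));
        try {
            List<Future<Integer>> results = new ArrayList<>();
            for (Map<String, Map<String, Boolean>> shard : shards) {
                Callable<Integer> task = () -> collectShard(shard);
                results.add(pool.submit(task));
            }
            int total = 0;
            for (Future<Integer> result : results) {
                total += result.get(); // waits for that shard's worker to finish
            }
            return total;
        } finally {
            pool.shutdown();
        }
    }
}
```

Each worker touches only its own shard, so no locking is needed in this sketch. A real collector would also have to cope with writers changing documents during the scan.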

      The most efficient order is probably the id order; however, it might be better to iterate only over documents that were not changed recently, by using the index on the "_modified" property. That way we don't need to iterate over the whole repository over and over again, but just over those documents that were actually changed.
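
Using an index on the modification time to limit the scan could look roughly like the sketch below. It is one possible reading of the paragraph above, not the actual implementation: it visits only documents modified after the previous run and no later than a cutoff, so recently changed documents are skipped. The class ModifiedIndexGcSketch, the Doc and Store types, the deleted flag, and the "keep only the newest revision" rule are all hypothetical simplifications; a TreeMap stands in for the index on "_modified".

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.Iterator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical in-memory sketch; none of these names are Oak or MongoDB APIs. */
public class ModifiedIndexGcSketch {

    /** A document with a last-modified time, a deleted flag, and revision time -> value. */
    static final class Doc {
        final String id;
        final long modified;
        final boolean deleted;
        final TreeMap<Long, String> revisions = new TreeMap<>();

        Doc(String id, long modified, boolean deleted) {
            this.id = id;
            this.modified = modified;
            this.deleted = deleted;
        }
    }

    /** Documents by id, plus a sorted map standing in for the "_modified" index. */
    static final class Store {
        final Map<String, Doc> byId = new HashMap<>();
        final TreeMap<Long, List<String>> byModified = new TreeMap<>();

        void add(Doc doc) {
            byId.put(doc.id, doc);
            byModified.computeIfAbsent(doc.modified, k -> new ArrayList<>()).add(doc.id);
        }
    }

    /**
     * Visits only documents with lastRun < modified <= cutoff (requires lastRun <= cutoff).
     * Deleted documents are removed whole; live ones keep only their newest revision.
     * Returns {removed documents, removed revision entries}.
     */
    static int[] collect(Store store, long lastRun, long cutoff) {
        int removedDocs = 0;
        int removedEntries = 0;
        for (List<String> ids : store.byModified.subMap(lastRun, false, cutoff, true).values()) {
            Iterator<String> it = ids.iterator();
            while (it.hasNext()) {
                Doc doc = store.byId.get(it.next());
                if (doc.deleted) {
                    store.byId.remove(doc.id);
                    it.remove(); // also drop the index entry
                    removedDocs++;
                    continue;
                }
                while (doc.revisions.size() > 1) {
                    doc.revisions.pollFirstEntry(); // drop the oldest revision entry
                    removedEntries++;
                }
            }
        }
        return new int[] {removedDocs, removedEntries};
    }
}
```

Calling collect(store, lastRun, cutoff) after each run, with lastRun set to the previous cutoff, avoids rescanning documents that have not changed since they were last cleaned.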

            People

              Assignee: Chetan Mehrotra
              Reporter: Thomas Mueller