Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-7386

Investigate providing some supervisor support for znode deletion

    XMLWordPrintableJSON

Details

    • Task
    • Status: Closed
    • Blocker
    • Resolution: Won't Fix
    • None
    • None
    • master, regionserver, scripts
    • None

    Description

      There a couple of JIRAs for deleting the znode on a process failure:
      HBASE-5844 (RS)
      HBASE-5926 (Master)
      which are pretty neat; on process failure, they delete the znode of the underlying process so HBase can recover faster.

      These JIRAs were implemented via the startup scripts; i.e. the script hangs around and waits for the process to exit, then deletes the znode.

      There are a few problems associated with this approach, as listed in the below JIRAs:
      1) Hides startup output in script
      https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463401&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463401
      2) two hbase processes listed per launched daemon
      https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463409&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463409
      3) Not run by a real supervisor
      https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463409&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463409
      4) Weird output after kill -9 actual process in standalone mode
      https://issues.apache.org/jira/browse/HBASE-5926?focusedCommentId=13506801&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13506801
      5) Can kill existing RS if called again
      https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463401&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463401
      6) Hides stdout/stderr[6]
      https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13506832&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13506832

      I suspect running in via something like supervisor.d can solve these issues if we provide the right support.

      Attachments

        1. HBASE-7386-v0.patch
          3 kB
          Gregory Chanan
        2. supervisordconfigs-v0.patch
          2 kB
          Gregory Chanan
        3. HBASE-7386-bin.patch
          36 kB
          Samir Ahmic
        4. HBASE-7386-conf.patch
          22 kB
          Samir Ahmic
        5. HBASE-7386-src.patch
          2 kB
          Samir Ahmic
        6. HBASE-7386-bin-v2.patch
          37 kB
          Samir Ahmic
        7. HBASE-7386-conf-v2.patch
          22 kB
          Samir Ahmic
        8. HBASE-7386-bin-v3.patch
          40 kB
          Samir Ahmic
        9. HBASE-7386-conf-v3.patch
          22 kB
          Samir Ahmic
        10. HBASE-7386-master-00.patch
          39 kB
          Samir Ahmic
        11. HBASE-7386-master-01.patch
          42 kB
          Samir Ahmic

        Issue Links

          Activity

            People

              Unassigned Unassigned
              gchanan Gregory Chanan
              Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: