HBASE-19848: Zookeeper thread leaks in hbase-spark bulkLoad method

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.2.0
    • Fix Version/s: 3.0.0-alpha-1
    • Component/s: spark, Zookeeper
    • Environment: hbase-spark-1.2.0-cdh5.12.1, Spark 1.6
    • Hadoop Flags: Reviewed
    • Flags: Patch, Important

    Description

      In the hbase-spark project, HBaseContext provides a bulkLoad method for loading Spark RDD data into HBase easily. But when I call it frequently, the program throws a "cannot create native thread" exception.

      Running pstack against the Spark driver process shows that the thread count keeps increasing.

      jstack shows a large number of threads named "main-SendThread" and "main-EventThread".

      It seems that a connection is created before the bulk load, but its close method is never invoked afterwards.
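
      The suspected pattern can be illustrated with a minimal, self-contained Java sketch. It does not use HBase or the code from the attached patches: FakeConnection is a hypothetical stand-in that starts two background threads named like the ones seen in jstack, and countThreads, leakyLoad and safeLoad are illustrative names only. It shows how a connection that is never closed leaves its threads behind, and how try-with-resources avoids that.

```java
import java.util.concurrent.CountDownLatch;

class ConnectionLeakSketch {

    /** Hypothetical stand-in for a ZooKeeper-backed connection: it owns two background threads. */
    static final class FakeConnection implements AutoCloseable {
        private final CountDownLatch closed = new CountDownLatch(1);
        private final Thread sendThread;
        private final Thread eventThread;

        FakeConnection() {
            sendThread = startWorker("main-SendThread");
            eventThread = startWorker("main-EventThread");
        }

        private Thread startWorker(String name) {
            Thread t = new Thread(() -> {
                try {
                    closed.await(); // park until close() is called
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            }, name);
            t.setDaemon(true); // leaked threads must not keep the JVM alive in this demo
            t.start();
            return t;
        }

        @Override
        public void close() {
            closed.countDown(); // release both workers
            try {
                sendThread.join();
                eventThread.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
    }

    /** Counts live threads whose name ends with the suffix, similar to grepping jstack output. */
    static long countThreads(String suffix) {
        return Thread.getAllStackTraces().keySet().stream()
                .filter(Thread::isAlive)
                .filter(t -> t.getName().endsWith(suffix))
                .count();
    }

    /** Leaky pattern: the connection is created but never closed. */
    static void leakyLoad() {
        FakeConnection conn = new FakeConnection();
        // ... work with conn, then return without calling conn.close()
    }

    /** Safe pattern: try-with-resources closes the connection even if the work throws. */
    static void safeLoad() {
        try (FakeConnection conn = new FakeConnection()) {
            // ... work with conn
        }
    }

    public static void main(String[] args) {
        for (int i = 0; i < 5; i++) safeLoad();
        System.out.println("SendThread count after safeLoad:  " + countThreads("SendThread"));
        for (int i = 0; i < 5; i++) leakyLoad();
        System.out.println("SendThread count after leakyLoad: " + countThreads("SendThread"));
    }
}
```

      In real HBase client code the same shape should apply to a Connection obtained from ConnectionFactory.createConnection(conf), since Connection is Closeable; whether the attached patches take exactly this approach is not stated here.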

      Attachments

        1. HBaseContext.scala
          41 kB
          Key Hutu
        2. HBaseContext.patch
          16 kB
          Key Hutu
        3. HBASE-19848-V2.patch
          1 kB
          Key Hutu
        4. HBASE-19848-V3.patch
          0.7 kB
          Key Hutu

          People

            Assignee: Key Hutu
            Reporter: Key Hutu
            Votes: 0
            Watchers: 5

              Time Tracking

                Estimated: 72h
                Remaining: 72h
                Logged: Not Specified