Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-3102

Fetch failure of a speculated task causes job hang

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 0.7.0
    • 0.7.1, 0.8.3
    • None
    • None
    • Reviewed

    Description

      If a task speculates then succeeds, one task will be marked successful and the other killed. Then if the task retroactively fails due to fetch failures the Tez AM will fail to reschedule another task. This results in a hung job.

      Attachments

        1. TEZ-3102.003.patch
          7 kB
          Jason Darrell Lowe
        2. TEZ-3102.002.patch
          6 kB
          Jason Darrell Lowe
        3. TEZ-3102.001.patch
          6 kB
          Jason Darrell Lowe

        Issue Links

          Activity

            People

              jlowe Jason Darrell Lowe
              jlowe Jason Darrell Lowe
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: