Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 2.6.0
    • Fix Version/s: None
    • Component/s: mrv2
    • Labels: None

    Description

      Problem
      Reducer gets stuck in the copy phase and doesn't make progress for a very long time. After killing this task manually a couple of times, it gets completed.

      Observations

      • Verified GC logs. Found no memory-related issues. Attached the logs.
      • Verified thread dumps. Found no thread-related problems.
      • On verification of the logs, the fetcher threads are not copying the map outputs; they are just waiting for the merge to happen.
      • The merge thread is alive and in a wait state.

      Analysis
      On careful observation of the logs, thread dumps and code, this looks to me like a classic multi-threading issue: the thread goes into the wait state after it has already been notified.

      Here is the suspect code flow.
      Thread #1
      Fetcher thread - notification comes first
      org.apache.hadoop.mapreduce.task.reduce.MergeThread.startMerge(Set<T>)

            synchronized(pendingToBeMerged) {
              pendingToBeMerged.addLast(toMergeInputs);
              pendingToBeMerged.notifyAll();
            }
      

      Thread #2
      Merge Thread - goes to wait state (Notification goes unconsumed)
      org.apache.hadoop.mapreduce.task.reduce.MergeThread.run()

              synchronized (pendingToBeMerged) {
                while(pendingToBeMerged.size() <= 0) {
                  pendingToBeMerged.wait();
                }
                // Pickup the inputs to merge.
                inputs = pendingToBeMerged.removeFirst();
              }
      
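
      For reference, here is a minimal, self-contained sketch of the handoff pattern quoted above. This is not the actual Hadoop code; the class name and the String placeholder for merge inputs are illustrative only.

      import java.util.LinkedList;

      // Illustrative sketch only; not the Hadoop MergeThread implementation.
      // Models the fetcher/merge-thread handoff described above: the producer
      // enqueues a batch of merge inputs and notifies, while the consumer waits
      // in a loop guarded by a size check and then dequeues a batch to merge.
      public class MergeHandoffSketch {

        private final LinkedList<String> pendingToBeMerged = new LinkedList<>();

        // Corresponds to MergeThread.startMerge(): called by a fetcher thread.
        public void startMerge(String mergeInputs) {
          synchronized (pendingToBeMerged) {
            pendingToBeMerged.addLast(mergeInputs);
            pendingToBeMerged.notifyAll();
          }
        }

        // Corresponds to the body of MergeThread.run(): the merge thread loop.
        public void mergeLoop() throws InterruptedException {
          while (true) {
            String inputs;
            synchronized (pendingToBeMerged) {
              // The wait is guarded by a size check inside a loop, so the thread
              // re-checks the queue after every wakeup (including spurious ones).
              while (pendingToBeMerged.size() <= 0) {
                pendingToBeMerged.wait();
              }
              inputs = pendingToBeMerged.removeFirst();
            }
            merge(inputs);
          }
        }

        private void merge(String inputs) {
          // Placeholder for the actual merge work.
          System.out.println("Merging: " + inputs);
        }

        public static void main(String[] args) throws Exception {
          MergeHandoffSketch sketch = new MergeHandoffSketch();
          Thread merger = new Thread(() -> {
            try {
              sketch.mergeLoop();
            } catch (InterruptedException ignored) {
            }
          });
          merger.setDaemon(true);
          merger.start();
          sketch.startMerge("segment-1");
          Thread.sleep(100); // give the merge thread a moment to run
        }
      }

      In this pattern the consumer re-checks the queue size every time it wakes up, which is relevant to the discussion in the comments below.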

      Attachments

        1. jstat-gc.log
          0.7 kB
          Laxman
        2. reducer-container-partial.log.zip
          1.15 MB
          Laxman
        3. thread-dumps.out
          246 kB
          Laxman

        Issue Links

          Activity

            lakshman Laxman added a comment -

            Thanks a lot, Jason, for the details. We are hitting exactly the same scenario (bad disk) as explained in MAPREDUCE-6334.
            We will try the patch and update the details in this JIRA.


            jlowe Jason Darrell Lowe added a comment -

            I suspect this is a duplicate of MAPREDUCE-6334. I see a lot of these types of messages in the reducer log:

            2015-05-01 19:59:37,632 WARN [fetcher#13] org.apache.hadoop.mapreduce.task.reduce.Fetcher: Shuffle output from glgs1190.grid.uh1.inmobi.com:13562 failed, retry it.
            

            I think it is leaking memory allocations from the shuffle errors, and the shuffle buffer runs out of available memory (hence fetchers are told to WAIT), but there isn't enough data in the shuffle buffer to trigger a merge. All of the memory that was leaked corresponds to fetches that will never complete, so it will never count toward kicking off the merge and unblocking the other threads.
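
            Purely as an illustration of that failure mode (this is not the Hadoop shuffle/merge code; every name below is hypothetical), a bounded shuffle-memory budget where a failed fetch never returns its reservation will eventually make every new fetcher wait, while the committed data never reaches the merge threshold, so nothing ever frees memory:

            // Hypothetical sketch of the leak scenario described above; not Hadoop code.
            // A fixed memory budget is reserved per fetch. Successful fetches are
            // committed and eventually merged; failed fetches here "forget" to release
            // their reservation, so usedMemory climbs while committedMemory does not,
            // and new fetchers are told to wait even though no merge will ever start.
            public class ShuffleMemorySketch {

              private final long memoryLimit;      // total shuffle buffer budget
              private final long mergeThreshold;   // committed bytes needed to start a merge
              private long usedMemory = 0;         // reserved (committed or leaked) bytes
              private long committedMemory = 0;    // bytes from successfully fetched outputs

              public ShuffleMemorySketch(long memoryLimit, long mergeThreshold) {
                this.memoryLimit = memoryLimit;
                this.mergeThreshold = mergeThreshold;
              }

              // Returns true if the fetcher may copy now, false if it must WAIT.
              public synchronized boolean reserve(long size) {
                if (usedMemory + size > memoryLimit) {
                  return false; // fetcher is told to WAIT
                }
                usedMemory += size;
                return true;
              }

              public synchronized void fetchSucceeded(long size) {
                committedMemory += size;
                if (committedMemory >= mergeThreshold) {
                  // a merge would run here and eventually release memory
                }
              }

              public synchronized void fetchFailed(long size) {
                // The bug being illustrated: the reservation is not released on
                // failure, so the budget leaks. The fix is to give the bytes back,
                // e.g. usedMemory -= size;
              }

              public static void main(String[] args) {
                ShuffleMemorySketch mem = new ShuffleMemorySketch(100, 80);
                // Repeated failed fetches of 30 bytes each leak the whole budget:
                for (int i = 0; i < 3; i++) {
                  if (mem.reserve(30)) {
                    mem.fetchFailed(30); // reservation never returned
                  }
                }
                // No fetcher can reserve more memory, and committedMemory never
                // reached the merge threshold, so nothing will ever free memory.
                System.out.println("can reserve 30 more bytes? " + mem.reserve(30));
              }
            }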

            lakshman Laxman added a comment -

            "Threads analysis" mentioned in description above found to be incorrect when I retrace the code flow. Pre-notification is not a problem as merger wait is guarded by size check.

            However, problem exists, fetchers are not proceeding and waiting for merger to free some memory and merge doing nothing.

            lakshman Laxman added a comment - "Threads analysis" mentioned in description above found to be incorrect when I retrace the code flow. Pre-notification is not a problem as merger wait is guarded by size check. However, problem exists, fetchers are not proceeding and waiting for merger to free some memory and merge doing nothing.
            lakshman Laxman added a comment -

            Attached the logs (container log, thread dumps, jstat output) for reference.

            Please note that my thoughts on the threading issue may be premature and incorrect. Irrespective of this analysis, the problem exists.


            People

              Assignee: lakshman Laxman
              Reporter: lakshman Laxman
              Votes: 0
              Watchers: 5

              Dates

                Created:
                Updated:
                Resolved: