Details
- Type: Improvement
- Status: Closed
- Priority: Blocker
- Resolution: Fixed
Description
Currently, the sort/spill works as follows:
Let r be the number of partitions
For each call to collect(K,V) from map:
- If buffers do not exist, allocate a new DataOutputBuffer to collect K,V bytes, allocate r buffers for collecting K,V offsets
- Write K,V into buffer, noting offsets
- Register the offsets with the associated partition buffer, reallocating and copying the accounting buffers if necessary
- Calculate the total mem usage for buffer and all partition collectors by iterating over the collectors
- If total mem usage is greater than half of io.sort.mb, then start a new thread to spill, blocking if another spill is in progress
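The collect path above can be sketched roughly as follows. This is a minimal, illustrative model, not the actual MapTask.MapOutputBuffer code: the class and field names are hypothetical, io.sort.mb is scaled down to bytes, and the spill thread is reduced to a flag.

```java
import java.io.ByteArrayOutputStream;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of the collect(K,V) bookkeeping described above.
class CollectSketch {
    static final int SORT_BYTES = 4 * 1024; // stand-in for io.sort.mb, in bytes
    ByteArrayOutputStream kvBuffer;         // collects serialized K,V bytes
    List<List<Integer>> partitionOffsets;   // r buffers of K,V offsets
    final int numPartitions;                // r
    boolean spillRequested = false;         // stands in for starting the spill thread

    CollectSketch(int numPartitions) { this.numPartitions = numPartitions; }

    void collect(byte[] key, byte[] value, int partition) {
        // Lazily allocate the byte buffer and the r offset buffers.
        if (kvBuffer == null) {
            kvBuffer = new ByteArrayOutputStream();
            partitionOffsets = new ArrayList<>();
            for (int i = 0; i < numPartitions; i++)
                partitionOffsets.add(new ArrayList<>());
        }
        // Write K,V into the buffer, noting the record's offset.
        int offset = kvBuffer.size();
        kvBuffer.write(key, 0, key.length);
        kvBuffer.write(value, 0, value.length);
        // Register the offset with this partition's accounting buffer;
        // ArrayList growth copies its backing array, the cost the issue targets.
        partitionOffsets.get(partition).add(offset);
        // Trigger a spill once usage exceeds half the sort buffer.
        if (memUsage() > SORT_BYTES / 2) spillRequested = true;
    }

    // Total usage: iterate over the buffer and every partition collector.
    int memUsage() {
        int total = kvBuffer.size();
        for (List<Integer> offsets : partitionOffsets)
            total += 4 * offsets.size(); // 4 bytes per recorded int offset
        return total;
    }
}
```

Note that the memory check iterates over every partition collector on each call, which is one of the per-record costs this issue is concerned with.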
For each spill (assuming no combiner):
- Save references to our K,V byte buffer and accounting data, setting the former to null (will be recreated on the next call to collect(K,V))
- Open a SequenceFile.Writer for this partition
- Sort each partition separately (the current sort reuses IntWritable objects, but each index must still be wrapped in one)
- Build a RawKeyValueIterator of sorted data for the partition
- Deserialize each key and value and call SequenceFile::append(K,V) on the writer for this partition
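The per-partition spill loop can be sketched as below. This is an illustrative stand-in, not the real spill code: SequenceFile.Writer and RawKeyValueIterator are elided, keys are compared as raw bytes via String for brevity, and materializing each record into a fresh array models the deserialize/reserialize copy the issue wants to avoid.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Illustrative sketch of spilling one partition (no combiner).
class SpillSketch {
    // records: {keyOffset, keyLen, valLen} triples indexing into kvBytes.
    static List<byte[]> spillPartition(byte[] kvBytes, List<int[]> records) {
        // Sort this partition's records by key.
        List<int[]> sorted = new ArrayList<>(records);
        sorted.sort(Comparator.comparing(
            (int[] r) -> new String(kvBytes, r[0], r[1])));
        // "Append" each record in sorted order; copying it out models the
        // extra serialization pass the real spill performs per record.
        List<byte[]> out = new ArrayList<>();
        for (int[] r : sorted) {
            byte[] record = new byte[r[1] + r[2]];
            System.arraycopy(kvBytes, r[0], record, 0, record.length);
            out.add(record);
        }
        return out;
    }
}
```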
There are several opportunities to reduce the copies, allocations, and operations performed in this stage, particularly since growing many of the buffers involved requires copying the existing data into the newly sized allocation.
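The growth cost mentioned above is easy to quantify: each time an array-backed buffer is resized, every existing byte is copied into the new allocation. A small sketch (hypothetical helper, not from the codebase) counting the bytes copied while doubling up to a target capacity:

```java
import java.util.Arrays;

// Counts bytes copied while growing a buffer by doubling to `target`.
class GrowthSketch {
    static long bytesCopiedGrowingTo(int target) {
        int capacity = 1;
        long copied = 0;
        byte[] buf = new byte[capacity];
        while (capacity < target) {
            capacity *= 2;
            copied += buf.length;             // copyOf duplicates the old contents
            buf = Arrays.copyOf(buf, capacity);
        }
        return copied;
    }
}
```

Growing from 1 byte to n bytes by doubling copies roughly n bytes in total, on top of the writes themselves; avoiding these reallocations is one of the opportunities this issue targets.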
Attachments
Issue Links
- duplicates
  - MAPREDUCE-427 Earlier key-value buffer from MapTask.java is still referenced even though its not required anymore. (Resolved)
- incorporates
  - HADOOP-872 map output sorter doesn't compress the outputs before the sort (Closed)
  - HADOOP-287 Speed up SequenceFile sort with memory reduction (Closed)
  - HADOOP-1609 Optimize MapTask.MapOutputBuffer.spill() by not deserialize/serialize keys/values but use appendRaw (Closed)
  - HADOOP-2054 Improve memory model for map-side sorts (Closed)
- is blocked by
  - HADOOP-2943 Compression for intermediate map output is broken (Closed)