Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
Description
Local map segments should never be handled as though they were remote (i.e., copied through a servlet to local disk). This optimization is uniformally more efficient for the fetch, though it increases the number of on-disk segments. Each segment has its own overhead, which can exceed the cost of pulling it into memory (e.g., the decompressor overhead for an active segment exceeds the cost of decompression into memory). Some logic is required to handle a large number of such segments.
Attachments
Issue Links
- is part of
-
MAPREDUCE-4584 Umbrella: Preemption and restart of MapReduce tasks
- Open