Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
Reviewed
-
Description
TableSnapshotInputFormat runs one map task per region in the table snapshot. This places unnecessary restriction that the region layout of the original table needs to take the processing resources available to MR job into consideration. Allowing to run multiple mappers per region (assuming reasonably even key distribution) would be useful.
Attachments
Attachments
Issue Links
- is related to
-
HBASE-19204 branch-1.2 times out and is taking 6-7 hours to complete
- Resolved
- relates to
-
HBASE-20769 getSplits() has a out of bounds problem in TableSnapshotInputFormatImpl
- Resolved
-
HBASE-16894 Create more than 1 split per region, generalize HBASE-12590
- Resolved
- links to