Details
- Type: Improvement
- Status: Open
- Priority: Major
- Resolution: Unresolved
- 3.2.0
- None
- None
Description
There are a number of outstanding distcp options related to extensibility, failure reporting and cleanup, long-haul options, and cloud performance.
Hadoop 3.1 added some speedups; follow this up with others.
Issue Links
- depends upon
  - HADOOP-8233 Turn CRC checking off for 0 byte size and differing blocksizes (Open)
  - HADOOP-13002 distcp behaves differently through code compared to toolrunner invocation from command-line (Open)
  - HADOOP-14631 Distcp should add a default atomicWorkPath properties when using atomic (Open)
  - HADOOP-15790 distcp app should fail if m/r job fails (Open)
  - HADOOP-12900 distcp -delete should show a counter of files deleted (Open)
  - HADOOP-14567 DistCP NullPointerException when -atomic is set but -tmp is not (Open)
  - HDFS-9455 In distcp, Invalid Argument Error thrown in case of filesystem operation failure (Open)
  - HADOOP-15300 distcp -update to WASB and ADL copies up all the files, always (Resolved)
  - HADOOP-14544 DistCp documentation for command line options is misaligned. (Resolved)
  - HADOOP-15789 DistCp does not clean staging folder if class extends DistCp (Resolved)
  - KNOX-482 Support DistCp via Knox (Open)
  - HDFS-11234 distcp performance is suboptimal for high bandwidth/high latency setups (Open)
  - HADOOP-11043 Use Path Instead of String in DistCp Tests (Open)
  - HADOOP-14086 Improve DistCp Speed for small files (Open)
  - MAPREDUCE-6840 Distcp to support cutoff time (Open)
  - HADOOP-15281 Distcp to add no-rename copy option (Resolved)
- relates to
  - HDFS-3889 distcp overwrites files even when there are missing checksums (Open)