Description
This JIRA is intended to capture discussion of proposed work to make Apache Hadoop run well on Windows. Apache Hadoop has worked on Microsoft Windows since its inception, but Windows support has never been a priority. Currently Windows works as a development and testing platform for Hadoop, but Hadoop is not natively integrated, full-featured, or tuned for performance and scalability on Windows Server or Windows Azure. We would like to change this and engage in a dialog with the broader community on the architectural design points for making Windows (enterprise and cloud) an excellent runtime and deployment environment for Hadoop.
The Isotope team at Microsoft (names below) has developed an Apache Hadoop 1.0 patch set that addresses these performance, integration and feature gaps, allowing Apache Hadoop to be used with Azure and Windows Server without recourse to emulation layers such as Cygwin. We have significant interest in deploying Hadoop across many multi-tenant, PaaS and IaaS environments, each of which brings its own requirements.
Microsoft has recently completed a CCLA with Apache and would like to contribute these enhancements back to the Apache Hadoop community.
In the interest of improving Apache Hadoop so that it runs more smoothly on all platforms, including Windows, we propose first contributing this work to the Apache community by attaching it to this JIRA. From there we would like to work with the community to refine the patch set until it is ready to be merged into the Apache trunk.
We welcome your feedback,
Alexander Stojanovic
Min Wei
David Lao
Lengning Liu
David Zhang
Asad Khan
Attachments
Issue Links
- incorporates
  - HADOOP-8374 Improve support for hard link manipulation on Windows (Resolved)
  - HADOOP-8409 Fix TestCommandLineJobSubmission and TestGenericOptionsParser to work for windows (Resolved)
  - HADOOP-8411 TestStorageDirecotyFailure, TestTaskLogsTruncater, TestWebHdfsUrl and TestSecurityUtil fail on Windows (Resolved)
  - HADOOP-8412 TestModTime, TestDelegationToken and TestAuthenticationToken fail intermittently on Windows (Resolved)
  - HADOOP-8414 Address problems related to localhost resolving to 127.0.0.1 on Windows (Resolved)
  - HADOOP-8421 Verify and fix build of c++ targets in Hadoop on Windows (Resolved)
  - HADOOP-8424 Web UI broken on Windows because classpath not setup correctly (Resolved)
  - HADOOP-8486 Resource leak - Close the open resource handles (File handles) before throwing the exception from the SequenceFile constructor (Resolved)
  - HADOOP-8534 Some tests leave a config file open causing failure on windows (Resolved)
  - HDFS-3424 TestDatanodeBlockScanner and TestReplication fail intermittently on Windows (Resolved)
  - MAPREDUCE-4201 Getting PID not working on Windows. Termination of Task/TaskJVM's not working (Resolved)
  - MAPREDUCE-4263 Use taskkill /T to terminate tasks on Windows (Resolved)
  - MAPREDUCE-4321 DefaultTaskController fails to launch tasks on Windows (Resolved)
  - MAPREDUCE-4368 TaskRunner fails to start jars when the java.library.path contains a quoted path with embedded spaces (Resolved)
  - MAPREDUCE-4369 Fix streaming job failures with WindowsResourceCalculatorPlugin (Resolved)
  - HADOOP-8454 Fix the ‘chmod =[perm]’ bug in winutils (Resolved)
  - HADOOP-8544 Move an assertion location in 'winutils chmod' (Resolved)
  - HADOOP-8440 HarFileSystem.decodeHarURI fails for URIs whose host contains numbers (Closed)
  - HADOOP-8101 Access Control support for Non-secure deployment of Hadoop on Windows (Resolved)
  - MAPREDUCE-3898 Hadoop for Windows - Interfacing with Windows to manage MR tasks (Resolved)
  - MAPREDUCE-4203 Create equivalent of ProcfsBasedProcessTree for Windows (Resolved)
  - MAPREDUCE-4260 Use JobObject to spawn tasks on Windows (Resolved)
  - HADOOP-8645 Stabilize branch-1-win (Resolved)
- is related to
  - HADOOP-8139 Path does not allow metachars to be escaped (Open)
  - MAPREDUCE-4322 Fix command-line length abort issues on Windows (Resolved)
  - HBASE-6814 [WINDOWS] HBase on Windows (Closed)
- relates to
  - HADOOP-8900 BuiltInGzipDecompressor throws IOException - stored gzip size doesn't match decompressed size (Closed)
  - HDFS-3564 Design enhancements to the pluggable blockplacementpolicy (Resolved)
  - HDFS-3566 Custom Replication Policy for Azure (Resolved)
  - YARN-240 Rename ProcessTree.isSetsidAvailable (Resolved)
1. | General Util Changes for Hadoop for Windows | Resolved | Unassigned |
2. | Hadoop-bin commands for windows | Resolved | Unassigned |
3. | Support for Azure Storage | Resolved | Unassigned |
4. | Initial patch for branch-1-win | Resolved | Unassigned |
5. | Investigate uses of FileUtil and functional correctness based on current use cases | Resolved | Bikas Saha |
6. | Windows build failing after 1.0.3 got merged into branch-1-win | Resolved | Bikas Saha |
7. | Add Windows build and unit test to test-patch pre-commit testing | Resolved | Unassigned |