Details
- Improvement
- Status: Closed
- Minor
- Resolution: Fixed
- 0.2.0
- None
- None
Description
The DFS Datanode can only store data on a single filesystem volume. When a node runs its disks as JBOD, this means running one Datanode per disk on the machine. While this scheme works reasonably well on small clusters, on larger installations (several hundred nodes) it implies a very large number of Datanodes, with the associated management overhead in the Namenode.
The Datanode should be enhanced to handle multiple volumes on a single machine.
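To illustrate the request, here is a minimal sketch of how a single Datanode process might spread new blocks across several volumes. It is not the implementation that was committed for this issue. The class name `RoundRobinVolumes`, its methods, the comma-separated directory list, and the round-robin policy are all assumptions made for illustration.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;

/**
 * Hypothetical helper (not part of Hadoop): holds one storage directory
 * per disk and hands them out in round-robin order for new blocks.
 */
public class RoundRobinVolumes {
    private final List<File> volumes = new ArrayList<>();
    private int next = 0;

    /**
     * @param dataDirs comma-separated directories, one per volume
     *                 (the format is an assumption of this sketch)
     */
    public RoundRobinVolumes(String dataDirs) {
        for (String dir : dataDirs.split(",")) {
            String trimmed = dir.trim();
            if (!trimmed.isEmpty()) {
                volumes.add(new File(trimmed));
            }
        }
        if (volumes.isEmpty()) {
            throw new IllegalArgumentException("no data directories given");
        }
    }

    /** Returns the directory that should receive the next block. */
    public synchronized File chooseVolume() {
        File chosen = volumes.get(next);
        next = (next + 1) % volumes.size();
        return chosen;
    }

    /** Number of volumes managed by this single process. */
    public int volumeCount() {
        return volumes.size();
    }
}
```

With something along these lines, a machine with several JBOD disks would register with the Namenode as one Datanode instead of one per disk. A real policy would probably also need to account for free space and failed disks, which this sketch ignores.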
Attachments
Issue Links
- duplicates
  - HADOOP-74 hash blocks into dfs.data.dirs (Closed)
  - HADOOP-257 starting one data node thread to manage multiple data directories (Closed)
- incorporates
  - HADOOP-50 dfs datanode should store blocks in multiple directories (Closed)