Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
Description
Currently all the volumes on datanode is reported as a single storage. This change proposes reporting them as individual storage. This requires:
- A unique storage ID for each storage
- This needs to be generated during formatting
- There should be an option to allow existing disks to be reported as single storage unit for backward compatibility.
- A functionality is also needed to split the existing all volumes as single storage unit to to individual storage units.
Configuration must allow for each storage unit a storage type attribute. (NowHDFS-5000)- Block reports must be sent on a per storage basis. In some cases (such memory tier) block reports may need to be sent more frequently. That means block reporting period must be on a per storage type basis.
My proposal is for new clusters to configure volumes by default as separate storage unit. Lets discuss.
Attachments
Attachments
Issue Links
- depends upon
-
HDFS-4985 Add storage type to the protocol and expose it in block report and block locations
- Resolved
-
HDFS-5000 DataNode configuration should allow specifying storage type
- Resolved
-
HDFS-5232 Protocol changes to transmit StorageUuid
- Resolved
-
HDFS-5233 Use Datanode UUID to identify Datanodes
- Resolved
- is depended upon by
-
HDFS-5153 Datanode should send block reports for each storage in a separate message
- Closed
- is duplicated by
-
HDFS-5155 Deprecate API of getStorageID() and setStorageID() in DatanodeID as multiple storages
- Resolved
- relates to
-
HDFS-5264 Consider renaming StorageID
- Resolved
-
HDFS-5377 Heartbeats from Datandode should include one storage report per storage directory
- Resolved
-
HDFS-5390 Send one incremental block report per storage directory
- Resolved