
DataNodes

DataNodes are the storage nodes in a distributed file system such as the Hadoop Distributed File System (HDFS). Each DataNode runs on a server with local storage and stores portions of the system's data blocks. DataNodes serve data to clients and to other DataNodes during replication, while the NameNode maintains the namespace and block mapping.

DataNodes store blocks on local disks and report storage status to the NameNode via heartbeats and block reports. The NameNode uses this information to determine block locations, enforce replication, and detect under-replication. DataNodes also perform local integrity checks with checksums and store blocks according to the configured replication factor.
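
As a concrete illustration, a client can read and change a file's target replication through the Hadoop FileSystem Java API. The sketch below is minimal; the path, class name, and target value are illustrative, not part of any particular deployment.

    // Minimal sketch: inspect and adjust a file's replication factor.
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class ReplicationExample {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();   // loads cluster configuration from the classpath
            FileSystem fs = FileSystem.get(conf);

            Path file = new Path("/data/example.txt");  // illustrative path
            FileStatus status = fs.getFileStatus(file);
            System.out.println("current replication: " + status.getReplication());

            // Ask the NameNode to change the target replication for this file;
            // the NameNode then schedules DataNodes to add or remove replicas.
            fs.setReplication(file, (short) 3);

            fs.close();
        }
    }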
During reads, clients obtain block locations from the NameNode and fetch data from the appropriate DataNodes.
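
A minimal sketch of that lookup using the same Java API: the client asks the NameNode for a file's block locations and prints the DataNodes holding each block. The path and class name are illustrative; a normal read via fs.open() performs this lookup internally.

    // Minimal sketch: list which DataNodes hold each block of a file.
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.BlockLocation;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class BlockLocationExample {
        public static void main(String[] args) throws Exception {
            FileSystem fs = FileSystem.get(new Configuration());
            Path file = new Path("/data/example.txt");  // illustrative path

            FileStatus status = fs.getFileStatus(file);
            // Ask the NameNode for the locations of every block in the file.
            BlockLocation[] blocks = fs.getFileBlockLocations(status, 0, status.getLen());

            for (BlockLocation block : blocks) {
                System.out.println("offset " + block.getOffset()
                        + ", length " + block.getLength()
                        + ", hosts " + String.join(",", block.getHosts()));
            }
            fs.close();
        }
    }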
Writes proceed through a pipeline of DataNodes coordinated by the NameNode; data is written to the first node and streamed to replicas to satisfy the replication policy.
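
A minimal write sketch with the same Java API; the output stream hides the pipeline, with the client library sending data to the first DataNode, which forwards it to the remaining replicas. The path, contents, and class name are illustrative.

    // Minimal sketch: write a small file; replication happens via the DataNode pipeline.
    import java.nio.charset.StandardCharsets;
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FSDataOutputStream;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class WriteExample {
        public static void main(String[] args) throws Exception {
            FileSystem fs = FileSystem.get(new Configuration());
            Path file = new Path("/data/pipeline-demo.txt");  // illustrative path

            try (FSDataOutputStream out = fs.create(file, true)) {  // overwrite if present
                out.write("hello, datanodes".getBytes(StandardCharsets.UTF_8));
            }
            fs.close();
        }
    }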
If a DataNode fails, the NameNode marks blocks as under-replicated and triggers replication to other DataNodes to preserve redundancy.
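
Re-replication is driven entirely by the NameNode, but a rough client-side view is possible: compare each block's live locations against the file's target replication. The sketch below assumes the same Java API; the path and class name are illustrative.

    // Rough, illustrative check for under-replicated blocks of a single file;
    // the NameNode performs this bookkeeping cluster-wide on its own.
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.BlockLocation;
    import org.apache.hadoop.fs.FileStatus;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    public class UnderReplicationCheck {
        public static void main(String[] args) throws Exception {
            FileSystem fs = FileSystem.get(new Configuration());
            Path file = new Path("/data/example.txt");  // illustrative path

            FileStatus status = fs.getFileStatus(file);
            short target = status.getReplication();

            for (BlockLocation block : fs.getFileBlockLocations(status, 0, status.getLen())) {
                int live = block.getHosts().length;  // DataNodes currently holding this block
                if (live < target) {
                    System.out.println("block at offset " + block.getOffset()
                            + " has " + live + "/" + target + " replicas");
                }
            }
            fs.close();
        }
    }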
Operational considerations include capacity planning, network bandwidth, and hardware reliability. DataNodes typically run on commodity hardware and may be deployed across racks to improve throughput and fault tolerance. They are a core component of HDFS, distinct from the NameNode, which handles metadata and namespace management.