For in-depth information on various Big Data technologies, check out my free e-book “Introduction to Big Data“.
Hadoop Distributed File System (HDFS) is a multi-machine file system that runs on top of machines’ local file system but appears as a single namespace, accessible through
hdfs:// URIs. It is designed to reliably store very large files across machines in a large cluster of inexpensive commodity hardware. HDFS closely follows the design of the Google File System (GFS). Continue reading