Daily Term: Hadoop
Hadoop
Apache Hadoop is an open-source framework for distributed storage and processing of large datasets across clusters of computers. It stores data in HDFS (the Hadoop Distributed File System), manages cluster resources with YARN, and processes data with MapReduce. For example, Hadoop might process petabytes of web logs to analyze user behavior. Hadoop excels at scalability and fault tolerance for batch processing, but MapReduce writes intermediate results to disk between stages, so it is typically slower than in-memory frameworks like Spark, and running it means operating a complex ecosystem of related tools.
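To make the MapReduce model concrete, here is a minimal sketch of the web-log example in plain Python. It does not use Hadoop or any Hadoop API; it only imitates the map, shuffle, and reduce steps in memory on one machine. The log format ("<user_id> <url>"), the sample data, and the function names are assumptions made up for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

# Hypothetical log format for illustration: "<user_id> <url>"
SAMPLE_LOGS = [
    "alice /home",
    "bob /home",
    "alice /cart",
    "alice /checkout",
    "bob /search",
]


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map step: emit (user_id, 1) for each well-formed log line."""
    parts = line.split()
    if len(parts) == 2:  # skip malformed lines instead of failing
        yield parts[0], 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle step: group all values by key, as the framework would
    do between the map and reduce stages."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce step: sum the counts collected for one user."""
    return key, sum(values)


def run_job(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence over the input lines."""
    mapped = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(mapped)
    return dict(reduce_phase(k, v) for k, v in sorted(grouped.items()))


if __name__ == "__main__":
    print(run_job(SAMPLE_LOGS))
```

On a real cluster, the map and reduce functions would run in parallel on many nodes over blocks of data stored in HDFS, and the framework would handle the shuffle and rerun failed tasks. The sketch only shows the shape of the computation.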
Date: 2025-12-19