Hadoop
- Hadoop is an open-source framework from the Apache Software Foundation for storing and processing huge data sets on a cluster of commodity hardware.
- Hadoop emphasizes moving code to the data rather than data to the code: the program is far smaller than the data set, so shipping it to the nodes that hold the data is cheaper than moving the data across the network.
- It is a platform that provides both distributed storage (HDFS) and distributed computation (MapReduce).
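
The MapReduce idea mentioned above can be sketched in plain Python as a word count. This is only an in-memory illustration and does not use the Hadoop API: the names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are hypothetical, and the `blocks` list merely stands in for file blocks that HDFS would spread across nodes. In a real cluster, the map function would run on the nodes that hold each block, which is the "move code to data" point.

```python
from collections import defaultdict


def map_phase(block):
    """Map step: emit a (word, 1) pair for every word in one block of text."""
    return [(word.lower(), 1) for word in block.split()]


def shuffle(mapped_pairs):
    """Shuffle step: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in mapped_pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(blocks):
    """Run map, shuffle, and reduce over a list of text blocks."""
    mapped = []
    for block in blocks:  # on a cluster, each block would be mapped where it is stored
        mapped.extend(map_phase(block))
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Two strings standing in for two blocks of a larger file (illustrative data only).
blocks = ["Hadoop stores data", "Hadoop processes data"]
counts = word_count(blocks)
print(counts)
```

Each map call needs only its own block, so the map work can run in parallel on separate machines; only the small (word, count) pairs have to travel over the network during the shuffle.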