Hadoop and HDFS in Big Data

Hadoop

  • Hadoop is an open-source framework from the Apache Software Foundation for storing and processing huge data sets on a cluster of commodity hardware.
  • Hadoop emphasizes moving code to the data rather than data to the code: the code is far smaller than the data, so it is much cheaper to send across the network.
  • It is a platform that provides both distributed storage (HDFS) and computational capabilities (MapReduce).
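
To make the last two points concrete, here is a minimal sketch in plain Python of a MapReduce-style word count. It does not use any Hadoop API: the `blocks` dictionary is a hypothetical stand-in for HDFS blocks stored on different nodes, and all function names are invented for illustration. The idea it tries to show is that the small map function runs where each block lives, and only the compact (word, 1) pairs travel onward to be reduced.

```python
from collections import defaultdict

# Hypothetical stand-in for HDFS blocks stored on three different nodes.
blocks = {
    "node1": ["big data needs big storage"],
    "node2": ["hadoop stores data in hdfs"],
    "node3": ["map reduce processes big data"],
}

def map_words(line):
    """Map step: emit a (word, 1) pair for every word in the line."""
    return [(word, 1) for word in line.split()]

def run_map_on_node(lines):
    """Simulates running the map code next to the data on one node."""
    pairs = []
    for line in lines:
        pairs.extend(map_words(line))
    return pairs

def shuffle(all_pairs):
    """Shuffle step: group the emitted counts by word."""
    grouped = defaultdict(list)
    for word, count in all_pairs:
        grouped[word].append(count)
    return grouped

def reduce_counts(grouped):
    """Reduce step: sum the counts collected for each word."""
    return {word: sum(counts) for word, counts in grouped.items()}

def word_count(blocks):
    """Send the map code to every node, then shuffle and reduce the results."""
    all_pairs = []
    for lines in blocks.values():
        all_pairs.extend(run_map_on_node(lines))
    return reduce_counts(shuffle(all_pairs))

counts = word_count(blocks)
print(counts["big"], counts["data"])  # prints: 3 3
```

In a real cluster the framework, not a loop like the one in `word_count`, decides which node runs each map task and handles the shuffle over the network; this sketch only mirrors the order of the steps.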
