Apache Hadoop

Apache Hadoop is an open-source software framework that provides a platform for storing and processing huge amounts of data that have to be analyzed, visualized and organized. Hadoop goes hand in hand with HDFS (Hadoop Distributed File System) and MapReduce, its two key components. HDFS handles the storage side: it splits files into blocks and spreads them across the machines of a cluster. MapReduce handles the computation: it processes those blocks in parallel and combines the partial results. Hadoop is written mainly in Java and runs on clusters of ordinary commodity hardware, which is one reason it is so widely used today. It was created by Doug Cutting and Mike Cafarella, who were inspired by the papers Google published on the Google File System and MapReduce, the systems Google had built to cope with the enormous volume of data behind its search engine. Because code is far smaller than the data it works on, Hadoop moves the code to the data rather than the data to the code: each task runs, where possible, on the node that already stores the block it needs, which reduces network traffic and speeds up processing.
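
To make the MapReduce idea more concrete, here is a minimal sketch in plain Python that counts words the way a MapReduce job would: a map step emits (word, 1) pairs, a shuffle step groups the pairs by word, and a reduce step adds up each group. This is not the Hadoop API. The function names (map_phase, shuffle_phase, reduce_phase, word_count) are made up for illustration, and everything runs on one machine, whereas Hadoop would run the map and reduce steps on many nodes at once.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one line of input."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Group emitted values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values collected for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence on a list of input lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical input: each string stands in for a block of a file in HDFS.
    blocks = ["Hadoop stores data in HDFS", "MapReduce processes data in HDFS"]
    print(word_count(blocks))
```

In a real cluster each call to map_phase would run on the node holding that block of data, which is the "move the code to the data" idea described above.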
