Performance Comparison of Hadoop Map Reduce and Apache Spark
Keywords:
Apache Spark, Big Data, Hadoop, HDFS, Map ReduceAbstract
With the advancement of the electronics and the communication technology information generation rate has
gain tremendous growth. Huge –Huge Amount of the data has been generated per hour from various medium of the
Internet of Thing (IoT), referred as big data, and is a trending term these days. Enormous Data has been the point of
interest for Computer Science devotee around the globe, and has increased considerably more noticeable quality over
the most recent couple of years. This paper discusses the comparison of Hadoop MapReduce and Apache Spark. Both
Hadoop and Spark are framework for analyzing big data. Although both of these resources are based on the idea of Big
Data, their performance varies significantly based on the application under consideration. In this paper two frameworks
are being compared along with providing the performance comparison using word count algorithm. In this paper,
various datasets has been analyzed over Hadoop MapReduce and Apache Spark environment for word count algorithm.