Performance Comparison of Hadoop Map Reduce and Apache Spark

Authors

  • Anju Parmar Department of Computer Science, HP University, Shimla
  • Vikrant Bhardwaj Department of Computer Science, HP University, Shimla
  • Divya Chauhan JRF, Department of Computer Science, HP University, Shimla
  • K L Bansal Professor, Department of Computer Science, HP University, Shimla

Keywords:

Apache Spark, Big Data, Hadoop, HDFS, Map Reduce

Abstract

With the advancement of the electronics and the communication technology information generation rate has
gain tremendous growth. Huge –Huge Amount of the data has been generated per hour from various medium of the
Internet of Thing (IoT), referred as big data, and is a trending term these days. Enormous Data has been the point of
interest for Computer Science devotee around the globe, and has increased considerably more noticeable quality over
the most recent couple of years. This paper discusses the comparison of Hadoop MapReduce and Apache Spark. Both
Hadoop and Spark are framework for analyzing big data. Although both of these resources are based on the idea of Big
Data, their performance varies significantly based on the application under consideration. In this paper two frameworks
are being compared along with providing the performance comparison using word count algorithm. In this paper,
various datasets has been analyzed over Hadoop MapReduce and Apache Spark environment for word count algorithm.

Published

2018-03-25

How to Cite

Anju Parmar, Vikrant Bhardwaj, Divya Chauhan, & K L Bansal. (2018). Performance Comparison of Hadoop Map Reduce and Apache Spark. International Journal of Advance Engineering and Research Development (IJAERD), 5(3), 1323–1328. Retrieved from https://ijaerd.org/index.php/IJAERD/article/view/2919