Performance Comparison of Hadoop Map Reduce and Apache Spark

Anju Parmar; Vikrant Bhardwaj; Divya Chauhan; K L Bansal

Authors

Anju Parmar Department of Computer Science, HP University, Shimla
Vikrant Bhardwaj Department of Computer Science, HP University, Shimla
Divya Chauhan JRF, Department of Computer Science, HP University, Shimla
K L Bansal Professor, Department of Computer Science, HP University, Shimla

Keywords:

Apache Spark, Big Data, Hadoop, HDFS, Map Reduce

Abstract

With the advancement of the electronics and the communication technology information generation rate has
gain tremendous growth. Huge –Huge Amount of the data has been generated per hour from various medium of the
Internet of Thing (IoT), referred as big data, and is a trending term these days. Enormous Data has been the point of
interest for Computer Science devotee around the globe, and has increased considerably more noticeable quality over
the most recent couple of years. This paper discusses the comparison of Hadoop MapReduce and Apache Spark. Both
Hadoop and Spark are framework for analyzing big data. Although both of these resources are based on the idea of Big
Data, their performance varies significantly based on the application under consideration. In this paper two frameworks
are being compared along with providing the performance comparison using word count algorithm. In this paper,
various datasets has been analyzed over Hadoop MapReduce and Apache Spark environment for word count algorithm.

Performance Comparison of Hadoop Map Reduce and Apache Spark

Authors

Keywords:

Abstract

Downloads

Published

Issue

Section

How to Cite

Similar Articles

Most read articles by the same author(s)

Make a Submission

downloads

Imp links

google

Latest publications

Information