Network Level Security In Hadoop Using WireLine Encryption
Keywords:
Big Data, Hadoop, HDFS, MapReduce, NameNode, DataNode.
Abstract
Hadoop is an open-source framework and distributed system that provides a distributed file system (HDFS)
and MapReduce. Hadoop is used on private clusters to store sensitive data; as a result, strong authentication
and authorization are necessary to protect private data. Adding security to Hadoop is challenging because not all
interactions follow the classic client-server architecture. To address these security challenges, the Kerberos
authentication mechanism is supplemented by delegation and capability-like access tokens. A large number of companies
have begun using open-source technologies such as the Apache Hadoop Distributed File System to store and analyze large
volumes of structured and unstructured data captured from social media networks, websites, etc. There are
several ways a user can access the data on Hadoop clusters. The goal is to explore some advanced methodologies that
deal with security and privacy concerns for people who outsource data to Hadoop clusters.