Two Level Clustering Using Hadoop Map Reduce Framework
Keywords:
Hadoop, MapReduce, K-Means Clustering, Two LevelAbstract
In the field of information mining, clustering is one of the vital techniques. K-Means is an ordinary
separation based clustering calculation; 2-level clustering should actualize adaptable clustering by methods for
partitioning, inspecting and information coordinating. Among those apparatuses of disseminated handling, Map-Reduce
has been generally grasped by both scholarly world and industry. Hadoop is an open-source parallel and conveyed
programming system for the usage of Map-Reduce figuring model. With the investigation of the Map-Reduce worldview
of figuring, we find that Hadoop parallel and disseminated registering model is proper for the usage of adaptable
clustering calculation. This paper takes focal points of K-Means, 2-level clustering component and Map-Reduce
registering model; proposes another technique for parallel and circulated clustering to investigate dispersed clustering
issue in view of Map-Reduce. The strategy intends to apply the clustering calculation successfully to the disseminated
condition. The broad investigations show that the proposed calculation is adaptable, and the time execution is steady.