An Efficient Clustering Algorithm with Enhanced MapReduce Design based Modified K Means for Outlier

https://doi.org/10.22214/ijraset.2020.31679

September 2020

Turn static files into dynamic content formats.

An Efficient Clustering Algorithm with Enhanced MapReduce Design based Modified K Means for Outlier

Published on Sep 29, 2020

With an increase in amount of data also the speed of access to information has increased. Large data is the distorted form of every data well gained from different sources such as photos, , videos, social media sharing, network blogs, log files, etc. into a consequential and feasible forms. Clustering methods are very useful. Clustering process permits very analogous data to be positioned under a group by unscrambling the data into a explicit group. Once datasets are separated, outlier detection is used to discover false data. In this research work, goal is to formulate data clustering and outlier detection process quicker by using MapReduce technology with modified K-means clustering method. Clustering on enormous data can be time overriding. Hence, MapReduce computing design is utilized and focused on reliable, unfailing and swift clustering process by this technology. The successful execution with comparison to traditional K means is drawn. The results are offered in tables and graphs using sample dataset.

Issuu converts static files into:

and more. Sign up and create your flipbook.