A platform for research: civil engineering, architecture and urbanism
MapReduce-based Fuzzy C-means Algorithm for Distributed Document Clustering
The clustering of big data is a challenging task. The traditional clustering algorithms are inefficient for clustering big data. The recent researches in this field suggest that the traditional clustering algorithms needed to be redesigned for the modern architecture of computing. This wok has proposed a novel MapReduce-based fuzzy C-means algorithm for big document data clustering. The algorithm is extensively experimented with using different sizes of document datasets and executed over the Hadoop cluster of different sizes. The proposed algorithm’s efficiency is compared against serial traditional fuzzy C-means and MapReduce-based K-means algorithms. The proposed design of the fuzzy C-means algorithm is scaled well with the Hadoop platform and documents big datasets and resulted in a performance gain.
MapReduce-based Fuzzy C-means Algorithm for Distributed Document Clustering
The clustering of big data is a challenging task. The traditional clustering algorithms are inefficient for clustering big data. The recent researches in this field suggest that the traditional clustering algorithms needed to be redesigned for the modern architecture of computing. This wok has proposed a novel MapReduce-based fuzzy C-means algorithm for big document data clustering. The algorithm is extensively experimented with using different sizes of document datasets and executed over the Hadoop cluster of different sizes. The proposed algorithm’s efficiency is compared against serial traditional fuzzy C-means and MapReduce-based K-means algorithms. The proposed design of the fuzzy C-means algorithm is scaled well with the Hadoop platform and documents big datasets and resulted in a performance gain.
MapReduce-based Fuzzy C-means Algorithm for Distributed Document Clustering
J. Inst. Eng. India Ser. B
Sardar, Tanvir H. (author) / Ansari, Zahid (author)
Journal of The Institution of Engineers (India): Series B ; 103 ; 131-142
2022-02-01
12 pages
Article (Journal)
Electronic Resource
English
An Analysis of Distributed Document Clustering Using MapReduce Based K-Means Algorithm
Springer Verlag | 2020
|Distributed Big Data Clustering using MapReduce-based Fuzzy C-Medoids
Springer Verlag | 2022
|Data Categorization Using Hadoop MapReduce-Based Parallel K-Means Clustering
Springer Verlag | 2019
|Soil clustering by fuzzy c-means algorithm
Tema Archive | 2005
|Hausdorff Distance Measure Based Interval Fuzzy Possibilistic C-Means Clustering Algorithm
British Library Online Contents | 2013
|