A platform for research: civil engineering, architecture and urbanism
Distributed Big Data Clustering using MapReduce-based Fuzzy C-Medoids
Efficient big data clustering is a requirement for massive data generating in this digitalized connected world. The traditional clustering algorithms do not scale over massively sized and highly unstructured big data. Thus, to obtain efficiency in clustering big data new architecture and programming paradigm is required. In this work, a novel MapReduce-based Fuzzy C-Medoids clustering algorithm is designed and experimented with to cluster big data repository of documents datasets. The performance of the proposed algorithm is experimentally evaluated for different-sized Hadoop cluster sizes and different-sized document datasets. The algorithm is found to be scalable and efficient in performing clustering jobs.
Distributed Big Data Clustering using MapReduce-based Fuzzy C-Medoids
Efficient big data clustering is a requirement for massive data generating in this digitalized connected world. The traditional clustering algorithms do not scale over massively sized and highly unstructured big data. Thus, to obtain efficiency in clustering big data new architecture and programming paradigm is required. In this work, a novel MapReduce-based Fuzzy C-Medoids clustering algorithm is designed and experimented with to cluster big data repository of documents datasets. The performance of the proposed algorithm is experimentally evaluated for different-sized Hadoop cluster sizes and different-sized document datasets. The algorithm is found to be scalable and efficient in performing clustering jobs.
Distributed Big Data Clustering using MapReduce-based Fuzzy C-Medoids
J. Inst. Eng. India Ser. B
Sardar, Tanvir H. (author) / Ansari, Zahid (author)
2022-02-01
10 pages
Article (Journal)
Electronic Resource
English
MapReduce-based Fuzzy C-means Algorithm for Distributed Document Clustering
Springer Verlag | 2022
|Shape Clustering Using K-Medoids in Architectural Form Finding
British Library Conference Proceedings | 2019
|An Analysis of Distributed Document Clustering Using MapReduce Based K-Means Algorithm
Springer Verlag | 2020
|Data Categorization Using Hadoop MapReduce-Based Parallel K-Means Clustering
Springer Verlag | 2019
|