Normal view MARC view ISBD view

Mapreduce-based fuzzy C-means algorithm for distributed document clustering

By: Sardar, Tanvir H.
Contributor(s): Ansari, Zahid.
Publisher: New York Springer 2022Edition: Vol.103(1), Feb.Description: 131-142p.Subject(s): Humanities and Applied SciencesOnline resources: Click here In: Journal of the institution of engineers (India): Series BSummary: The clustering of big data is a challenging task. The traditional clustering algorithms are inefficient for clustering big data. The recent researches in this field suggest that the traditional clustering algorithms needed to be redesigned for the modern architecture of computing. This wok has proposed a novel MapReduce-based fuzzy C-means algorithm for big document data clustering. The algorithm is extensively experimented with using different sizes of document datasets and executed over the Hadoop cluster of different sizes. The proposed algorithm’s efficiency is compared against serial traditional fuzzy C-means and MapReduce-based K-means algorithms. The proposed design of the fuzzy C-means algorithm is scaled well with the Hadoop platform and documents big datasets and resulted in a performance gain.
Tags from this library: No tags from this library for this title. Log in to add tags.
    average rating: 0.0 (0 votes)
Item type Current location Call number Status Date due Barcode Item holds
Articles Abstract Database Articles Abstract Database School of Engineering & Technology
Archieval Section
Not for loan 2022-1579
Total holds: 0

The clustering of big data is a challenging task. The traditional clustering algorithms are inefficient for clustering big data. The recent researches in this field suggest that the traditional clustering algorithms needed to be redesigned for the modern architecture of computing. This wok has proposed a novel MapReduce-based fuzzy C-means algorithm for big document data clustering. The algorithm is extensively experimented with using different sizes of document datasets and executed over the Hadoop cluster of different sizes. The proposed algorithm’s efficiency is compared against serial traditional fuzzy C-means and MapReduce-based K-means algorithms. The proposed design of the fuzzy C-means algorithm is scaled well with the Hadoop platform and documents big datasets and resulted in a performance gain.

There are no comments for this item.

Log in to your account to post a comment.

Click on an image to view it in the image viewer

Unique Visitors hit counter Total Page Views free counter
Implemented and Maintained by AIKTC-KRRC (Central Library).
For any Suggestions/Query Contact to library or Email: librarian@aiktc.ac.in | Ph:+91 22 27481247
Website/OPAC best viewed in Mozilla Browser in 1366X768 Resolution.

Powered by Koha