Open Access

ARTICLE

Fuzzy C-Means Algorithm Based on Density Canopy and Manifold Learning

Jili Chen1,2, Hailan Wang2, Xiaolan Xie1,2,*
1 Guangxi Key Laboratory of Embedded Technology and Intelligent System, Guilin, 541006, China
2 College of Information Science and Engineering, Guilin University of Technology, Guilin, 541004, China
* Corresponding Author: Xiaolan Xie.

Computer Systems Science and Engineering https://doi.org/10.32604/csse.2023.037957

Received 22 November 2022; Accepted 17 February 2023; Published online 06 March 2024

Abstract

Fuzzy C-Means (FCM) is an effective and widely used clustering algorithm, but it still has several problems: the number of clusters must be determined manually, the random selection of initial cluster centers easily leads to locally optimal solutions, and the Euclidean distance performs poorly on complex high-dimensional data. To solve these problems, an improved FCM clustering algorithm based on density Canopy and manifold learning (DM-FCM) is proposed. First, a density Canopy algorithm based on improved local density is proposed to automatically determine the number of clusters and the initial cluster centers, which improves the self-adaptability and stability of the algorithm. Then, considering that high-dimensional data often present a nonlinear structure, a manifold learning method is applied to construct a manifold spatial structure, which preserves the global geometric properties of complex high-dimensional data and improves the clustering effect of the algorithm on complex high-dimensional datasets. The Fowlkes-Mallows Index (FMI), the weighted average of homogeneity and completeness (V-measure), Adjusted Mutual Information (AMI), and the Adjusted Rand Index (ARI) are used as performance measures of the clustering algorithms. The experimental results show that the distance obtained through manifold learning is the superior distance measure, and that the algorithm improves clustering accuracy and performs well on both low-dimensional and complex high-dimensional data.
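
For orientation, the baseline that the abstract refers to can be sketched as follows. This is a minimal illustration of standard FCM with Euclidean distance, not the proposed DM-FCM: it contains no density Canopy step and no manifold distance. The function name `fcm`, its parameters (`points`, `centers`, `m`, `max_iter`, `tol`), and the default values are assumptions made for this sketch. The initial centers are passed in by the caller, which is exactly the input that standard FCM leaves to manual or random choice.

```python
import math


def fcm(points, centers, m=2.0, max_iter=100, tol=1e-6):
    """Standard Fuzzy C-Means with Euclidean distance (illustrative sketch).

    points  : list of equal-length tuples of floats
    centers : initial cluster centers, supplied by the caller
    m       : fuzzifier, must be greater than 1
    Returns (centers, memberships); memberships[i][j] is the degree to
    which point i belongs to cluster j, computed in the last iteration.
    """
    centers = [tuple(c) for c in centers]
    dim = len(points[0])
    exponent = 2.0 / (m - 1.0)
    memberships = []
    for _ in range(max_iter):
        # Membership update: u_ij = 1 / sum_k (d_ij / d_ik)^(2 / (m - 1))
        memberships = []
        for p in points:
            dists = [math.dist(p, c) for c in centers]
            if min(dists) == 0.0:
                # The point coincides with a center: assign it fully there.
                zero = dists.index(0.0)
                row = [1.0 if j == zero else 0.0 for j in range(len(centers))]
            else:
                row = [1.0 / sum((dj / dk) ** exponent for dk in dists)
                       for dj in dists]
            memberships.append(row)
        # Center update: v_j = sum_i u_ij^m x_i / sum_i u_ij^m
        new_centers = []
        for j, old in enumerate(centers):
            weights = [row[j] ** m for row in memberships]
            total = sum(weights)
            if total == 0.0:
                # No point has any membership in this cluster: keep the center.
                new_centers.append(old)
                continue
            new_centers.append(tuple(
                sum(w * p[d] for w, p in zip(weights, points)) / total
                for d in range(dim)
            ))
        # Stop once no center moves by more than tol.
        shift = max(math.dist(a, b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    return centers, memberships
```

Calling `fcm(points, initial_centers)` on a small list of 2-D tuples returns the final centers and one membership row per point, with each row summing to 1. Because both the number of clusters and the starting centers are inputs here, a different choice of `centers` can converge to a different local optimum, which is the sensitivity the density Canopy step is intended to remove.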

Keywords

Fuzzy C-Means (FCM); cluster center; density canopy; ISOMAP; clustering