Open Access
ARTICLE
An Active Safe Semi-Supervised Fuzzy Clustering with Pairwise Constraints Based on Cluster Boundary
1 Graduate University of Science and Technology, Vietnam Academy of Science and Technology, Hanoi, 100000, Vietnam
2 FPT Software Company Limited, Hanoi, 100000, Vietnam
3 Institute of Information Technology, Vietnam Academy of Science and Technology, Hanoi, 100000, Vietnam
4 Faculty of Information Technology, Electric Power University, Hanoi, 100000, Vietnam
5 School of Information and Communications Technology, Hanoi University of Industry, Hanoi, 100000, Vietnam
* Corresponding Author: Luong Thi Hong Lan. Email:
Computers, Materials & Continua 2025, 85(3), 5625-5642. https://doi.org/10.32604/cmc.2025.069636
Received 27 June 2025; Accepted 28 August 2025; Issue published 23 October 2025
Abstract
Semi-supervised clustering techniques attempt to improve clustering accuracy by utilizing a limited number of labeled data for guidance. This method effectively integrates prior knowledge using pre-labeled data. While semi-supervised fuzzy clustering (SSFC) methods leverage limited labeled data to enhance accuracy, they remain highly susceptible to inappropriate or mislabeled prior knowledge, especially in noisy or overlapping datasets where cluster boundaries are ambiguous. To enhance the effectiveness of clustering algorithms, it is essential to leverage labeled data while ensuring the safety of the previous knowledge. Existing solutions, such as the Trusted Safe Semi-Supervised Fuzzy Clustering Method (TS3FCM), struggle with random centroid initialization, fixed neighbor radius formulas, and handling outliers or noise at cluster overlaps. A new framework called Active Safe Semi-Supervised Fuzzy Clustering with Pairwise Constraints Based on Cluster Boundary (AS3FCPC) is proposed in this paper to deal with these problems. It does this by combining pairwise constraints and active learning. AS3FCPC uses active learning to query only the most informative data instances close to the cluster boundaries. It also uses pairwise constraints to enforce the cluster structure, which makes the system more accurate and robust. Extensive test results on diverse datasets, including challenging noisy and overlapping scenarios, demonstrate that AS3FCPC consistently achieves superior performance compared to state-of-the-art methods like TS3FCM and other baselines, especially when the data is noisy and overlaps. This significant improvement underscores AS3FCPC’s potential for reliable and accurate semi-supervised fuzzy clustering in complex, real-world applications, particularly by effectively managing mislabeled data and ambiguous cluster boundaries.Keywords
Cite This Article
Copyright © 2025 The Author(s). Published by Tech Science Press.This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


Submit a Paper
Propose a Special lssue
View Full Text
Download PDF
Downloads
Citation Tools