Open Access iconOpen Access

ARTICLE

crossmark

kProtoClust: Towards Adaptive k-Prototype Clustering without Known k

Yuan Ping1,2,*, Huina Li1, Chun Guo3, Bin Hao4

1 School of Information Engineering, Xuchang University, Xuchang, 461000, China
2 Henan Province Engineering Technology Research Center of Big Data Security and Applications, Xuchang, 461000, China
3 College of Computer Science and Technology, Guizhou University, Guiyang, 550025, China
4 Here Data Technology, Shenzhen, 518000, China

* Corresponding Author: Yuan Ping. Email: email

Computers, Materials & Continua 2025, 82(3), 4949-4976. https://doi.org/10.32604/cmc.2025.057693

Abstract

Towards optimal k-prototype discovery, k-means-like algorithms give us inspirations of central samples collection, yet the unstable seed samples selection, the hypothesis of a circle-like pattern, and the unknown K are still challenges, particularly for non-predetermined data patterns. We propose an adaptive k-prototype clustering method (kProtoClust) which launches cluster exploration with a sketchy division of K clusters and finds evidence for splitting and merging. On behalf of a group of data samples, support vectors and outliers from the perspective of support vector data description are not the appropriate candidates for prototypes, while inner samples become the first candidates for instability reduction of seeds. Different from the representation of samples in traditional, we extend sample selection by encouraging fictitious samples to emphasize the representativeness of patterns. To get out of the circle-like pattern limitation, we introduce a convex decomposition-based strategy of one-cluster-multiple-prototypes in which convex hulls of varying sizes are prototypes, and accurate connection analysis makes the support of arbitrary cluster shapes possible. Inspired by geometry, the three presented strategies make kProtoClust bypassing the K dependence well with the global and local position relationship analysis for data samples. Experimental results on twelve datasets of irregular cluster shape or high dimension suggest that kProtoClust handles arbitrary cluster shapes with prominent accuracy even without the prior knowledge K.

Keywords

Prototype finding; k-means++; convex hull; support vector data description; geometrical information

Cite This Article

APA Style
Ping, Y., Li, H., Guo, C., Hao, B. (2025). kprotoclust: towards adaptive k-prototype clustering without known k. Computers, Materials & Continua, 82(3), 4949–4976. https://doi.org/10.32604/cmc.2025.057693
Vancouver Style
Ping Y, Li H, Guo C, Hao B. kprotoclust: towards adaptive k-prototype clustering without known k. Comput Mater Contin. 2025;82(3):4949–4976. https://doi.org/10.32604/cmc.2025.057693
IEEE Style
Y. Ping, H. Li, C. Guo, and B. Hao, “kProtoClust: Towards Adaptive k-Prototype Clustering without Known k,” Comput. Mater. Contin., vol. 82, no. 3, pp. 4949–4976, 2025. https://doi.org/10.32604/cmc.2025.057693



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 311

    View

  • 164

    Download

  • 0

    Like

Share Link