Open Access

ARTICLE

Enhanced Kinship Verification through Ear Images: A Comparative Study of CNNs, Attention Mechanisms, and MLP Mixer Models

Thien-Tan Cao, Huu-Thanh Duong, Viet-Tuan Le, Hau Nguyen Trung, Vinh Truong Hoang, Kiet Tran-Trung*

Faculty of Information Technology, Ho Chi Minh City Open University, Ho Chi Minh, 722000, Vietnam

* Corresponding Author: Kiet Tran-Trung

(This article belongs to the Special Issue: Novel Methods for Image Classification, Object Detection, and Segmentation)

Computers, Materials & Continua 2025, 83(3), 4373-4391. https://doi.org/10.32604/cmc.2025.061583

Abstract

Kinship verification is a key biometric recognition task that determines whether two people are biologically related based on physical features. Traditional methods rely predominantly on facial images, leveraging established techniques and extensive datasets. However, recent research has highlighted ear recognition as a promising alternative that is more robust to variations in facial expression, aging, and occlusion. Despite this potential, ear-based kinship verification is held back by the lack of large-scale datasets needed to train deep learning models effectively. To address this challenge, we introduce EarKinshipVN, a novel and diverse dataset of ear images designed specifically for kinship verification. It consists of 4876 high-resolution color images from 157 multiracial families across different regions, forming 73,220 kinship pairs. Furthermore, we propose the Mixer Attention Inception (MAI) model, an architecture that improves feature extraction and classification accuracy. MAI fuses Inceptionv4 and MLP Mixer and integrates four attention mechanisms to enhance spatial and channel-wise feature representation. Experimental results show that MAI significantly outperforms traditional backbone architectures: it achieves an accuracy of 98.71%, surpassing Vision Transformer models while reducing the parameter count by up to 95%. These findings suggest that ear-based kinship verification, combined with an optimized deep learning model and a comprehensive dataset, holds significant promise for biometric applications.
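
For readers unfamiliar with the MLP Mixer component mentioned in the abstract, the following is a minimal NumPy sketch of one generic Mixer block (token mixing followed by channel mixing, each with a residual connection). It is not the authors' MAI implementation: the function names, layer sizes, and random initialization are illustrative assumptions, and the Inceptionv4 backbone and the four attention mechanisms are omitted.

```python
import numpy as np


def layer_norm(x, eps=1e-6):
    # Normalize each row over its last axis (no learned scale/shift in this sketch).
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)


def gelu(x):
    # Tanh approximation of the GELU activation.
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))


def mlp(x, w1, b1, w2, b2):
    # Two-layer perceptron applied along the last axis of x.
    return gelu(x @ w1 + b1) @ w2 + b2


def init_mixer_block(num_tokens, num_channels, token_hidden, channel_hidden, rng):
    # Hypothetical helper: small random weights, zero biases.
    def dense(n_in, n_out):
        return rng.normal(0.0, 0.02, size=(n_in, n_out)), np.zeros(n_out)

    tw1, tb1 = dense(num_tokens, token_hidden)
    tw2, tb2 = dense(token_hidden, num_tokens)
    cw1, cb1 = dense(num_channels, channel_hidden)
    cw2, cb2 = dense(channel_hidden, num_channels)
    return {"token": (tw1, tb1, tw2, tb2), "channel": (cw1, cb1, cw2, cb2)}


def mixer_block(x, params):
    # x has shape (num_tokens, num_channels), e.g. image patches by feature channels.
    # Token mixing: transpose so the MLP acts across tokens for each channel.
    y = mlp(layer_norm(x).T, *params["token"]).T
    x = x + y
    # Channel mixing: the MLP acts across channels for each token.
    x = x + mlp(layer_norm(x), *params["channel"])
    return x


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    params = init_mixer_block(num_tokens=16, num_channels=8,
                              token_hidden=32, channel_hidden=24, rng=rng)
    patches = rng.normal(size=(16, 8))  # stand-in for patch embeddings
    print(mixer_block(patches, params).shape)
```

The block preserves the input shape, so several such blocks can be stacked; how MAI actually connects Mixer layers to Inceptionv4 features is described in the full article, not here.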

Keywords

Biometric analytics; ear kin; Inceptionv4; kinship verification; kin; ear images

Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.