Open Access iconOpen Access

ARTICLE

Comprehensive Analysis of Gender Classification Accuracy across Varied Geographic Regions through the Application of Deep Learning Algorithms to Speech Signals

Abhishek Singhal*, Devendra Kumar Sharma

Department of Electronics and Communication Engineering, Faculty of Engineering and Technology, SRM Institute of Science and Technology, Delhi–NCR Campus, Ghaziabad, 201204, India

* Corresponding Author: Abhishek Singhal. Email: email

Computer Systems Science and Engineering 2024, 48(3), 609-625. https://doi.org/10.32604/csse.2023.046730

Abstract

This article presents an exhaustive comparative investigation into the accuracy of gender identification across diverse geographical regions, employing a deep learning classification algorithm for speech signal analysis. In this study, speech samples are categorized for both training and testing purposes based on their geographical origin. Category 1 comprises speech samples from speakers outside of India, whereas Category 2 comprises live-recorded speech samples from Indian speakers. Testing speech samples are likewise classified into four distinct sets, taking into consideration both geographical origin and the language spoken by the speakers. Significantly, the results indicate a noticeable difference in gender identification accuracy among speakers from different geographical areas. Indian speakers, utilizing 52 Hindi and 26 English phonemes in their speech, demonstrate a notably higher gender identification accuracy of 85.75% compared to those speakers who predominantly use 26 English phonemes in their conversations when the system is trained using speech samples from Indian speakers. The gender identification accuracy of the proposed model reaches 83.20% when the system is trained using speech samples from speakers outside of India. In the analysis of speech signals, Mel Frequency Cepstral Coefficients (MFCCs) serve as relevant features for the speech data. The deep learning classification algorithm utilized in this research is based on a Bidirectional Long Short-Term Memory (BiLSTM) architecture within a Recurrent Neural Network (RNN) model.

Keywords


Cite This Article

APA Style
Singhal, A., Sharma, D.K. (2024). Comprehensive analysis of gender classification accuracy across varied geographic regions through the application of deep learning algorithms to speech signals. Computer Systems Science and Engineering, 48(3), 609-625. https://doi.org/10.32604/csse.2023.046730
Vancouver Style
Singhal A, Sharma DK. Comprehensive analysis of gender classification accuracy across varied geographic regions through the application of deep learning algorithms to speech signals. Comput Syst Sci Eng. 2024;48(3):609-625 https://doi.org/10.32604/csse.2023.046730
IEEE Style
A. Singhal and D.K. Sharma, "Comprehensive Analysis of Gender Classification Accuracy across Varied Geographic Regions through the Application of Deep Learning Algorithms to Speech Signals," Comput. Syst. Sci. Eng., vol. 48, no. 3, pp. 609-625. 2024. https://doi.org/10.32604/csse.2023.046730



cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 498

    View

  • 204

    Download

  • 0

    Like

Share Link