Open Access iconOpen Access

ARTICLE

crossmark

Speech Intelligibility Enhancement Algorithm Based on Multi-Resolution Power-Normalized Cepstral Coefficients (MRPNCC) for Digital Hearing Aids

Xia Wang1, Xing Deng2,3, Hongming Shen1,*, Guodong Zhang1, Shibing Zhang1

1 School of Information Science and Technology, Nantong University, Nantong, 226019, China
2 School of Automation, Key Laboratory of Measurement and Control for CSE, Ministry of Education, Southeast University, Nanjing, 210096, China
3 School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang, 212003, China

* Corresponding Author: Hongming Shen. Email: email

Computer Modeling in Engineering & Sciences 2021, 126(2), 693-710. https://doi.org/10.32604/cmes.2021.013186

Abstract

Speech intelligibility enhancement in noisy environments is still one of the major challenges for hearing impaired in everyday life. Recently, Machine-learning based approaches to speech enhancement have shown great promise for improving speech intelligibility. Two key issues of these approaches are acoustic features extracted from noisy signals and classifiers used for supervised learning. In this paper, features are focused. Multi-resolution power-normalized cepstral coefficients (MRPNCC) are proposed as a new feature to enhance the speech intelligibility for hearing impaired. The new feature is constructed by combining four cepstrum at different time–frequency (T–F) resolutions in order to capture both the local and contextual information. MRPNCC vectors and binary masking labels calculated by signals passed through gammatone filterbank are used to train support vector machine (SVM) classifier, which aim to identify the binary masking values of the T–F units in the enhancement stage. The enhanced speech is synthesized by using the estimated masking values and wiener filtered T–F unit. Objective experimental results demonstrate that the proposed feature is superior to other comparing features in terms of HIT-FA, STOI, HASPI and PESQ, and that the proposed algorithm not only improves speech intelligibility but also improves speech quality slightly. Subjective tests validate the effectiveness of the proposed algorithm for hearing impaired.

Keywords


Cite This Article

Wang, X., Deng, X., Shen, H., Zhang, G., Zhang, S. (2021). Speech Intelligibility Enhancement Algorithm Based on Multi-Resolution Power-Normalized Cepstral Coefficients (MRPNCC) for Digital Hearing Aids. CMES-Computer Modeling in Engineering & Sciences, 126(2), 693–710.



cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 1781

    View

  • 1076

    Download

  • 0

    Like

Share Link