Open Access iconOpen Access

ARTICLE

crossmark

Speech Quality Enhancement Using Phoneme with Cepstrum Variation Features

K. C. Rajeswari1,*, R. S. Mohana2, S. Manikandan3, S. Beski Prabaharan4

1 Department of Computer Science and Engineering, Sona College of Technology, Salem, 636005, Tamil Nadu, India
2 Department of Computer Science and Engineering, Kongu Engineering College, Perundurai, 638060, Tamil Nadu, India
3 Department of Information Technology, E.G.S. Pillay Engineering College, Nagapattinam, 611002, Tamil Nadu, India
4 Department of Computer Science and Engineering, Chitkara University Institute of Engineering and Technology, Rajpura, 140401, Punjab, India

* Corresponding Author: K. C. Rajeswari. Email: email

Intelligent Automation & Soft Computing 2022, 34(1), 65-86. https://doi.org/10.32604/iasc.2022.022681

Abstract

In recent years, Text-to-Speech (TTS) synthesis is taking a new dimension. People prefer voice embedded toys, online buyers are interested in interactive chat application in the form of text-to-speech facility, screen readers for visually challenged people, and many more applications use TTS module. TTSis a system that is capable of converting the arbitrary text input into natural sounding speech. It’s success lies in producing more human like speech sounding more natural. The most importanttechnical aspect of TTS is feature extraction process. Both text and speech features are needed but it is not that easy to select meaningful and useful features from the text or from speech. There are many feature extraction techniques available for both text and speech, still there is a need for very simplest form of feature extraction technique. Though the emergence of Deep learning technique automates feature extraction, it is suitable only when the volume of data is enormous. This paper proposes a novel text and speech feature extraction technique which is based on special symbols present in the text and phoneme with cepstrum variation of the speech signal respectively. These techniques are simple and works well for real-time applications in which size of data is small or moderate. The proposed methods not only extract useful features but also meaningful features in terms of fetching the salient traits of the text and speech cepstrum. The experimental results have shown that the quality of speech is increased by 14% when compared to the other conevntional feature extraction techniques.

Keywords


Cite This Article

K. C. Rajeswari, R. S. Mohana, S. Manikandan and S. Beski Prabaharan, "Speech quality enhancement using phoneme with cepstrum variation features," Intelligent Automation & Soft Computing, vol. 34, no.1, pp. 65–86, 2022.



cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 1320

    View

  • 676

    Download

  • 0

    Like

Share Link