Search Results (3)
  • Open Access

    ARTICLE

    A New Speech Encoder Based on Dynamic Framing Approach

    Renyuan Liu, Jian Yang, Xiaobing Zhou*, Xiaoguang Yue

    CMES-Computer Modeling in Engineering & Sciences, Vol.136, No.2, pp. 1259-1276, 2023, DOI:10.32604/cmes.2023.021995

    Abstract Latent information is difficult to get from the text in speech synthesis. Studies show that features from speech can get more information to help text encoding. In the field of speech encoding, a lot of work has been conducted on two aspects. The first aspect is to encode speech frame by frame. The second aspect is to encode the whole speech to a vector. But the scale in these aspects is fixed. So, encoding speech with an adjustable scale for more latent information is worthy of investigation. But current alignment approaches only support frame-by-frame encoding and speech-to-vector encoding. It remains…

  • Open Access

    ARTICLE

    Emotional Vietnamese Speech Synthesis Using Style-Transfer Learning

    Thanh X. Le, An T. Le, Quang H. Nguyen*

    Computer Systems Science and Engineering, Vol.44, No.2, pp. 1263-1278, 2023, DOI:10.32604/csse.2023.026234

    Abstract In recent years, speech synthesis systems have allowed for the production of very high-quality voices. Therefore, research in this domain is now turning to the problem of integrating emotions into speech. However, the method of constructing a speech synthesizer for each emotion has some limitations. First, this method often requires an emotional-speech data set with many sentences. Such data sets are very time-intensive and labor-intensive to complete. Second, training each of these models requires computers with large computational capabilities and a lot of effort and time for model tuning. In addition, each model for each emotion failed to take advantage…

  • Open Access

    ARTICLE

    Speech Quality Enhancement Using Phoneme with Cepstrum Variation Features

    K. C. Rajeswari*, R. S. Mohana, S. Manikandan, S. Beski Prabaharan

    Intelligent Automation & Soft Computing, Vol.34, No.1, pp. 65-86, 2022, DOI:10.32604/iasc.2022.022681

    Abstract In recent years, Text-to-Speech (TTS) synthesis has taken on a new dimension. People prefer voice-embedded toys, online buyers are interested in interactive chat applications in the form of a text-to-speech facility, screen readers serve visually challenged people, and many more applications use a TTS module. TTS is a system capable of converting arbitrary text input into natural-sounding speech. Its success lies in producing more human-like, natural-sounding speech. The most important technical aspect of TTS is the feature extraction process. Both text and speech features are needed but it is not that easy to select meaningful and useful features…
