Performance Analysis of a Chunk-Based Speech Emotion Recognition Model Using RNN

Hyun-Sam Shin; Jun-Ki Hong

doi:10.32604/iasc.2023.033082

Open Access icon Open Access

ARTICLE

Performance Analysis of a Chunk-Based Speech Emotion Recognition Model Using RNN

Hyun-Sam Shin¹, Jun-Ki Hong^2,*

1 Division of Software Convergence, Hanshin University, Osan-si, 18101, Korea
2 Division of AI Software Engineering, Pai Chai University, Daejeon, 35345, Korea

* Corresponding Author: Jun-Ki Hong. Email: email

Intelligent Automation & Soft Computing 2023, 36(1), 235-248. https://doi.org/10.32604/iasc.2023.033082

Received 07 June 2022; Accepted 12 July 2022; Issue published 29 September 2022

Abstract

Recently, artificial-intelligence-based automatic customer response system has been widely used instead of customer service representatives. Therefore, it is important for automatic customer service to promptly recognize emotions in a customer’s voice to provide the appropriate service accordingly. Therefore, we analyzed the performance of the emotion recognition (ER) accuracy as a function of the simulation time using the proposed chunk-based speech ER (CSER) model. The proposed CSER model divides voice signals into 3-s long chunks to efficiently recognize characteristically inherent emotions in the customer’s voice. We evaluated the performance of the ER of voice signal chunks by applying four RNN techniques—long short-term memory (LSTM), bidirectional-LSTM, gated recurrent units (GRU), and bidirectional-GRU—to the proposed CSER model individually to assess its ER accuracy and time efficiency. The results reveal that GRU shows the best time efficiency in recognizing emotions from speech signals in terms of accuracy as a function of simulation time.

Keywords

RNN; speech emotion recognition; attention mechanism; time efficiency

Cite This Article

APA Style

Shin, H., Hong, J. (2023). Performance analysis of a chunk-based speech emotion recognition model using RNN. Intelligent Automation & Soft Computing, 36(1), 235-248. https://doi.org/10.32604/iasc.2023.033082

Vancouver Style

Shin H, Hong J. Performance analysis of a chunk-based speech emotion recognition model using RNN. Intell Automat Soft Comput . 2023;36(1):235-248 https://doi.org/10.32604/iasc.2023.033082

IEEE Style

H. Shin and J. Hong, "Performance Analysis of a Chunk-Based Speech Emotion Recognition Model Using RNN," Intell. Automat. Soft Comput. , vol. 36, no. 1, pp. 235-248. 2023. https://doi.org/10.32604/iasc.2023.033082

BibTex EndNote RIS

This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Performance Analysis of a Chunk-Based Speech Emotion Recognition Model Using RNN

Abstract

Keywords

Cite This Article

806

679

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Share Link