Content area

Abstract

Speech is one of the most efficient methods of communication among humans, inspiring advancements in machine speech processing under Natural Language Processing (NLP). This field aims to enable computers to analyze, comprehend, and generate human language naturally. Speech processing, as a subset of artificial intelligence, is rapidly expanding due to its applications in emotion recognition, human-computer interaction, and sentiment analysis. This study introduces a novel algorithm for emotion recognition from speech using deep learning techniques. The proposed model achieves up to a 15% improvement compared to state-of-the-art deep learning methods in speech emotion recognition. It employs advanced supervised learning algorithms and deep neural network architectures, including Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM) units. These models are trained on labeled datasets to accurately classify emotions such as happiness, sadness, anger, fear, surprise, and neutrality. The research highlights the system’s real-time application potential, such as analyzing audience emotional responses during live television broadcasts. By leveraging advancements in deep learning, the model achieves high accuracy in understanding and predicting emotional states, offering valuable insights into user behavior. This approach contributes to diverse domains, including media analysis, customer feedback systems, and human-machine interaction, showcasing the transformative potential of combining speech processing with neural networks.

Details

1009240
Title
A deep learning framework for gender sensitive speech emotion recognition based on MFCC feature selection and SHAP analysis
Author
Hu, Qingqing 1 ; Peng, Yiran 2 ; Zheng, Zhong 1 

 Faculty of Humanities and Arts, Macau University of Science and Technology, Avenida Wai Long, 999078, Taipa, Macau, China (ROR: https://ror.org/03jqs2n27) (GRID: grid.259384.1) (ISNI: 0000 0000 8945 4455) 
 Faculty of Innovation Engineering, Macau University of Science and Technology, Avenida Wai Long, 999078, Taipa, Macau, China (ROR: https://ror.org/03jqs2n27) (GRID: grid.259384.1) (ISNI: 0000 0000 8945 4455) 
Volume
15
Issue
1
Pages
28569
Number of pages
20
Publication year
2025
Publication date
2025
Section
Article
Publisher
Nature Publishing Group
Place of publication
London
Country of publication
United States
Publication subject
e-ISSN
20452322
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-08-05
Milestone dates
2025-07-28 (Registration); 2025-03-04 (Received); 2025-07-28 (Accepted)
Publication history
 
 
   First posting date
05 Aug 2025
ProQuest document ID
3236805064
Document URL
https://www.proquest.com/scholarly-journals/deep-learning-framework-gender-sensitive-speech/docview/3236805064/se-2?accountid=208611
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-08-06
Database
ProQuest One Academic