Speech emotion recognition in conversations using artificial intelligence: a systematic review and meta-analysis

Abstract

Manifestations of emotion in social conversational interactions stand at a focal point in the rapidly growing affective computing area, with applications in healthcare, education and human-computer interaction. Artificial intelligence (AI) holds great potential in modeling the challenging dynamic nature of affect in speech conversation. In this paper, we analyze and criticize latest trends and open problems through a systematic review and multi-subgroup meta-analysis of AI approaches for emotion recognition in conversation (ERC). We adopt the PRISMA-DTA guidelines toward analysis of AI-driven speech ERC. A comprehensive database search through predefined query strings and selection criteria allowed for data extraction of essential diagnostic performance parameters. We analyze salient patterns related to methodological quality and risk of bias. Univariate random-effects models are then designed with a multi-subgroup perspective, centered around affective annotations models, while encompassing the ERC parameters of modalities, feature extraction and conversation style. 51 studies were systematically reviewed for qualitative analysis, whereas 27 articles were included in the meta-analysis. Diagnostic test performance manifested with high heterogeneity, with intriguing insights regarding affective state annotation, input modality, feature extraction methods, and dataset conversation style. Our analysis raised concerns regarding bias, reporting quality and inter-rater reliability in annotations. Our research contributes fine-grained insights as recommendations that tackle open-problems in ERC. While providing valuable information on diagnostic performance of AI in speech ERC, we underscore the imperative need for further advancements in annotations and models capable of handling diverse emotional expressions.

Trial Registration: PROSPERO identifier - CRD42023416879.

Details

Business indexing term

Subject:

Artificial intelligence

Research method

Quantitative research; Meta-analysis; Systematic review

Title

Speech emotion recognition in conversations using artificial intelligence: a systematic review and meta-analysis

Publication title

The Artificial Intelligence Review; Dordrecht

Volume

Issue

Pages

198

Publication year

2025

Publication date

Jul 2025

Publisher

Springer Nature B.V.

Place of publication

Dordrecht

Country of publication

Netherlands

Publication subject

Computers--Artificial Intelligence

ISSN

02692821

e-ISSN

15737462

Source type

Scholarly Journal

Language of publication

English

Document type

Journal Article

Publication history

Online publication date

2025-04-11

Milestone dates

2025-03-07 (Registration); 2025-03-07 (Accepted)

Publication history

First posting date

11 Apr 2025

DOI

https://doi.org/10.1007/s10462-025-11197-8

ProQuest document ID

3190395670

Document URL

https://www.proquest.com/scholarly-journals/speech-emotion-recognition-conversations-using/docview/3190395670/se-2?accountid=208611

Last updated

2025-11-14

Database

ProQuest One Academic

Speech emotion recognition in conversations using artificial intelligence: a systematic review and meta-analysis

Content area

Abstract

Details