An emotion-sensitive dialogue policy for

Abstract

Reinforcement learning (RL) is an effective method in training dialogue policies to steer the conversation towards successful task completion. However, most RL-based methods only rely on semantic inputs that lack empathy as they ignore the user emotional information. Moreover, these methods suffer from delayed rewards caused by the user simulator returning valuable results only at dialogue end. Recently, some methods have been proposed to learn the reward function together with user emotions, but they omit considering user emotion in each dialogue turn. In this paper, we proposed an emotion-sensitive dialogue policy model (ESDP), it incorporates user emotions information into dialogue policy and selects the optimal action by the combination of top-k actions with the user emotions. The user emotion information in each turn is used as an immediate reward for the current dialogue state to solve sparse rewards and the dependency on termination. Extensive experiments validate that our method outperforms the baseline approaches when combined with different Q-Learning algorithms, and also surpasses other popular existing dialog policies’ performance.

Details

Title

An emotion-sensitive dialogue policy for task-oriented dialogue system

Author

Zhu, Hui¹; Wang, Xv²; Wang, Zhenyu²; Xv, Kai²

¹ College of Economics and Trade Guangdong Mechanical & Electrical Polytechnic, Guangzhou, China
² South China University of Technology, School of Software Engineering, Guangzhou, China (GRID:grid.79703.3a) (ISNI:0000 0004 1764 3838)

Pages

19759

Publication year

2024

Publication date

2024

Publisher

Nature Publishing Group

e-ISSN

20452322

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1038/s41598-024-70463-x

ProQuest document ID

3097304913

© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

An emotion-sensitive dialogue policy for task-oriented dialogue system

Jump to:

Abstract

Details

Suggested sources