Full text

Turn on search term navigation

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Conversational agents are gaining huge popularity in industrial applications such as digital assistants, chatbots, and particularly systems for natural language understanding (NLU). However, a major drawback is the unavailability of a common metric to evaluate the replies against human judgement for conversational agents. In this paper, we develop a benchmark dataset with human annotations and diverse replies that can be used to develop such metric for conversational agents. The paper introduces a high-quality human annotated movie dialogue dataset, HUMOD, that is developed from the Cornell movie dialogues dataset. This new dataset comprises 28,500 human responses from 9500 multi-turn dialogue history-reply pairs. Human responses include: (i) ratings of the dialogue reply in relevance to the dialogue history; and (ii) unique dialogue replies for each dialogue history from the users. Such unique dialogue replies enable researchers in evaluating their models against six unique human responses for each given history. Detailed analysis on how dialogues are structured and human perception on dialogue score in comparison with existing models are also presented.

Details

Title
Human Annotated Dialogues Dataset for Natural Conversational Agents
Author
Merdivan, Erinc 1 ; Singh, Deepika 2 ; Hanke, Sten 3 ; Kropf, Johannes 4   VIAFID ORCID Logo  ; Holzinger, Andreas 5 ; Geist, Matthieu 6 

 AIT Austrian Institute of Technology, 2700 Wiener Neustadt, Austria; [email protected]; CentraleSupélec, Université de Lorraine, CNRS, LORIA, F-57000 Metz, France 
 AIT Austrian Institute of Technology, 2700 Wiener Neustadt, Austria; [email protected]; Holzinger Group, HCI-KDD, Institute for Medical Informatics/Statistics, Medical University Graz, 8036 Graz, Austria; [email protected] 
 FH Joanneum Gesellschaft mbH, 8020 Graz, Austria; [email protected] 
 AIT Austrian Institute of Technology, 2700 Wiener Neustadt, Austria; [email protected] 
 Holzinger Group, HCI-KDD, Institute for Medical Informatics/Statistics, Medical University Graz, 8036 Graz, Austria; [email protected] 
 Université de Lorraine, CNRS, LIEC, F-57000 Metz, France; [email protected] 
First page
762
Publication year
2020
Publication date
2020
Publisher
MDPI AG
e-ISSN
20763417
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2533921031
Copyright
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.