Content area

Abstract

In an era where user-generated content becomes ever more prevalent, reliable methods to judge emotional properties of these kinds of complex texts are needed, for example for developing corpora in machine learning contexts. In this study, we focus on Dutch Twitter messages, a genre which is high in emotional content and frequently investigated in the field of computational linguistics. We compare three annotation methods to annotate the emotional dimensions valence, arousal and dominance in 300 Tweets, namely rating scales, pairwise comparison and best–worst scaling. We evaluate the annotation methods on the criterion of inter-annotator agreement, based on judgments of 18 annotators in total. On this dataset, best–worst scaling has the highest inter-annotator agreement. We find that the difference in agreement is largest for dominance and smallest for valence, suggesting that the benefit of best–worst scaling becomes more pronounced as the annotation task gets more difficult. However, we also find that best–worst scaling is particularly more time-consuming than are rating scale and pairwise comparison annotations. This leads us to conclude that, in particular when dealing with computational models, a comparative assessment of quality versus costs needs to be made.

Details

Title
Annotating affective dimensions in user-generated content
Author
De Bruyne Luna 1   VIAFID ORCID Logo  ; De Clercq Orphée 1 ; Hoste Véronique 1 

 Ghent University, LT3 Language and Translation Technology Team, Department of Translation, Interpreting and Communication, Ghent, Belgium (GRID:grid.5342.0) (ISNI:0000 0001 2069 7798) 
Pages
1017-1045
Publication year
2021
Publication date
Dec 2021
Publisher
Springer Nature B.V.
ISSN
1574020X
e-ISSN
1574-0218
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2580827900
Copyright
© The Author(s), under exclusive licence to Springer Nature B.V. part of Springer Nature 2021.