Options
An Analysis of Annotated Corpora for Emotion Classification in Text
Bostan, Laura Ana Maria; Klinger, Roman (2018): An Analysis of Annotated Corpora for Emotion Classification in Text, in: Emily M. Bender, Leon Derczynski, Pierre Isabelle, u. a. (Hrsg.), Proceedings of the 27th International Conference on Computational Linguistics, Santa Fe, New Mexico: Association for Computational Linguistics, S. 2104–2119.
Faculty/Chair:
Author:
Title of the compilation:
Proceedings of the 27th International Conference on Computational Linguistics
Editors:
Bender, Emily M.
Derczynski, Leon
Isabelle, Pierre
Conference:
COLING ; Santa Fe, New Mexico
Publisher Information:
Year of publication:
2018
Pages:
Language:
English
Abstract:
Several datasets have been annotated and published for classification of emotions. They differ in several ways: (1) the use of different annotation schemata (e. g., discrete label sets, including joy, anger, fear, or sadness or continuous values including valence, or arousal), (2) the domain, and, (3) the file formats. This leads to several research gaps: supervised models often only use a limited set of available resources. Additionally, no previous work has compared emotion corpora in a systematic manner. We aim at contributing to this situation with a survey of the datasets, and aggregate them in a common file format with a common annotation schema. Based on this aggregation, we perform the first cross-corpus classification experiments in the spirit of future research enabled by this paper, in order to gain insight and a better understanding of differences of models inferred from the data. This work also simplifies the choice of the most appropriate resources for developing a model for a novel domain. One result from our analysis is that a subset of corpora is better classified with models trained on a different corpus. For none of the corpora, training on all data altogether is better than using a subselection of the resources. Our unified corpus is available at http://www.ims.uni-stuttgart.de/data/unifyemotion.
GND Keywords: ; ; ;
Computerlinguistik
Korpus <Linguistik>
Annotation
Gefühl
Keywords:
Emotion Classification
DDC Classification:
RVK Classification:
Peer Reviewed:
Yes:
International Distribution:
Yes:
Open Access Journal:
Yes:
Type:
Conferenceobject
Activation date:
March 7, 2024
Versioning
Question on publication
Permalink
https://fis.uni-bamberg.de/handle/uniba/93962