Options
Instance Selection Improves Cross-Lingual Model Training for Fine-Grained Sentiment Analysis
Klinger, Roman; Cimiano, Philipp (2024): Instance Selection Improves Cross-Lingual Model Training for Fine-Grained Sentiment Analysis, in: Bamberg: Otto-Friedrich-Universität, S. 153–163.
Faculty/Chair:
Author:
Publisher Information:
Year of publication:
2024
Pages:
Source/Other editions:
Proceedings of the Nineteenth Conference on Computational Natural Language Learning / Association for Computational Linguistics, 2015, S. 153–163. - ISBN: 978-1-941643-77-8
Year of first publication:
2015
Language:
English
Abstract:
Scarcity of annotated corpora for many languages is a bottleneck for training finegrained sentiment analysis models that can tag aspects and subjective phrases. We propose to exploit statistical machine translation to alleviate the need for training data by projecting annotated data in a source language to a target language such that a supervised fine-grained sentiment analysis system can be trained. To avoid a negative influence of poor-quality translations, we propose a filtering approach based on machine translation quality estimation measures to select only high-quality sentence pairs for projection. We evaluate on the language pair German/English on a corpus of product reviews annotated for both languages and compare to in-target-language training. Projection without any filtering leads to 23 % F1 in the task of detecting aspect phrases, compared to 41 % F1 for in-target-language training. Our approach obtains up to 47 % F1. Further, we show that the detection of subjective phrases is competitive to in-target-language training without filtering.
GND Keywords: ; ;
Textanalyse
Gefühl <Motiv>
Maschinelles Lernen
Keywords:
Instance Selection
DDC Classification:
RVK Classification:
Peer Reviewed:
Yes:
International Distribution:
Yes:
Open Access Journal:
Yes:
Type:
Conferenceobject
Activation date:
July 30, 2024
Permalink
https://fis.uni-bamberg.de/handle/uniba/96621