MLSA – A Multi-layered Reference Corpus for German Sentiment Analysis

MLSA – A Multi-layered Reference Corpus for German Sentiment Analysis

Title: MLSA – A Multi-layered Reference Corpus for German Sentiment Analysis
Authors: Simon Clematide , Stefan Gindl , Manfred Klenner , Stefanos Petrakis , Robert Remus , Josef Ruppenhofer , Ulli Waltinger , Michael Wiegand
Pub/Conf:  Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC-2012), Istanbul, Turkey, 3551-3556, 2012

Abstract:
In this paper, we describe MLSA, a publicly available multi-layered reference corpus for German-language sentiment analysis. The construction of the corpus is based on the manual annotation of 270 German-language sentences considering three different layers of granularity. The sentence-layer annotation, as the most coarse-grained annotation, focuses on aspects of objectivity, subjectivity and the overall polarity of the respective sentences. Layer 2 is concerned with polarity on the word- and phrase-level, annotating both subjective and factual language. The annotations on Layer 3 focus on the expression-level, denoting frames of private states such as objective and direct speech events. These three layers and their respective annotations are intended to be fully independent of each other. At the same time, exploring for and discovering interactions that may exist between different layers should also be possible. The reliability of the respective annotations was assessed using the average pairwise agreement and Fleiss’ multi-rater measures. We believe that MLSA is a beneficial resource for sentiment analysis research, algorithms and applications that focus on the German language.

BibTeX:
@inproceedings{DBLP:conf/lrec/ClematideGKPRRWW12,
  author    = {Simon Clematide and
               Stefan Gindl and
               Manfred Klenner and
               Stefanos Petrakis and
               Robert Remus and
               Josef Ruppenhofer and
               Ulli Waltinger and
               Michael Wiegand},
  title     = {MLSA - A Multi-layered Reference Corpus for German Sentiment
               Analysis},
  booktitle = {Proceedings of the Eighth International Conference on Language
               Resources and Evaluation (LREC-2012), Istanbul, Turkey,
               May 23-25, 2012},
  year      = {2012},
  pages     = {3551-3556},
  editor    = {Nicoletta Calzolari and
               Khalid Choukri and
               Thierry Declerck and
               Mehmet Ugur Dogan and
               Bente Maegaard and
               Joseph Mariani and
               Jan Odijk and
               Stelios Piperidis},
}

PDFBibTeX