Deutsch
 
Hilfe Datenschutzhinweis Impressum
  DetailsucheBrowse

Datensatz

DATENSATZ AKTIONENEXPORT

Freigegeben

Konferenzbeitrag

Metrical Tagging in the Wild: Building and Annotating Poetry Corpora with Rhythmic Features

MPG-Autoren
/persons/resource/persons209101

Haider,  Thomas
Department of Language and Literature, Max Planck Institute for Empirical Aesthetics, Max Planck Society;
Institute for Natural Language Processing (IMS), University of Stuttgart;

Externe Ressourcen
Es sind keine externen Ressourcen hinterlegt
Volltexte (beschränkter Zugriff)
Für Ihren IP-Bereich sind aktuell keine Volltexte freigegeben.
Volltexte (frei zugänglich)
Es sind keine frei zugänglichen Volltexte in PuRe verfügbar
Ergänzendes Material (frei zugänglich)
Es sind keine frei zugänglichen Ergänzenden Materialien verfügbar
Zitation

Haider, T. (2021). Metrical Tagging in the Wild: Building and Annotating Poetry Corpora with Rhythmic Features. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics (pp. 3715-3725).


Zitierlink: https://hdl.handle.net/21.11116/0000-0008-72B4-C
Zusammenfassung
A prerequisite for the computational study of literature is the availability of properly digitized texts, ideally with reliable meta-data and ground-truth annotation. Poetry corpora do exist for a number of languages, but larger collections lack consistency and are encoded in various standards, while annotated corpora are typically constrained to a particular genre and/or were designed for the analysis of certain linguistic features (like rhyme). In this work, we provide large poetry corpora for English and German, and annotate prosodic features in smaller corpora to train corpus driven neural models that enable robust large scale analysis.
We show that BiLSTM-CRF models with syllable embeddings outperform a CRF baseline and different BERT-based approaches. In a multi-task setup, particular beneficial task relations illustrate the inter-dependence of poetic features. A model learns foot boundaries better when jointly predicting syllable stress, aesthetic emotions and verse measures benefit
from each other, and we find that caesuras are quite dependent on syntax and also integral to shaping the overall measure of the line.