Procesamiento del lenguaje natural y fijación del texto: Experiencias en torno a la constitución de un corpus diacrónico de sonetos
- Helena Bermúdez Sabel 1
- Clara Isabel Martínez Cantón 2
- Pablo Ruiz Fabo 3
- 1 JinnTec
- 2 Universidad Nacional de Educación a Distancia
- 3 University of Strasbourg
- Susanna Allés Torrent (coord.)
- Eugenia Fosalba Vela (coord.)
Publisher: Servicio de Publicaciones = Servei de Publicacions ; Universidad Autónoma de Barcelona = Universitat Autònoma de Barcelona
ISBN: 978-84-128138-3-8
Year of publication: 2024
Pages: 161-174
Type: Book chapter
Abstract
We present work carried out during the development of DISCO, the Diachronic Spanish Sonnet Corpus, which comprises 4,530 sonnets in Spanish from Europe, Latin America and the Philippines, spanning the 15th to the 20th centuries. The resource offers versification annotations obtained automatically with tools based on Natural Language Processing (NLP). In this article, we show how the results of automatic annotation can be exploited to detect textual transmission errors. Drawing on our experience with DISCO, we offer observations towards the creation of workflows assisted by NLP-based tools, which can help detect possible textual errors and thus allow us to focus our manual correction effort on specific passages.