Análisis de corpus poéticos con Litcon

  1. Hernández-Lorenzo, Laura 1
  1. 1 Universidad de Sevilla
    info

    Universidad de Sevilla

    Sevilla, España

    ROR https://ror.org/03yxnpp24

Journal:
Revista de Humanidades Digitales

ISSN: 2531-1786

Year of publication: 2022

Issue: 7

Pages: 104-126

Type: Article

DOI: 10.5944/RHD.VOL.7.2022.34586 DIALNET GOOGLE SCHOLAR lock_openOpen access editor

More publications in: Revista de Humanidades Digitales

Abstract

Research in corpus analysis has grown and evolved significantly in recent years, with an increase in the tools and resources available for the analysis and exploration of textual corpora. After a brief review of the available corpus analysis programs, this article presents Litcon, a new software developed with special attention to the study of poetic texts in verse. Litcon is cross-platform, it includes common tools of corpus analysis software, such as the generation of concordances, word frequency lists and keywords, and even functionalities for contrasting corpora, tagging and analyzing Part-of-Speech tagged text, and creating random samples. Litcon and its tools are presented through a case study with the poetry of Fernando de Herrera.

Bibliographic References

  • Alonso Ramos, M. (1994). Hacia una definición del concepto de colocación: De J. R. Firth a I. A. Mel’čuk. Revista de Lexicografía, I, 9-28. https://ruc.udc.es/dspace/bitstream/handle/2183/5383/RL_1-1.pdf?sequence=1
  • Anthony, L. (2013). A critical look at software tools in Corpus Linguistics. Linguistic Research, 30(2), 141-161.
  • Anthony, L. (2022a). What can corpus software do? En A. O’Keeffe & M. McCarthy (Eds.), The Routledge Handbook of Corpus Linguistics (2nd ed.) (pp. 103-125). Routledge.
  • Anthony, L. (2022b). Antconc (4.1.1) [Computer software]. Waseda University. https://www.laurenceanthony.net/software/antconc/
  • Baker, P., McEnery, A., & Hardie, A. (2006). A Glossary of Corpus Linguistics. Edinburgh University Press.
  • Corpas Pastor, G. (2001). En torno al concepto de colocación. Euskera, XLVI(1), 89-108.
  • Eder, M., Rybicki, J., & Kestemont, M. (2016). Stylometry with R: A Package for Computational Text Analysis. The R Journal, 8(1), 107-121. https://journal.r-project.org/archive/2016/RJ-2016-007/index.html
  • Gutiérrez, S. (2019). Análisis de corpus con Voyant Tools. The Programming Historian en español, 3. https://programminghistorian.org/es/lecciones/analisis-voyant-tools
  • Heiden, S. (2010). The TXM Platform: Building Open-Source Textual Analysis Software Compatible with the TEI Encoding Scheme. En 24th Pacific Asia Conference on Language, Information and Computation (pp. 389-398). Institute for Digital Enhancement of Cognitive Development, Waseda University. https://halshs.archives-ouvertes.fr/halshs-00549764
  • Heiden, S. (2018). TXM (0.7.9) [Computer software]. http://textometrie.ens-lyon.fr/?lang=fr
  • Heiden, S., Magué, J.-P., & Pincemin, B. (2010). TXM : Une plateforme logicielle open-source pour la textométrie – conception et développement. En S. Bolasco (Ed.), Proc. Of 10th International Conference on the Statistical Analysis of Textual Data-JADT 2010 (Vol. 2, pp. 1021-1032). Edizioni Universitarie di Lettere Economia Diritto. https://halshs.archives-ouvertes.fr/halshs-00549779
  • Hernández-Lorenzo, L. (2020). Los textos poéticos de Fernando de Herrera: Aproximaciones desde la Estilística de corpus y la Estilometría. https://idus.us.es/handle/11441/93465
  • Hernández-Lorenzo, L. (2021). Nueva Luz para la problemática de Versos: Una aproximación a su léxico desde las Humanidades Digitales y los estudios de corpus. En J. Montero & P. Ruiz Pérez (Coords.), De Herrera. Estudios reunidos con motivo del IV Centenario de Versos (1619) (pp. 151-206). Universidad de Sevilla.
  • Herrera, F. de. (1975). Obra poética (J. M. Blecua, Ed.). Boletín de la Real Academia Española.
  • Hockey, S. (2004). The History of Humanities Computing. En S. Schreibman, R. Siemens, & J. Unsworth (Eds.), A Companion to Digital Humanities (pp. 3-19). Blackwell.
  • Jannidis, F. (2014). PyDelta [Python]. Universität Julius-Maximilians Würzburg. https://github.com/cophi-wue/pydelta
  • Juola, P. (2005). A Prototype for Authorship Attribution Software. En P. Liddell, R. Siemens, A. Bia, M. Holmes, P. Baer, G. Newton, & S. Arneil (Eds.), The International Conference on Humanities Computing and Digital Scholarship. The 17th Joint International Conference (pp. 97-99). University of Victoria.
  • Jurafsky, D., & Martin, J. H. (2021). Sequence Labeling for Parts of Speech and Named Entities. En Speech and Language Processing. Prentice Hall.
  • Macrí, O. (1972). Fernando de Herrera. Gredos.
  • McEnery, A., & Hardie, A. (2012). Corpus linguistics: Method, theory and practice. Cambridge University Press.
  • McEnery, A., & Wilson, A. (2001). Corpus Linguistics. An Introduction. Edinburgh University Press.
  • Montero, J. (2021). La transmisión de los textos poéticos de Fernando de Herrera: Estado de la cuestión y nuevas perspectivas. En De Herrera. Estudios reunidos en el centenario de versos (1619) (pp. 107-149). Editorial Universidad de Sevilla.
  • Padró, L. (2011). Analizadores multilingües en freeling. Linguamática, 3, 13-20. http://upcommons.upc.edu/handle/2117/14772
  • Pincemin, B., & Heiden, S. (s. f.). Qu’est-ce que la textométrie? Présentation. Site du projet Textométrie. https://pages.textometrie.org/textometrie/Introduction
  • Rayson, P., & Garside, R. (2000). Comparing Corpora using Frequency Profiling. Proceedings of the Workshop Comparing Corpora, held in conjuction with the 38th annual meeting of the Association for Computational Linguistics (ACL 2000), 9, 1-6.
  • Rockwell, G., & Sinclair, S. (2016). Hermeneutica. Computer-Assisted Interpretation in the Humanities. MIT Press.
  • Rojas Castro, A. (2013). Las Humanidades Digitales: Principios, valores y prácticas. Janus: estudios sobre el Siglo de Oro, 2, 74-99. http://www.janusdigital.es/articulo.htm?id=24
  • Rüdiger, J. O. (2018a). Corpus Explorer (2.0) [Computer software]. https://notes.jan-oliver-ruediger.de/software/corpusexplorer-overview/
  • Rüdiger, J. O. (2018b). CorpusExplorer v2.0-Seminartauglich in einem halben Tag. DHd2018. Kritik der digitalen Vernunft, 28-30. http://dhd2018.uni-koeln.de/wp-content/uploads/boa-DHd2018-web-ISBN.pdf
  • Schmid, H. (1994). Probabilistic Part-of-Speech Tagging Using Decision Trees. Proceedings of International Conference on New Methods in Language Processing.
  • Scott, M. (2010). What can corpus software do? En A. O’Keeffe & M. McCarthy (Eds.), The Routledge Handbook of Corpus Linguistics (1st ed.) (pp. 136-151). Routledge.
  • Scott, M. (2016). WordSmith Tools (7.0) [Computer software]. Lexical Analysis Software. https://www.lexically.net/wordsmith/
  • Sinclair, S., & Rockwell, G. (2016). Voyant Tools [Web application]. https://voyant-tools.org/