Compiling and analyzing a tagged learner corpus: a corpus-based study of adjective uses

  1. Castillo Rodríguez, Cristina (1)
  2. Díaz Lage, José María (2)
  3. Rubio Martínez, Beatriz (3)

  1. Universidad de Málaga, Málaga, Spain
  2. Universidad Internacional de La Rioja, Logroño, Spain
  3. University College Dublin, Dublin, Ireland

Círculo de lingüística aplicada a la comunicación

ISSN: 1576-4737

Year of publication: 2020

Issue title: Monográfico: TAME, gramaticalización e interfaz sintaxis-pragmática del español y el mapudungún

Issue: 81

Pages: 115-136

Type: Article

DOI: 10.5209/CLAC.67932 (open access via the publisher)



A learner corpus (LC) is widely recognized as a rich source of information about the expressions students use and the errors they make in their written productions. As teachers, we can benefit from compiling students’ tasks in order to analyze their writing in detail. However, merely collecting texts does not guarantee that the corpus can be exploited successfully, since the process involves more steps than saving texts. It is therefore essential to follow a protocolized compilation methodology. In this paper we propose five phases for compiling an LC made up of spontaneous written productions by undergraduate and postgraduate students. The results obtained from exploiting the LC reveal errors in students’ productions regarding the plural, comparative and superlative forms of adjectives, as well as other failures detected in the tagging phase, most of which are due to students’ misuse.

Funding information

This work was carried out within the framework of the research project B0036-1617-104-ETEL (Universidad Internacional de La Rioja, 2016-2018).


Bibliographic references

  • Anthony, Lawrence (2015a). AntConc (version 3.4.4) [Computer Software]. Tokyo, Japan: Waseda University. Available from
  • Anthony, Lawrence (2015b). TagAnt (version 1.2.0) [Computer Software]. Tokyo, Japan: Waseda University. Available from
  • Anthony, Lawrence (2016). EncodeAnt (version 1.2.0) [Computer Software]. Tokyo, Japan: Waseda University. Available from
  • Biber, Douglas, Conrad, Susan, & Reppen, Randi (1998). Corpus linguistics: Investigating language structure and use. Cambridge: Cambridge University Press.
  • Castillejos López, Willelmira (2009). Error analysis in a learner corpus: what are the learners' strategies? In Cantos Gómez, Pascual & Sánchez Pérez, Aquilino, eds., A survey of corpus-based research. Panorama de investigaciones basadas en corpus, 675-690. Murcia: AELINCO.
  • Castillo Rodríguez, Cristina & Díaz Lage, José María (2015). Exploitation of a learner corpus: analysing openings and endings in academic forums. Revista Opción, 31.6, 192-210.
  • Corder, Stephen Pit (1992). Introducción a la lingüística aplicada. México: Limusa.
  • Dellar, Hugh (2003). What have corpora ever done for us? Developing Teachers.
  • Díaz-Negrillo, Ana & Thompson, Paul (2013). Learner corpora: looking towards the future. In Díaz-Negrillo, Ana, Ballier, Nicolas & Thompson, Paul, eds., Automatic Treatment and Analysis of Learner Corpus Data, 9-29. Amsterdam: John Benjamins.
  • Ellis, Rod (1994). The study of second language acquisition. Oxford: Oxford University Press.
  • van Els, Theo, Bongaerts, Theo, Extra, Guus, van Os, Charles & Janssen-van Dieten, Anne-Mieke (1984). Applied Linguistics and the Learning and Teaching of Foreign Languages. London: Edward Arnold.
  • Gabbrielli, Richard (1998). Incorporating a student corpus in your teaching. IATEFL Newsletter, 141, 14-15.
  • Gabrielatos, Costas (2005). Corpora and Language Teaching: Just a fling or wedding bells? TESL-EJ, 8.4.
  • Gilquin, Gaëtanelle (2015). From design to collection of learner corpora. In Granger, Sylviane, Gilquin, Gaëtanelle & Meunier, Fanny, eds., The Cambridge Handbook of learner corpus research, 9-34. Cambridge: Cambridge University Press.
  • Granger, Sylviane (2003). Error-tagged learner corpora and CALL: a promising synergy. CALICO journal, 20.3, 465-480. doi: 10.1558/cj.v20i3.465-480
  • Granger, Sylviane (2004). Computer learner corpus research. In Connor, Ulla & Upton, Thomas A., eds., Applied Corpus Linguistics: A Multidimensional Perspective, 123-145. Amsterdam: Rodopi.
  • Granger, Sylviane, Gilquin, Gaëtanelle, & Meunier, Fanny (2015). Introduction: learner corpus research – past, present and future. In Granger, Sylviane, Gilquin, Gaëtanelle & Meunier, Fanny, eds., The Cambridge Handbook of learner corpus research, 1-7. Cambridge: Cambridge University Press.
  • Kennedy, Graeme (1998). An introduction to corpus linguistics. London: Longman.
  • Lozano, Cristóbal & Mendikoetxea, Amaya (2013). Learner corpora and SLA: the design and collection of CEDEL2. In Díaz-Negrillo, Ana, Ballier, Nicolas & Thompson, Paul, eds., Automatic Treatment and Analysis of Learner Corpus Data, 65-100. Amsterdam: John Benjamins.
  • McEnery, Tony, Xiao, Richard & Tono, Yukio (2006). Corpus-Based Language Studies. London and New York: Routledge.
  • McEnery, Tony & Wilson, Andrew (2001). Corpus linguistics (2nd edn.). Edinburgh: Edinburgh University Press.
  • Meunier, Fanny (2011). Corpus linguistics and second/foreign language learning: exploring multiple paths. Revista Brasileira de Linguística Aplicada, 11.2, 459-477.
  • Meyer, Charles F. (2002). English corpus linguistics: An introduction. Cambridge: Cambridge University Press.
  • Nesselhauf, Nadja (2004). Learner corpora and their potential for language teaching. In Sinclair, John, ed., How to Use Corpora in Language Teaching, 125-152. Amsterdam: John Benjamins.
  • Pravec, Norma A. (2002). Survey of learner corpora. ICAME journal, 26, 81-114.
  • Tognini-Bonelli, Elena (2001). Corpus linguistics at work. Amsterdam: John Benjamins.
  • UCREL (n.d.). CLAWS part-of-speech tagger for English. Retrieved from
  • Widdowson, Henry George (1991). The description and prescription of language. In Alatis, James E., ed., Georgetown University Round Table on Languages and Linguistics. Linguistics and language pedagogy: The state of the art, 11-24. Washington, D.C.: Georgetown University Press.