Publikationen in Zusammenarbeit mit Forschern von Universidad del País Vasco/Euskal Herriko Unibertsitatea (9)

2016

  1. TweetLID: a benchmark for tweet language identification

    Language Resources and Evaluation, Vol. 50, Núm. 4, pp. 729-766

2015

  1. TweetNorm: a benchmark for lexical normalization of Spanish tweets

    Language Resources and Evaluation, Vol. 49, Núm. 4, pp. 883-905

2014

  1. Overview of TweetLID: Tweet language identification at SEPLN 2014

    CEUR Workshop Proceedings

  2. TweetNorm es corpus: An annotated corpus for Spanish microtext normalization

    Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014

2013

  1. Introducción a la tarea compartida Tweet-Norm 2013: normalización léxica de tuits en español

    XXIX Congreso de la Sociedad Española de Procesamiento de Lenguaje Natural: SEPLN 2013

2007

  1. Multilingual news clustering: Feature translation vs. identification of cognate named entities

    Pattern Recognition Letters, Vol. 28, Núm. 16, pp. 2305-2311

2006

  1. Multilingual Document Clustering: An heuristic approach based on cognate named entities

    COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference

  2. Multilingual News Document Clustering: Two algorithms based on cognate named entities

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

2004

  1. Evaluation of web page representations by content through clustering

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)