Anotación y representación temporal de tweets multilingües

  1. Asunción Vázquez-Méndez
  2. Ana García-Serrano
Revista:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Ano de publicación: 2015

Número: 54

Páxinas: 53-60

Tipo: Artigo

Outras publicacións en: Procesamiento del lenguaje natural

Resumo

Time is a crucial element in any space of information and Twitter is no an exception. Although the exploitation of temporal information in retrieval and organization tasks has a long tradition, content-based approaches have not been fully explored for Twitter and researchers lack of sufficient Corpus annotated with temporal information. In this paper, we propose a temporal document annotation model based on Formal Concept Analysis theory for Twitter domain. The tweets attributes defining the temporal context are the temporal expressions, the events and their types. It is also proposed a calendar especially suited to the phenomena of commemoration of anniversaries and dates in Twitter: The Social-Imaginary Calendar. The Corpus used to the experiments is a subset of the RepLab2013 collection. A detailed description of its temporal aspects is provided.

Referencias bibliográficas

  • Alonso, O., M. Gertz, y R. Baeza-Yates. 2009. Clustering and exploring search results using timeline constructions. En Proceedings of the 18th ACM conference on Information and knowledge management, páginas 97-106.
  • Alonso, O., J. Strötgen, R. Baeza-Yates, y M. Gertz. 2011. Temporal Information Retrieval: Challenges and Opportunities. TWAW, 11:1-8.
  • Amigó, E., J. Carrillo de Albornoz, I. Chugur, A. Corujo, J. Gonzalo, T. Martín-Wanton, E. Meij, M. de Rijke, y D. Spina. 2013. Overview of RepLab 2013: Evaluating Online Reputation Monitoring Systems. En CLEF, volumen 8138 de LNCS, páginas 333-352. Springer.
  • Castellanos, A., J. Cigarrán, y A. García-Serrano. 2013. Modelling Techniques for Twitter Contents: A Step beyond Classification based Approaches. En Working Notes of the CLEF 2013.
  • Goralwalla, I., Y. Leontiev, M.T. Özsu, D. Szafron, y C. Combi. 2001. Temporal granularity: Completing the puzzle. Journal of Intelligent Information Systems, 16(1):41{63.
  • Llorens, H., E. Saquete, y B. Navarro. 2010. TIPSem (English and Spanish): Evaluating CRFs and Semantic Roles in TempEval-2. En Proceedings of the 5th International Workshop on Semantic Evaluation, SemEval '10, páginas 284-291.
  • Pustejovsky, J., M.J. Castaño, R. Ingria, R. Saurí, R.J. Gaizauskas, A. Setzer, G. Katz, y D.R. Radev. 2003. TimeML: Robust Specification of Event and Temporal Expressions in Text. New directions in question answering, 3:28-34.
  • Ritter, A., O. Etzioni, S. Clark, y others. 2012. Open domain event extraction from Twitter. En Proceedings of the 18th ACM SIGKDD International conference on Knowledge discovery and data mining, páginas 1104-1112. ACM.
  • Strötgen, J. y M. Gertz. 2010. HeidelTime: High Quality Rule-Based Extraction and Normalization of Temporal Expressions. En Proceedings of the 5th International Workshop on Semantic Evaluation, páginas 321-324, Uppsala, Sweden, July. ACL.
  • Vázquez-Méndez, A. 2014. Explotación de la Información Temporal en Twitter para la organización de tweets. Tesis de Máster, UNED.
  • Verhagen, M., I. Mani, R. Saurí, R. Knippen, S.B. Jang, J. Littman, A. Rumshisky, J. Phillips, y J. Pustejovsky. 2005. Automating temporal annotation with TARSQI. En Proceedings of the ACL 2005 on Interactive poster and demonstration sessions, páginas 81-84.
  • Vicente-Díez, M.T y P. Martínez. 2009. Temporal semantics extraction for improving web search. En 20th International Workshop on Database and Expert Systems Application, paginas 69{73. IEEE.
  • Yevtushenko, S., J. Tane, T.B. Kaiser, S. Obiedkov, J. Hereth, y H. Reppe. ConExp - The Concept Explorer. URL: http://conexp.sourceforge.net.