Extracción de información temporal de la DBpediaPropuesta de integración en un corpus semiestructurado

  1. García Serrano, Ana María
  2. Castellanos, Ángel
  3. Merás, Adolfo
Journal:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Year of publication: 2017

Issue: 58

Pages: 117-124

Type: Article

More publications in: Procesamiento del lenguaje natural

Abstract

The goal of this work is to make a proposal for the automatic extraction of temporal information in the DBpedia, general enough to be applied to different domains. The experiment is performed using a concrete domain by the identification and management of domain related DBpedia resources. With the relevant temporal information extracted from the resources it will be feed a timeline and intersected with the temporal information of the DIMH corpus (semi-structured texts or cards). Thus, we will enrich these cards with related events of the timeline. In order to visualize the results, we are using a graphical interface to facilitate the lexical and the temporal information access. In the absence of a gold standard to intrinsically evaluate the proposal, it will be applied domain and users dependent criteria and the annotated corpus is made available to the scientific community (GitHub).

Bibliographic References

  • Allen, J. 1983. Maintaining knowledge about temporal intervals. Communications of the ACM, 26(11): 832-843.
  • Derczynski, L., J. Strötgen, D. Maynard, M. A. Greenwood and M. Jung. 2016. GATE-Time: Extraction of Temporal Expressions and Events. In 10th LREC.
  • Filter J. 2015. Interactive Visualization of Large Concept Lattice. Facultad de Ciencias de la Computación, U. Magdeburgo. Alemania
  • Ganter B. 2002. Formal Concept Analysis: Methods and Applications. Computer Science. TU Dresden.
  • García-Serrano, A. y A. Castellanos. 2016. Conceptualización, acceso y visibilidad de la información en el proyecto DIMH. Cap. 16 El dibujante ingeniero al servicio de la monarquía hispánica (XVI-XVIII), páginas 379-400. ISBN: 978-84-942695-6-1.
  • Lehmann, J., R. Isele, M. Jakob, A. Jentzsch, D. Kontokostas, P. Mendes, S. Hellmann, M. Morsey, P. van Kleef, S. Auer and C. Bizer. 2015. DBpedia a large-scale, multilingual knowledge base extracted from wikipedia. Semantic Web Journal, 6(2): 167-195.
  • Llorens H., B. Navarro, E. Saquete. 2009. Detección de expresiones temporales TimeML en Catalán mediante roles semánticos y redes semánticas. Procesamiento del Lenguaje Natural (43): 13-21.
  • Merás A. 2016. Propuesta para extracción, representación y organización de información temporal en textos semiestructurados: aplicación al corpus DIMH. Tesis del máster “Lenguajes y Sistemas Informáticos” de la UNED.
  • Mihindukulasooriya N., M. Rico, R. García Castro, A. Gómez-Pérez. 2015. An Analysis of the Quality Issues of the Properties Available. Spanish Dbpedia, LNCS 9422, páginas 198-209.
  • Neouchi R., A. Tawfik and R. Frost. 2001. Towards a Temporal Extension of Formal Concept Analysis. Proceedings of the 14th Canadian Conference on AI, Ottawa, Ontario.
  • Padró M., Ll. Padró. 2014. Comparing methods for language identification. Procesamiento del Lenguaje Natural (33): 155-161.
  • Pustejovsky J., J. Castaño, R. Ingria, R. Sauri, R. Gaizauskas, A. Setzer and G. Katz. 2003. TimeML: Robust Specification of Event and Temporal Expressions in Proceedings of the IWCS International Workshop on Computational Semantics.
  • Strötgen, J. and M. Gertz. 2010. HeidelTime: High Quality Rule-Based Extraction and Normalization of Temporal Expressions. En Proceedings of the 5th International Workshop on Semantic Evaluation, páginas 321-324, Uppsala, Sweden, July. ACL.
  • Tran, N., A. Ceroni, N. Kanhabua, and C. Niederée. 2015. Back to the past: Supporting interpretations of forgotten stories by time aware re-contextualization. In Proc. of the ACM International Conference on Web Search and Data Mining, páginas 339-348.
  • Vázquez-Méndez, A. y A. García-Serrano. 2015. Anotación y representación temporal de tweets multilingües. Procesamiento del Lenguaje Natural (54): 53-60.
  • Vicente-Díez, M.T., D. Samy and P. Martínez. 2008. An empirical approach to a preliminary successful identification and resolution of temporal expressions in Spanish news corpora. Proc. of the Sixth Int. Language Resources and Evaluation Conf. (LREC'08), Marrakech, Morocco, May, 2008, European Language Resources Association (ELRA), ISBN: 2-9517408-4-0, páginas 2153-2158.
  • Vicente-Díez M.T., J. Moreno-Schneider, P. Martínez. 2010. Temporal information needs in ResPubliQA: an attempt to improve accuracy. The UC3M Participation at CLEF 2010, CLEF 2010 LABs and Workshops, Notebook Papers, Padova, Italy, September.
  • Zhang, L., W. Chen, T. Tran and A. Rettinger. 2015. Time-Aware Entity Search in DBpedia. In European Semantic Web Conference, páginas175-179.