Tratamiento de Redes Sociales en Desambiguación de Nombres de Persona en la Web

  1. Montalvo Herranz, Soto
  2. Fresno Fernández, Víctor
  3. Delgado Muñoz, Agustín D
  4. Martínez Unanue, Raquel
Revista:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Año de publicación: 2016

Número: 57

Páginas: 117-124

Tipo: Artículo

Otras publicaciones en: Procesamiento del lenguaje natural

Resumen

In this work, we present two heuristics to treat web pages from social networks for person name disambiguation in the Web. This problem consists in clustering the results provided by a search engine when the query is a person name according to the individual they refer to. Although these web pages could negatively affect when grouping the results, most of the systems in the state-of-the-art do not take into account their role in this problem. We have evaluated our heuristics with two collections that contain this kind of web pages. We have used an extension of an algorithm of the state of the art to cluster the web pages. Both heuristics get improvements when there is a high number of social web pages, and the proposed algorithm is more independent with respect to the ambiguity degree of person names than other ones in the state of the art.

Referencias bibliográficas

  • Artiles, J. 2009. Web People Search. PhD Thesis, UNED University.
  • Artiles, J., J. Gonzalo, and S. Sekine. 2007. The SemEval-2007 WePS Evaluation: Establishing a Benchmark for the Web People Search Task. En Proceedings of SemEval2007, pages 64-69. ACL.
  • Artiles, J., E. Amigo, and J. Gonzalo. 2009a. The Role of Named Entities in Web People Search. En Proceedings of EMNLP 2009.
  • Artiles, J., J. Gonzalo, and S. Sekine. 2009b. Weps 2 Evaluation Campaign: Overview of the Web People Search Clustering Task. En 2nd Web People Search Evaluation Workshop (WePS 2009), 18th WWW Conference.
  • Artiles, J., A. Borthwick, J. Gonzalo, S. Sekine, and E. Amigo. 2010. WePS-3 Evaluation Campaign: Overview of the Web People Search Clustering and Attribute Extraction Tasks. En Third Web People Search Evaluation Forum (WePS-3), CLEF 2010.
  • Bagga, A. and B. Baldwin. 1998. EntityBased Cross-Document Coreferencing Using the Vector Space Model. En Proceedings of the COLING/ACL’98 - Volume 1, pages 79- 85.
  • Balog, K., J. He, K. Hofmann, V. Jijkoun, C. Monz, M. Tsagkias, W. Weerkamp, and M. de Rijke. 2009. The University of Amsterdam at WePS-2. En 2nd Web People Search Evaluation Workshop (WePS 2009), 18th WWW Conference.
  • Richard Berendsen 2015. Finding People, Papers, and Posts: Vertical Search Algorithms and Evaluation. PhD Thesis. Informatics Institute, University of Amsterdam. Chen. Y., Yat Mei Lee, S., and Huang, C.R. 2012. A Robust Web Personal Name Information Extraction System. En Expert Systems with Applications, Vol. 32, Issue 3, pp. 2690-2699.
  • Delgado, A. D, R. Mart´ınez, V. Fresno, and S. Montalvo. 2014. A Data Driven Approach for Person Name Disambiguation in Web Search Results. En Proceedings of COLING 2014, pages 301-310.
  • Delgado, A. D, R. Mart´ınez, S. Montalvo, and V. Fresno. 2014. An Unsupervised Algorithm for Person Name Disambiguation in the Web. En Procesamiento del Lenguaje Natural, 53, pages 51-58.
  • Gruetze, T., Kasneci, G., Zuo, Z., and Naumann, F. 2014. Bootstrapping Wikipedia to answer ambiguous person name queries. En Proceedings of the 30th International Conference on Data Engineering Workshops (ICDE), pages 56-61. Chicago, IL, USA.
  • Liu, Z., Q. Lu, and J. Xu. 2011. High Performance Clustering for Web Person Name Disambiguation using Topic Capturing. En International Workshop on Entity-Oriented Search (EOS).
  • Long, C. and L. Shi. 2010. Web Person Name Disambiguation by Relevance Weighting of Extended Feature Sets. En Third Web People Search Evaluation Forum (WePS-3), CLEF 2010.
  • Nuray-Turan, R., Kalashnikov, D. V., and Mehrotra S. 2012. Exploiting Web querying for Web People Search. ACM Transactions on Database Systems (TODS), Vol. 37, Issue 1.
  • Wilcoxon, F. 1945. Individual Comparisons by Ranking Methods, 1(6). Biometrics Bulletin.
  • Xu, J., Lu, Q., Li, M., and Li, W. 2015. Web Person Disambiguation Using Hierarchical CoReference Model. En Proceedings of CICLing 2015, Part I, pages 279-291.