UNED LSI en TASS 2013: Consideraciones acerca de la representación textual para la clasificación de tweets basada en recuperación de Información

Ángel Castellanos González; Juan Cigarrán Recuero; Ana García Serrano


Ángel Castellanos González ¹
Juan Cigarrán Recuero ¹
Ana García Serrano ¹

¹ Universidad Nacional de Educación a Distancia


Madrid, Spain

ROR: https://ror.org/02msb5n36

Book:

XXIX Congreso de la Sociedad Española de Procesamiento de Lenguaje Natural: SEPLN 2013

Alberto Díaz Esteban (coord.)
Iñaki Alegria Loinaz (coord.)
Julio Villena Román (coord.)

Publisher: Sociedad Española para el Procesamiento del Lenguaje Natural

ISBN: 978-84-695-8349-4

Year of publication: 2013

Pages: 213-219

Conference: Sociedad Española para el Procesamiento del Lenguaje Natural. Congreso (29th, 2013, Madrid)

Type: Conference contribution


Abstract

This article summarizes our participation in TASS 2013, which extends the work we did for TASS 2012. The previous year's work focused on tweet classification using an Information Retrieval (IR) approach: each class is modeled from the textual information of the tweets belonging to it, and each tweet to be classified is used as a query. This year we applied the approach to the Sentiment Analysis and Topic Classification tasks, focusing on which types of tweet information to use for classification and how that information should be taken into account. To this end, we propose different ways of modeling the classes, as well as different ways of performing the retrieval process according to the type of information used. The results suggest that although such information is valuable (especially named entities), it should always be used in conjunction with the overall content of the tweets.
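
The IR-style classification described in the abstract can be illustrated with a minimal sketch. It is not the authors' system: the whitespace tokenizer, the TF-IDF weighting, the cosine ranking, the function names, and the toy tweets and labels are all assumptions chosen only to show the idea of modeling each class from its tweets and using the tweet to classify as a query.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive lowercase/whitespace tokenizer (an assumption, not the paper's preprocessing).
    return text.lower().split()


def build_class_index(labeled_tweets):
    """Merge the tweets of each class into one pseudo-document with TF-IDF weights."""
    class_tf = {}
    for text, label in labeled_tweets:
        class_tf.setdefault(label, Counter()).update(tokenize(text))
    # Document frequency is counted over class pseudo-documents.
    df = Counter()
    for tf in class_tf.values():
        df.update(tf.keys())
    n_classes = len(class_tf)
    # Smoothed IDF (one common variant; an assumption).
    idf = {t: math.log((1 + n_classes) / (1 + d)) + 1.0 for t, d in df.items()}
    index = {
        label: {t: count * idf[t] for t, count in tf.items()}
        for label, tf in class_tf.items()
    }
    return index, idf


def cosine(a, b):
    # Cosine similarity between two sparse vectors stored as dicts.
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify(tweet, index, idf):
    """Use the tweet as a query and return the best-matching class label."""
    counts = Counter(tokenize(tweet))
    # Terms never seen in training get weight 0.
    query = {t: c * idf.get(t, 0.0) for t, c in counts.items()}
    return max(index, key=lambda label: cosine(query, index[label]))


# Hypothetical toy data, for illustration only.
train = [
    ("great match goal team", "sports"),
    ("team wins the league", "sports"),
    ("new phone battery screen", "technology"),
    ("phone app update", "technology"),
]
index, idf = build_class_index(train)
predicted = classify("the team scored a goal", index, idf)
```

In this sketch a class is ranked purely on the overall text of its tweets. Restricting the class models or the query to a specific kind of information, such as named entities, would amount to changing what `tokenize` emits, which is the sort of representation choice the abstract discusses.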

Data source: Dialnet
