A tagged corpus for automatic labeling of disabilities in medical scientific papers

  1. Carlos Valmaseda 1
  2. Juan Martinez Romo 1
  3. Lourdes Araujo 1
  1. 1 Universidad Nacional de Educación a Distancia

    Madrid, Spain

    ROR https://ror.org/02msb5n36

Book:
10th International Conference on Language Resources and Evaluation (LREC'16)
  1. Nicoletta Calzolari (coord.)
  2. Khalid Choukri (coord.)
  3. Thierry Declerck (coord.)
  4. Asuncion Moreno (coord.)

Publisher: European Language Resources Association

ISBN: 978-2-9517408-9-1

Year of publication: 2016

Pages: 1022-1025

Type: Book chapter

Abstract

This paper presents the creation of a corpus of scientific papers labeled with disabilities. Identifying medical concepts in documents, and disabilities in particular, is a complex task, mainly because of the variety of expressions that can refer to the same problem. There is currently no set of documents manually annotated with disabilities against which to evaluate a system that detects such concepts automatically. This corpus was created to fill that gap, with the aim of facilitating the evaluation of systems that implement an automatic annotation tool for extracting biomedical concepts such as disabilities. The result is a set of manually annotated scientific papers. The papers were selected by running a search with a list of rare diseases, since these diseases are generally associated with several disabilities of different kinds.