Cargando…

Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora

BACKGROUND: The treatment of negation and hedging in natural language processing has received much interest recently, especially in the biomedical domain. However, open access corpora annotated for negation and/or speculation are hardly available for training and testing applications, and even if th...

Descripción completa

Detalles Bibliográficos
Autores principales: Vincze, Veronika, Szarvas, György, Móra, György, Ohta, Tomoko, Farkas, Richárd
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3239308/
https://www.ncbi.nlm.nih.gov/pubmed/22166355
http://dx.doi.org/10.1186/2041-1480-2-S5-S8
_version_ 1782219164632481792
author Vincze, Veronika
Szarvas, György
Móra, György
Ohta, Tomoko
Farkas, Richárd
author_facet Vincze, Veronika
Szarvas, György
Móra, György
Ohta, Tomoko
Farkas, Richárd
author_sort Vincze, Veronika
collection PubMed
description BACKGROUND: The treatment of negation and hedging in natural language processing has received much interest recently, especially in the biomedical domain. However, open access corpora annotated for negation and/or speculation are hardly available for training and testing applications, and even if they are, they sometimes follow different design principles. In this paper, the annotation principles of the two largest corpora containing annotation for negation and speculation – BioScope and Genia Event – are compared. BioScope marks linguistic cues and their scopes for negation and hedging while in Genia biological events are marked for uncertainty and/or negation. RESULTS: Differences among the annotations of the two corpora are thematically categorized and the frequency of each category is estimated. We found that the largest amount of differences is due to the issue that scopes – which cover text spans – deal with the key events and each argument (including events within events) of these events is under the scope as well. In contrast, Genia deals with the modality of events within events independently. CONCLUSIONS: The analysis of multiple layers of annotation (linguistic scopes and biological events) showed that the detection of negation/hedge keywords and their scopes can contribute to determining the modality of key events (denoted by the main predicate). On the other hand, for the detection of the negation and speculation status of events within events, additional syntax-based rules investigating the dependency path between the modality cue and the event cue have to be employed.
format Online
Article
Text
id pubmed-3239308
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32393082011-12-16 Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora Vincze, Veronika Szarvas, György Móra, György Ohta, Tomoko Farkas, Richárd J Biomed Semantics Research BACKGROUND: The treatment of negation and hedging in natural language processing has received much interest recently, especially in the biomedical domain. However, open access corpora annotated for negation and/or speculation are hardly available for training and testing applications, and even if they are, they sometimes follow different design principles. In this paper, the annotation principles of the two largest corpora containing annotation for negation and speculation – BioScope and Genia Event – are compared. BioScope marks linguistic cues and their scopes for negation and hedging while in Genia biological events are marked for uncertainty and/or negation. RESULTS: Differences among the annotations of the two corpora are thematically categorized and the frequency of each category is estimated. We found that the largest amount of differences is due to the issue that scopes – which cover text spans – deal with the key events and each argument (including events within events) of these events is under the scope as well. In contrast, Genia deals with the modality of events within events independently. CONCLUSIONS: The analysis of multiple layers of annotation (linguistic scopes and biological events) showed that the detection of negation/hedge keywords and their scopes can contribute to determining the modality of key events (denoted by the main predicate). On the other hand, for the detection of the negation and speculation status of events within events, additional syntax-based rules investigating the dependency path between the modality cue and the event cue have to be employed. BioMed Central 2011-10-06 /pmc/articles/PMC3239308/ /pubmed/22166355 http://dx.doi.org/10.1186/2041-1480-2-S5-S8 Text en Copyright ©2011 Vincze et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Vincze, Veronika
Szarvas, György
Móra, György
Ohta, Tomoko
Farkas, Richárd
Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora
title Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora
title_full Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora
title_fullStr Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora
title_full_unstemmed Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora
title_short Linguistic scope-based and biological event-based speculation and negation annotations in the BioScope and Genia Event corpora
title_sort linguistic scope-based and biological event-based speculation and negation annotations in the bioscope and genia event corpora
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3239308/
https://www.ncbi.nlm.nih.gov/pubmed/22166355
http://dx.doi.org/10.1186/2041-1480-2-S5-S8
work_keys_str_mv AT vinczeveronika linguisticscopebasedandbiologicaleventbasedspeculationandnegationannotationsinthebioscopeandgeniaeventcorpora
AT szarvasgyorgy linguisticscopebasedandbiologicaleventbasedspeculationandnegationannotationsinthebioscopeandgeniaeventcorpora
AT moragyorgy linguisticscopebasedandbiologicaleventbasedspeculationandnegationannotationsinthebioscopeandgeniaeventcorpora
AT ohtatomoko linguisticscopebasedandbiologicaleventbasedspeculationandnegationannotationsinthebioscopeandgeniaeventcorpora
AT farkasrichard linguisticscopebasedandbiologicaleventbasedspeculationandnegationannotationsinthebioscopeandgeniaeventcorpora