Cargando…

Negated bio-events: analysis and identification

BACKGROUND: Negation occurs frequently in scientific literature, especially in biomedical literature. It has previously been reported that around 13% of sentences found in biomedical research articles contain negation. Historically, the main motivation for identifying negated events has been to ensu...

Descripción completa

Detalles Bibliográficos
Autores principales: Nawaz, Raheel, Thompson, Paul, Ananiadou, Sophia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3561152/
https://www.ncbi.nlm.nih.gov/pubmed/23323936
http://dx.doi.org/10.1186/1471-2105-14-14
_version_ 1782257914420920320
author Nawaz, Raheel
Thompson, Paul
Ananiadou, Sophia
author_facet Nawaz, Raheel
Thompson, Paul
Ananiadou, Sophia
author_sort Nawaz, Raheel
collection PubMed
description BACKGROUND: Negation occurs frequently in scientific literature, especially in biomedical literature. It has previously been reported that around 13% of sentences found in biomedical research articles contain negation. Historically, the main motivation for identifying negated events has been to ensure their exclusion from lists of extracted interactions. However, recently, there has been a growing interest in negative results, which has resulted in negation detection being identified as a key challenge in biomedical relation extraction. In this article, we focus on the problem of identifying negated bio-events, given gold standard event annotations. RESULTS: We have conducted a detailed analysis of three open access bio-event corpora containing negation information (i.e., GENIA Event, BioInfer and BioNLP’09 ST), and have identified the main types of negated bio-events. We have analysed the key aspects of a machine learning solution to the problem of detecting negated events, including selection of negation cues, feature engineering and the choice of learning algorithm. Combining the best solutions for each aspect of the problem, we propose a novel framework for the identification of negated bio-events. We have evaluated our system on each of the three open access corpora mentioned above. The performance of the system significantly surpasses the best results previously reported on the BioNLP’09 ST corpus, and achieves even better results on the GENIA Event and BioInfer corpora, both of which contain more varied and complex events. CONCLUSIONS: Recently, in the field of biomedical text mining, the development and enhancement of event-based systems has received significant interest. The ability to identify negated events is a key performance element for these systems. We have conducted the first detailed study on the analysis and identification of negated bio-events. Our proposed framework can be integrated with state-of-the-art event extraction systems. The resulting systems will be able to extract bio-events with attached polarities from textual documents, which can serve as the foundation for more elaborate systems that are able to detect mutually contradicting bio-events.
format Online
Article
Text
id pubmed-3561152
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35611522013-02-05 Negated bio-events: analysis and identification Nawaz, Raheel Thompson, Paul Ananiadou, Sophia BMC Bioinformatics Research Article BACKGROUND: Negation occurs frequently in scientific literature, especially in biomedical literature. It has previously been reported that around 13% of sentences found in biomedical research articles contain negation. Historically, the main motivation for identifying negated events has been to ensure their exclusion from lists of extracted interactions. However, recently, there has been a growing interest in negative results, which has resulted in negation detection being identified as a key challenge in biomedical relation extraction. In this article, we focus on the problem of identifying negated bio-events, given gold standard event annotations. RESULTS: We have conducted a detailed analysis of three open access bio-event corpora containing negation information (i.e., GENIA Event, BioInfer and BioNLP’09 ST), and have identified the main types of negated bio-events. We have analysed the key aspects of a machine learning solution to the problem of detecting negated events, including selection of negation cues, feature engineering and the choice of learning algorithm. Combining the best solutions for each aspect of the problem, we propose a novel framework for the identification of negated bio-events. We have evaluated our system on each of the three open access corpora mentioned above. The performance of the system significantly surpasses the best results previously reported on the BioNLP’09 ST corpus, and achieves even better results on the GENIA Event and BioInfer corpora, both of which contain more varied and complex events. CONCLUSIONS: Recently, in the field of biomedical text mining, the development and enhancement of event-based systems has received significant interest. The ability to identify negated events is a key performance element for these systems. We have conducted the first detailed study on the analysis and identification of negated bio-events. Our proposed framework can be integrated with state-of-the-art event extraction systems. The resulting systems will be able to extract bio-events with attached polarities from textual documents, which can serve as the foundation for more elaborate systems that are able to detect mutually contradicting bio-events. BioMed Central 2013-01-16 /pmc/articles/PMC3561152/ /pubmed/23323936 http://dx.doi.org/10.1186/1471-2105-14-14 Text en Copyright ©2013 Nawaz et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Nawaz, Raheel
Thompson, Paul
Ananiadou, Sophia
Negated bio-events: analysis and identification
title Negated bio-events: analysis and identification
title_full Negated bio-events: analysis and identification
title_fullStr Negated bio-events: analysis and identification
title_full_unstemmed Negated bio-events: analysis and identification
title_short Negated bio-events: analysis and identification
title_sort negated bio-events: analysis and identification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3561152/
https://www.ncbi.nlm.nih.gov/pubmed/23323936
http://dx.doi.org/10.1186/1471-2105-14-14
work_keys_str_mv AT nawazraheel negatedbioeventsanalysisandidentification
AT thompsonpaul negatedbioeventsanalysisandidentification
AT ananiadousophia negatedbioeventsanalysisandidentification