Cargando…

Detecting modification of biomedical events using a deep parsing approach

BACKGROUND: This work describes a system for identifying event mentions in bio-molecular research abstracts that are either speculative (e.g. analysis of IkappaBalpha phosphorylation, where it is not specified whether phosphorylation did or did not occur) or negated (e.g. inhibition of IkappaBalpha...

Descripción completa

Detalles Bibliográficos
Autores principales: MacKinlay, Andrew, Martinez, David, Baldwin, Timothy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3339397/
https://www.ncbi.nlm.nih.gov/pubmed/22595089
http://dx.doi.org/10.1186/1472-6947-12-S1-S4
_version_ 1782231350855598080
author MacKinlay, Andrew
Martinez, David
Baldwin, Timothy
author_facet MacKinlay, Andrew
Martinez, David
Baldwin, Timothy
author_sort MacKinlay, Andrew
collection PubMed
description BACKGROUND: This work describes a system for identifying event mentions in bio-molecular research abstracts that are either speculative (e.g. analysis of IkappaBalpha phosphorylation, where it is not specified whether phosphorylation did or did not occur) or negated (e.g. inhibition of IkappaBalpha phosphorylation, where phosphorylation did not occur). The data comes from a standard dataset created for the BioNLP 2009 Shared Task. The system uses a machine-learning approach, where the features used for classification are a combination of shallow features derived from the words of the sentences and more complex features based on the semantic outputs produced by a deep parser. METHOD: To detect event modification, we use a Maximum Entropy learner with features extracted from the data relative to the trigger words of the events. The shallow features are bag-of-words features based on a small sliding context window of 3-4 tokens on either side of the trigger word. The deep parser features are derived from parses produced by the English Resource Grammar and the RASP parser. The outputs of these parsers are converted into the Minimal Recursion Semantics formalism, and from this, we extract features motivated by linguistics and the data itself. All of these features are combined to create training or test data for the machine learning algorithm. RESULTS: Over the test data, our methods produce approximately a 4% absolute increase in F-score for detection of event modification compared to a baseline based only on the shallow bag-of-words features. CONCLUSIONS: Our results indicate that grammar-based techniques can enhance the accuracy of methods for detecting event modification.
format Online
Article
Text
id pubmed-3339397
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33393972012-05-02 Detecting modification of biomedical events using a deep parsing approach MacKinlay, Andrew Martinez, David Baldwin, Timothy BMC Med Inform Decis Mak Proceedings BACKGROUND: This work describes a system for identifying event mentions in bio-molecular research abstracts that are either speculative (e.g. analysis of IkappaBalpha phosphorylation, where it is not specified whether phosphorylation did or did not occur) or negated (e.g. inhibition of IkappaBalpha phosphorylation, where phosphorylation did not occur). The data comes from a standard dataset created for the BioNLP 2009 Shared Task. The system uses a machine-learning approach, where the features used for classification are a combination of shallow features derived from the words of the sentences and more complex features based on the semantic outputs produced by a deep parser. METHOD: To detect event modification, we use a Maximum Entropy learner with features extracted from the data relative to the trigger words of the events. The shallow features are bag-of-words features based on a small sliding context window of 3-4 tokens on either side of the trigger word. The deep parser features are derived from parses produced by the English Resource Grammar and the RASP parser. The outputs of these parsers are converted into the Minimal Recursion Semantics formalism, and from this, we extract features motivated by linguistics and the data itself. All of these features are combined to create training or test data for the machine learning algorithm. RESULTS: Over the test data, our methods produce approximately a 4% absolute increase in F-score for detection of event modification compared to a baseline based only on the shallow bag-of-words features. CONCLUSIONS: Our results indicate that grammar-based techniques can enhance the accuracy of methods for detecting event modification. BioMed Central 2012-04-30 /pmc/articles/PMC3339397/ /pubmed/22595089 http://dx.doi.org/10.1186/1472-6947-12-S1-S4 Text en Copyright ©2012 MacKinlay et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
MacKinlay, Andrew
Martinez, David
Baldwin, Timothy
Detecting modification of biomedical events using a deep parsing approach
title Detecting modification of biomedical events using a deep parsing approach
title_full Detecting modification of biomedical events using a deep parsing approach
title_fullStr Detecting modification of biomedical events using a deep parsing approach
title_full_unstemmed Detecting modification of biomedical events using a deep parsing approach
title_short Detecting modification of biomedical events using a deep parsing approach
title_sort detecting modification of biomedical events using a deep parsing approach
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3339397/
https://www.ncbi.nlm.nih.gov/pubmed/22595089
http://dx.doi.org/10.1186/1472-6947-12-S1-S4
work_keys_str_mv AT mackinlayandrew detectingmodificationofbiomedicaleventsusingadeepparsingapproach
AT martinezdavid detectingmodificationofbiomedicaleventsusingadeepparsingapproach
AT baldwintimothy detectingmodificationofbiomedicaleventsusingadeepparsingapproach