Cargando…

Using uncertainty to link and rank evidence from biomedical literature for model curation

MOTIVATION: In recent years, there has been great progress in the field of automated curation of biomedical networks and models, aided by text mining methods that provide evidence from literature. Such methods must not only extract snippets of text that relate to model interactions, but also be able...

Descripción completa

Detalles Bibliográficos
Autores principales: Zerva, Chrysoula, Batista-Navarro, Riza, Day, Philip, Ananiadou, Sophia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860317/
https://www.ncbi.nlm.nih.gov/pubmed/29036627
http://dx.doi.org/10.1093/bioinformatics/btx466
_version_ 1783307965961338880
author Zerva, Chrysoula
Batista-Navarro, Riza
Day, Philip
Ananiadou, Sophia
author_facet Zerva, Chrysoula
Batista-Navarro, Riza
Day, Philip
Ananiadou, Sophia
author_sort Zerva, Chrysoula
collection PubMed
description MOTIVATION: In recent years, there has been great progress in the field of automated curation of biomedical networks and models, aided by text mining methods that provide evidence from literature. Such methods must not only extract snippets of text that relate to model interactions, but also be able to contextualize the evidence and provide additional confidence scores for the interaction in question. Although various approaches calculating confidence scores have focused primarily on the quality of the extracted information, there has been little work on exploring the textual uncertainty conveyed by the author. Despite textual uncertainty being acknowledged in biomedical text mining as an attribute of text mined interactions (events), it is significantly understudied as a means of providing a confidence measure for interactions in pathways or other biomedical models. In this work, we focus on improving identification of textual uncertainty for events and explore how it can be used as an additional measure of confidence for biomedical models. RESULTS: We present a novel method for extracting uncertainty from the literature using a hybrid approach that combines rule induction and machine learning. Variations of this hybrid approach are then discussed, alongside their advantages and disadvantages. We use subjective logic theory to combine multiple uncertainty values extracted from different sources for the same interaction. Our approach achieves F-scores of 0.76 and 0.88 based on the BioNLP-ST and Genia-MK corpora, respectively, making considerable improvements over previously published work. Moreover, we evaluate our proposed system on pathways related to two different areas, namely leukemia and melanoma cancer research. AVAILABILITY AND IMPLEMENTATION: The leukemia pathway model used is available in Pathway Studio while the Ras model is available via PathwayCommons. Online demonstration of the uncertainty extraction system is available for research purposes at http://argo.nactem.ac.uk/test. The related code is available on https://github.com/c-zrv/uncertainty_components.git. Details on the above are available in the Supplementary Material. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5860317
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58603172018-03-21 Using uncertainty to link and rank evidence from biomedical literature for model curation Zerva, Chrysoula Batista-Navarro, Riza Day, Philip Ananiadou, Sophia Bioinformatics Original Papers MOTIVATION: In recent years, there has been great progress in the field of automated curation of biomedical networks and models, aided by text mining methods that provide evidence from literature. Such methods must not only extract snippets of text that relate to model interactions, but also be able to contextualize the evidence and provide additional confidence scores for the interaction in question. Although various approaches calculating confidence scores have focused primarily on the quality of the extracted information, there has been little work on exploring the textual uncertainty conveyed by the author. Despite textual uncertainty being acknowledged in biomedical text mining as an attribute of text mined interactions (events), it is significantly understudied as a means of providing a confidence measure for interactions in pathways or other biomedical models. In this work, we focus on improving identification of textual uncertainty for events and explore how it can be used as an additional measure of confidence for biomedical models. RESULTS: We present a novel method for extracting uncertainty from the literature using a hybrid approach that combines rule induction and machine learning. Variations of this hybrid approach are then discussed, alongside their advantages and disadvantages. We use subjective logic theory to combine multiple uncertainty values extracted from different sources for the same interaction. Our approach achieves F-scores of 0.76 and 0.88 based on the BioNLP-ST and Genia-MK corpora, respectively, making considerable improvements over previously published work. Moreover, we evaluate our proposed system on pathways related to two different areas, namely leukemia and melanoma cancer research. AVAILABILITY AND IMPLEMENTATION: The leukemia pathway model used is available in Pathway Studio while the Ras model is available via PathwayCommons. Online demonstration of the uncertainty extraction system is available for research purposes at http://argo.nactem.ac.uk/test. The related code is available on https://github.com/c-zrv/uncertainty_components.git. Details on the above are available in the Supplementary Material. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-12-01 2017-07-24 /pmc/articles/PMC5860317/ /pubmed/29036627 http://dx.doi.org/10.1093/bioinformatics/btx466 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Zerva, Chrysoula
Batista-Navarro, Riza
Day, Philip
Ananiadou, Sophia
Using uncertainty to link and rank evidence from biomedical literature for model curation
title Using uncertainty to link and rank evidence from biomedical literature for model curation
title_full Using uncertainty to link and rank evidence from biomedical literature for model curation
title_fullStr Using uncertainty to link and rank evidence from biomedical literature for model curation
title_full_unstemmed Using uncertainty to link and rank evidence from biomedical literature for model curation
title_short Using uncertainty to link and rank evidence from biomedical literature for model curation
title_sort using uncertainty to link and rank evidence from biomedical literature for model curation
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860317/
https://www.ncbi.nlm.nih.gov/pubmed/29036627
http://dx.doi.org/10.1093/bioinformatics/btx466
work_keys_str_mv AT zervachrysoula usinguncertaintytolinkandrankevidencefrombiomedicalliteratureformodelcuration
AT batistanavarroriza usinguncertaintytolinkandrankevidencefrombiomedicalliteratureformodelcuration
AT dayphilip usinguncertaintytolinkandrankevidencefrombiomedicalliteratureformodelcuration
AT ananiadousophia usinguncertaintytolinkandrankevidencefrombiomedicalliteratureformodelcuration