Cargando…

A method for integrating and ranking the evidence for biochemical pathways by mining reactions from text

Motivation: To create, verify and maintain pathway models, curators must discover and assess knowledge distributed over the vast body of biological literature. Methods supporting these tasks must understand both the pathway model representations and the natural language in the literature. These meth...

Descripción completa

Detalles Bibliográficos
Autores principales: Miwa, Makoto, Ohta, Tomoko, Rak, Rafal, Rowley, Andrew, Kell, Douglas B., Pyysalo, Sampo, Ananiadou, Sophia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694679/
https://www.ncbi.nlm.nih.gov/pubmed/23813008
http://dx.doi.org/10.1093/bioinformatics/btt227
_version_ 1782274887698612224
author Miwa, Makoto
Ohta, Tomoko
Rak, Rafal
Rowley, Andrew
Kell, Douglas B.
Pyysalo, Sampo
Ananiadou, Sophia
author_facet Miwa, Makoto
Ohta, Tomoko
Rak, Rafal
Rowley, Andrew
Kell, Douglas B.
Pyysalo, Sampo
Ananiadou, Sophia
author_sort Miwa, Makoto
collection PubMed
description Motivation: To create, verify and maintain pathway models, curators must discover and assess knowledge distributed over the vast body of biological literature. Methods supporting these tasks must understand both the pathway model representations and the natural language in the literature. These methods should identify and order documents by relevance to any given pathway reaction. No existing system has addressed all aspects of this challenge. Method: We present novel methods for associating pathway model reactions with relevant publications. Our approach extracts the reactions directly from the models and then turns them into queries for three text mining-based MEDLINE literature search systems. These queries are executed, and the resulting documents are combined and ranked according to their relevance to the reactions of interest. We manually annotate document-reaction pairs with the relevance of the document to the reaction and use this annotation to study several ranking methods, using various heuristic and machine-learning approaches. Results: Our evaluation shows that the annotated document-reaction pairs can be used to create a rule-based document ranking system, and that machine learning can be used to rank documents by their relevance to pathway reactions. We find that a Support Vector Machine-based system outperforms several baselines and matches the performance of the rule-based system. The success of the query extraction and ranking methods are used to update our existing pathway search system, PathText. Availability: An online demonstration of PathText 2 and the annotated corpus are available for research purposes at http://www.nactem.ac.uk/pathtext2/. Contact: makoto.miwa@manchester.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3694679
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36946792013-06-27 A method for integrating and ranking the evidence for biochemical pathways by mining reactions from text Miwa, Makoto Ohta, Tomoko Rak, Rafal Rowley, Andrew Kell, Douglas B. Pyysalo, Sampo Ananiadou, Sophia Bioinformatics Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany Motivation: To create, verify and maintain pathway models, curators must discover and assess knowledge distributed over the vast body of biological literature. Methods supporting these tasks must understand both the pathway model representations and the natural language in the literature. These methods should identify and order documents by relevance to any given pathway reaction. No existing system has addressed all aspects of this challenge. Method: We present novel methods for associating pathway model reactions with relevant publications. Our approach extracts the reactions directly from the models and then turns them into queries for three text mining-based MEDLINE literature search systems. These queries are executed, and the resulting documents are combined and ranked according to their relevance to the reactions of interest. We manually annotate document-reaction pairs with the relevance of the document to the reaction and use this annotation to study several ranking methods, using various heuristic and machine-learning approaches. Results: Our evaluation shows that the annotated document-reaction pairs can be used to create a rule-based document ranking system, and that machine learning can be used to rank documents by their relevance to pathway reactions. We find that a Support Vector Machine-based system outperforms several baselines and matches the performance of the rule-based system. The success of the query extraction and ranking methods are used to update our existing pathway search system, PathText. Availability: An online demonstration of PathText 2 and the annotated corpus are available for research purposes at http://www.nactem.ac.uk/pathtext2/. Contact: makoto.miwa@manchester.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-07-01 2013-06-19 /pmc/articles/PMC3694679/ /pubmed/23813008 http://dx.doi.org/10.1093/bioinformatics/btt227 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany
Miwa, Makoto
Ohta, Tomoko
Rak, Rafal
Rowley, Andrew
Kell, Douglas B.
Pyysalo, Sampo
Ananiadou, Sophia
A method for integrating and ranking the evidence for biochemical pathways by mining reactions from text
title A method for integrating and ranking the evidence for biochemical pathways by mining reactions from text
title_full A method for integrating and ranking the evidence for biochemical pathways by mining reactions from text
title_fullStr A method for integrating and ranking the evidence for biochemical pathways by mining reactions from text
title_full_unstemmed A method for integrating and ranking the evidence for biochemical pathways by mining reactions from text
title_short A method for integrating and ranking the evidence for biochemical pathways by mining reactions from text
title_sort method for integrating and ranking the evidence for biochemical pathways by mining reactions from text
topic Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694679/
https://www.ncbi.nlm.nih.gov/pubmed/23813008
http://dx.doi.org/10.1093/bioinformatics/btt227
work_keys_str_mv AT miwamakoto amethodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext
AT ohtatomoko amethodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext
AT rakrafal amethodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext
AT rowleyandrew amethodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext
AT kelldouglasb amethodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext
AT pyysalosampo amethodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext
AT ananiadousophia amethodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext
AT miwamakoto methodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext
AT ohtatomoko methodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext
AT rakrafal methodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext
AT rowleyandrew methodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext
AT kelldouglasb methodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext
AT pyysalosampo methodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext
AT ananiadousophia methodforintegratingandrankingtheevidenceforbiochemicalpathwaysbyminingreactionsfromtext