Cargando…

Mining metabolites: extracting the yeast metabolome from the literature

Text mining methods have added considerably to our capacity to extract biological knowledge from the literature. Recently the field of systems biology has begun to model and simulate metabolic networks, requiring knowledge of the set of molecules involved. While genomics and proteomics technologies...

Descripción completa

Detalles Bibliográficos
Autores principales: Nobata, Chikashi, Dobson, Paul D., Iqbal, Syed A., Mendes, Pedro, Tsujii, Jun’ichi, Kell, Douglas B., Ananiadou, Sophia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3111869/
https://www.ncbi.nlm.nih.gov/pubmed/21687783
http://dx.doi.org/10.1007/s11306-010-0251-6
_version_ 1782205678400569344
author Nobata, Chikashi
Dobson, Paul D.
Iqbal, Syed A.
Mendes, Pedro
Tsujii, Jun’ichi
Kell, Douglas B.
Ananiadou, Sophia
author_facet Nobata, Chikashi
Dobson, Paul D.
Iqbal, Syed A.
Mendes, Pedro
Tsujii, Jun’ichi
Kell, Douglas B.
Ananiadou, Sophia
author_sort Nobata, Chikashi
collection PubMed
description Text mining methods have added considerably to our capacity to extract biological knowledge from the literature. Recently the field of systems biology has begun to model and simulate metabolic networks, requiring knowledge of the set of molecules involved. While genomics and proteomics technologies are able to supply the macromolecular parts list, the metabolites are less easily assembled. Most metabolites are known and reported through the scientific literature, rather than through large-scale experimental surveys. Thus it is important to recover them from the literature. Here we present a novel tool to automatically identify metabolite names in the literature, and associate structures where possible, to define the reported yeast metabolome. With ten-fold cross validation on a manually annotated corpus, our recognition tool generates an f-score of 78.49 (precision of 83.02) and demonstrates greater suitability in identifying metabolite names than other existing recognition tools for general chemical molecules. The metabolite recognition tool has been applied to the literature covering an important model organism, the yeast Saccharomyces cerevisiae, to define its reported metabolome. By coupling to ChemSpider, a major chemical database, we have identified structures for much of the reported metabolome and, where structure identification fails, been able to suggest extensions to ChemSpider. Our manually annotated gold-standard data on 296 abstracts are available as supplementary materials. Metabolite names and, where appropriate, structures are also available as supplementary materials. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-010-0251-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-3111869
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-31118692011-06-15 Mining metabolites: extracting the yeast metabolome from the literature Nobata, Chikashi Dobson, Paul D. Iqbal, Syed A. Mendes, Pedro Tsujii, Jun’ichi Kell, Douglas B. Ananiadou, Sophia Metabolomics Original Article Text mining methods have added considerably to our capacity to extract biological knowledge from the literature. Recently the field of systems biology has begun to model and simulate metabolic networks, requiring knowledge of the set of molecules involved. While genomics and proteomics technologies are able to supply the macromolecular parts list, the metabolites are less easily assembled. Most metabolites are known and reported through the scientific literature, rather than through large-scale experimental surveys. Thus it is important to recover them from the literature. Here we present a novel tool to automatically identify metabolite names in the literature, and associate structures where possible, to define the reported yeast metabolome. With ten-fold cross validation on a manually annotated corpus, our recognition tool generates an f-score of 78.49 (precision of 83.02) and demonstrates greater suitability in identifying metabolite names than other existing recognition tools for general chemical molecules. The metabolite recognition tool has been applied to the literature covering an important model organism, the yeast Saccharomyces cerevisiae, to define its reported metabolome. By coupling to ChemSpider, a major chemical database, we have identified structures for much of the reported metabolome and, where structure identification fails, been able to suggest extensions to ChemSpider. Our manually annotated gold-standard data on 296 abstracts are available as supplementary materials. Metabolite names and, where appropriate, structures are also available as supplementary materials. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-010-0251-6) contains supplementary material, which is available to authorized users. Springer US 2010-10-31 2011 /pmc/articles/PMC3111869/ /pubmed/21687783 http://dx.doi.org/10.1007/s11306-010-0251-6 Text en © The Author(s) 2010 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
spellingShingle Original Article
Nobata, Chikashi
Dobson, Paul D.
Iqbal, Syed A.
Mendes, Pedro
Tsujii, Jun’ichi
Kell, Douglas B.
Ananiadou, Sophia
Mining metabolites: extracting the yeast metabolome from the literature
title Mining metabolites: extracting the yeast metabolome from the literature
title_full Mining metabolites: extracting the yeast metabolome from the literature
title_fullStr Mining metabolites: extracting the yeast metabolome from the literature
title_full_unstemmed Mining metabolites: extracting the yeast metabolome from the literature
title_short Mining metabolites: extracting the yeast metabolome from the literature
title_sort mining metabolites: extracting the yeast metabolome from the literature
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3111869/
https://www.ncbi.nlm.nih.gov/pubmed/21687783
http://dx.doi.org/10.1007/s11306-010-0251-6
work_keys_str_mv AT nobatachikashi miningmetabolitesextractingtheyeastmetabolomefromtheliterature
AT dobsonpauld miningmetabolitesextractingtheyeastmetabolomefromtheliterature
AT iqbalsyeda miningmetabolitesextractingtheyeastmetabolomefromtheliterature
AT mendespedro miningmetabolitesextractingtheyeastmetabolomefromtheliterature
AT tsujiijunichi miningmetabolitesextractingtheyeastmetabolomefromtheliterature
AT kelldouglasb miningmetabolitesextractingtheyeastmetabolomefromtheliterature
AT ananiadousophia miningmetabolitesextractingtheyeastmetabolomefromtheliterature