Cargando…

A literature-based similarity metric for biological processes

BACKGROUND: Recent analyses in systems biology pursue the discovery of functional modules within the cell. Recognition of such modules requires the integrative analysis of genome-wide experimental data together with available functional schemes. In this line, methods to bridge the gap between the ab...

Descripción completa

Detalles Bibliográficos
Autores principales: Chagoyen, Monica, Carmona-Saez, Pedro, Gil, Concha, Carazo, Jose M, Pascual-Montano, Alberto
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1579237/
https://www.ncbi.nlm.nih.gov/pubmed/16872502
http://dx.doi.org/10.1186/1471-2105-7-363
_version_ 1782130317198360576
author Chagoyen, Monica
Carmona-Saez, Pedro
Gil, Concha
Carazo, Jose M
Pascual-Montano, Alberto
author_facet Chagoyen, Monica
Carmona-Saez, Pedro
Gil, Concha
Carazo, Jose M
Pascual-Montano, Alberto
author_sort Chagoyen, Monica
collection PubMed
description BACKGROUND: Recent analyses in systems biology pursue the discovery of functional modules within the cell. Recognition of such modules requires the integrative analysis of genome-wide experimental data together with available functional schemes. In this line, methods to bridge the gap between the abstract definitions of cellular processes in current schemes and the interlinked nature of biological networks are required. RESULTS: This work explores the use of the scientific literature to establish potential relationships among cellular processes. To this end we haveused a document based similarity method to compute pair-wise similarities of the biological processes described in the Gene Ontology (GO). The method has been applied to the biological processes annotated for the Saccharomyces cerevisiae genome. We compared our results with similarities obtained with two ontology-based metrics, as well as with gene product annotation relationships. We show that the literature-based metric conserves most direct ontological relationships, while reveals biologically sounded similarities that are not obtained using ontology-based metrics and/or genome annotation. CONCLUSION: The scientific literature is a valuable source of information from which to compute similarities among biological processes. The associations discovered by literature analysis are a valuable complement to those encoded in existing functional schemes, and those that arise by genome annotation. These similarities can be used to conveniently map the interlinked structure of cellular processes in a particular organism.
format Text
id pubmed-1579237
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15792372006-10-02 A literature-based similarity metric for biological processes Chagoyen, Monica Carmona-Saez, Pedro Gil, Concha Carazo, Jose M Pascual-Montano, Alberto BMC Bioinformatics Methodology Article BACKGROUND: Recent analyses in systems biology pursue the discovery of functional modules within the cell. Recognition of such modules requires the integrative analysis of genome-wide experimental data together with available functional schemes. In this line, methods to bridge the gap between the abstract definitions of cellular processes in current schemes and the interlinked nature of biological networks are required. RESULTS: This work explores the use of the scientific literature to establish potential relationships among cellular processes. To this end we haveused a document based similarity method to compute pair-wise similarities of the biological processes described in the Gene Ontology (GO). The method has been applied to the biological processes annotated for the Saccharomyces cerevisiae genome. We compared our results with similarities obtained with two ontology-based metrics, as well as with gene product annotation relationships. We show that the literature-based metric conserves most direct ontological relationships, while reveals biologically sounded similarities that are not obtained using ontology-based metrics and/or genome annotation. CONCLUSION: The scientific literature is a valuable source of information from which to compute similarities among biological processes. The associations discovered by literature analysis are a valuable complement to those encoded in existing functional schemes, and those that arise by genome annotation. These similarities can be used to conveniently map the interlinked structure of cellular processes in a particular organism. BioMed Central 2006-07-26 /pmc/articles/PMC1579237/ /pubmed/16872502 http://dx.doi.org/10.1186/1471-2105-7-363 Text en Copyright © 2006 Chagoyen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Chagoyen, Monica
Carmona-Saez, Pedro
Gil, Concha
Carazo, Jose M
Pascual-Montano, Alberto
A literature-based similarity metric for biological processes
title A literature-based similarity metric for biological processes
title_full A literature-based similarity metric for biological processes
title_fullStr A literature-based similarity metric for biological processes
title_full_unstemmed A literature-based similarity metric for biological processes
title_short A literature-based similarity metric for biological processes
title_sort literature-based similarity metric for biological processes
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1579237/
https://www.ncbi.nlm.nih.gov/pubmed/16872502
http://dx.doi.org/10.1186/1471-2105-7-363
work_keys_str_mv AT chagoyenmonica aliteraturebasedsimilaritymetricforbiologicalprocesses
AT carmonasaezpedro aliteraturebasedsimilaritymetricforbiologicalprocesses
AT gilconcha aliteraturebasedsimilaritymetricforbiologicalprocesses
AT carazojosem aliteraturebasedsimilaritymetricforbiologicalprocesses
AT pascualmontanoalberto aliteraturebasedsimilaritymetricforbiologicalprocesses
AT chagoyenmonica literaturebasedsimilaritymetricforbiologicalprocesses
AT carmonasaezpedro literaturebasedsimilaritymetricforbiologicalprocesses
AT gilconcha literaturebasedsimilaritymetricforbiologicalprocesses
AT carazojosem literaturebasedsimilaritymetricforbiologicalprocesses
AT pascualmontanoalberto literaturebasedsimilaritymetricforbiologicalprocesses