Cargando…
A literature-based similarity metric for biological processes
BACKGROUND: Recent analyses in systems biology pursue the discovery of functional modules within the cell. Recognition of such modules requires the integrative analysis of genome-wide experimental data together with available functional schemes. In this line, methods to bridge the gap between the ab...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1579237/ https://www.ncbi.nlm.nih.gov/pubmed/16872502 http://dx.doi.org/10.1186/1471-2105-7-363 |
_version_ | 1782130317198360576 |
---|---|
author | Chagoyen, Monica Carmona-Saez, Pedro Gil, Concha Carazo, Jose M Pascual-Montano, Alberto |
author_facet | Chagoyen, Monica Carmona-Saez, Pedro Gil, Concha Carazo, Jose M Pascual-Montano, Alberto |
author_sort | Chagoyen, Monica |
collection | PubMed |
description | BACKGROUND: Recent analyses in systems biology pursue the discovery of functional modules within the cell. Recognition of such modules requires the integrative analysis of genome-wide experimental data together with available functional schemes. In this line, methods to bridge the gap between the abstract definitions of cellular processes in current schemes and the interlinked nature of biological networks are required. RESULTS: This work explores the use of the scientific literature to establish potential relationships among cellular processes. To this end we haveused a document based similarity method to compute pair-wise similarities of the biological processes described in the Gene Ontology (GO). The method has been applied to the biological processes annotated for the Saccharomyces cerevisiae genome. We compared our results with similarities obtained with two ontology-based metrics, as well as with gene product annotation relationships. We show that the literature-based metric conserves most direct ontological relationships, while reveals biologically sounded similarities that are not obtained using ontology-based metrics and/or genome annotation. CONCLUSION: The scientific literature is a valuable source of information from which to compute similarities among biological processes. The associations discovered by literature analysis are a valuable complement to those encoded in existing functional schemes, and those that arise by genome annotation. These similarities can be used to conveniently map the interlinked structure of cellular processes in a particular organism. |
format | Text |
id | pubmed-1579237 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-15792372006-10-02 A literature-based similarity metric for biological processes Chagoyen, Monica Carmona-Saez, Pedro Gil, Concha Carazo, Jose M Pascual-Montano, Alberto BMC Bioinformatics Methodology Article BACKGROUND: Recent analyses in systems biology pursue the discovery of functional modules within the cell. Recognition of such modules requires the integrative analysis of genome-wide experimental data together with available functional schemes. In this line, methods to bridge the gap between the abstract definitions of cellular processes in current schemes and the interlinked nature of biological networks are required. RESULTS: This work explores the use of the scientific literature to establish potential relationships among cellular processes. To this end we haveused a document based similarity method to compute pair-wise similarities of the biological processes described in the Gene Ontology (GO). The method has been applied to the biological processes annotated for the Saccharomyces cerevisiae genome. We compared our results with similarities obtained with two ontology-based metrics, as well as with gene product annotation relationships. We show that the literature-based metric conserves most direct ontological relationships, while reveals biologically sounded similarities that are not obtained using ontology-based metrics and/or genome annotation. CONCLUSION: The scientific literature is a valuable source of information from which to compute similarities among biological processes. The associations discovered by literature analysis are a valuable complement to those encoded in existing functional schemes, and those that arise by genome annotation. These similarities can be used to conveniently map the interlinked structure of cellular processes in a particular organism. BioMed Central 2006-07-26 /pmc/articles/PMC1579237/ /pubmed/16872502 http://dx.doi.org/10.1186/1471-2105-7-363 Text en Copyright © 2006 Chagoyen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Chagoyen, Monica Carmona-Saez, Pedro Gil, Concha Carazo, Jose M Pascual-Montano, Alberto A literature-based similarity metric for biological processes |
title | A literature-based similarity metric for biological processes |
title_full | A literature-based similarity metric for biological processes |
title_fullStr | A literature-based similarity metric for biological processes |
title_full_unstemmed | A literature-based similarity metric for biological processes |
title_short | A literature-based similarity metric for biological processes |
title_sort | literature-based similarity metric for biological processes |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1579237/ https://www.ncbi.nlm.nih.gov/pubmed/16872502 http://dx.doi.org/10.1186/1471-2105-7-363 |
work_keys_str_mv | AT chagoyenmonica aliteraturebasedsimilaritymetricforbiologicalprocesses AT carmonasaezpedro aliteraturebasedsimilaritymetricforbiologicalprocesses AT gilconcha aliteraturebasedsimilaritymetricforbiologicalprocesses AT carazojosem aliteraturebasedsimilaritymetricforbiologicalprocesses AT pascualmontanoalberto aliteraturebasedsimilaritymetricforbiologicalprocesses AT chagoyenmonica literaturebasedsimilaritymetricforbiologicalprocesses AT carmonasaezpedro literaturebasedsimilaritymetricforbiologicalprocesses AT gilconcha literaturebasedsimilaritymetricforbiologicalprocesses AT carazojosem literaturebasedsimilaritymetricforbiologicalprocesses AT pascualmontanoalberto literaturebasedsimilaritymetricforbiologicalprocesses |