Cargando…
An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits
With mounting availability of genomic and phenotypic databases, data integration and mining become increasingly challenging. While efforts have been put forward to analyze prokaryotic phenotypes, current computational technologies either lack high throughput capacity for genomic scale analysis, or a...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1636675/ https://www.ncbi.nlm.nih.gov/pubmed/17112314 http://dx.doi.org/10.1371/journal.pcbi.0020159 |
_version_ | 1782130772136689664 |
---|---|
author | Liu, Yang Li, Jianrong Sam, Lee Goh, Chern-Sing Gerstein, Mark Lussier, Yves A |
author_facet | Liu, Yang Li, Jianrong Sam, Lee Goh, Chern-Sing Gerstein, Mark Lussier, Yves A |
author_sort | Liu, Yang |
collection | PubMed |
description | With mounting availability of genomic and phenotypic databases, data integration and mining become increasingly challenging. While efforts have been put forward to analyze prokaryotic phenotypes, current computational technologies either lack high throughput capacity for genomic scale analysis, or are limited in their capability to integrate and mine data across different scales of biology. Consequently, simultaneous analysis of associations among genomes, phenotypes, and gene functions is prohibited. Here, we developed a high throughput computational approach, and demonstrated for the first time the feasibility of integrating large quantities of prokaryotic phenotypes along with genomic datasets for mining across multiple scales of biology (protein domains, pathways, molecular functions, and cellular processes). Applying this method over 59 fully sequenced prokaryotic species, we identified genetic basis and molecular mechanisms underlying the phenotypes in bacteria. We identified 3,711 significant correlations between 1,499 distinct Pfam and 63 phenotypes, with 2,650 correlations and 1,061 anti-correlations. Manual evaluation of a random sample of these significant correlations showed a minimal precision of 30% (95% confidence interval: 20%–42%; n = 50). We stratified the most significant 478 predictions and subjected 100 to manual evaluation, of which 60 were corroborated in the literature. We furthermore unveiled 10 significant correlations between phenotypes and KEGG pathways, eight of which were corroborated in the evaluation, and 309 significant correlations between phenotypes and 166 GO concepts evaluated using a random sample (minimal precision = 72%; 95% confidence interval: 60%–80%; n = 50). Additionally, we conducted a novel large-scale phenomic visualization analysis to provide insight into the modular nature of common molecular mechanisms spanning multiple biological scales and reused by related phenotypes (metaphenotypes). We propose that this method elucidates which classes of molecular mechanisms are associated with phenotypes or metaphenotypes and holds promise in facilitating a computable systems biology approach to genomic and biomedical research. |
format | Text |
id | pubmed-1636675 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-16366752006-11-17 An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits Liu, Yang Li, Jianrong Sam, Lee Goh, Chern-Sing Gerstein, Mark Lussier, Yves A PLoS Comput Biol Research Article With mounting availability of genomic and phenotypic databases, data integration and mining become increasingly challenging. While efforts have been put forward to analyze prokaryotic phenotypes, current computational technologies either lack high throughput capacity for genomic scale analysis, or are limited in their capability to integrate and mine data across different scales of biology. Consequently, simultaneous analysis of associations among genomes, phenotypes, and gene functions is prohibited. Here, we developed a high throughput computational approach, and demonstrated for the first time the feasibility of integrating large quantities of prokaryotic phenotypes along with genomic datasets for mining across multiple scales of biology (protein domains, pathways, molecular functions, and cellular processes). Applying this method over 59 fully sequenced prokaryotic species, we identified genetic basis and molecular mechanisms underlying the phenotypes in bacteria. We identified 3,711 significant correlations between 1,499 distinct Pfam and 63 phenotypes, with 2,650 correlations and 1,061 anti-correlations. Manual evaluation of a random sample of these significant correlations showed a minimal precision of 30% (95% confidence interval: 20%–42%; n = 50). We stratified the most significant 478 predictions and subjected 100 to manual evaluation, of which 60 were corroborated in the literature. We furthermore unveiled 10 significant correlations between phenotypes and KEGG pathways, eight of which were corroborated in the evaluation, and 309 significant correlations between phenotypes and 166 GO concepts evaluated using a random sample (minimal precision = 72%; 95% confidence interval: 60%–80%; n = 50). Additionally, we conducted a novel large-scale phenomic visualization analysis to provide insight into the modular nature of common molecular mechanisms spanning multiple biological scales and reused by related phenotypes (metaphenotypes). We propose that this method elucidates which classes of molecular mechanisms are associated with phenotypes or metaphenotypes and holds promise in facilitating a computable systems biology approach to genomic and biomedical research. Public Library of Science 2006-11 2006-11-17 /pmc/articles/PMC1636675/ /pubmed/17112314 http://dx.doi.org/10.1371/journal.pcbi.0020159 Text en © 2006 Liu et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Liu, Yang Li, Jianrong Sam, Lee Goh, Chern-Sing Gerstein, Mark Lussier, Yves A An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits |
title | An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits |
title_full | An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits |
title_fullStr | An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits |
title_full_unstemmed | An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits |
title_short | An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits |
title_sort | integrative genomic approach to uncover molecular mechanisms of prokaryotic traits |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1636675/ https://www.ncbi.nlm.nih.gov/pubmed/17112314 http://dx.doi.org/10.1371/journal.pcbi.0020159 |
work_keys_str_mv | AT liuyang anintegrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits AT lijianrong anintegrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits AT samlee anintegrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits AT gohchernsing anintegrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits AT gersteinmark anintegrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits AT lussieryvesa anintegrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits AT liuyang integrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits AT lijianrong integrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits AT samlee integrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits AT gohchernsing integrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits AT gersteinmark integrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits AT lussieryvesa integrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits |