Cargando…

An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits

With mounting availability of genomic and phenotypic databases, data integration and mining become increasingly challenging. While efforts have been put forward to analyze prokaryotic phenotypes, current computational technologies either lack high throughput capacity for genomic scale analysis, or a...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yang, Li, Jianrong, Sam, Lee, Goh, Chern-Sing, Gerstein, Mark, Lussier, Yves A
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1636675/
https://www.ncbi.nlm.nih.gov/pubmed/17112314
http://dx.doi.org/10.1371/journal.pcbi.0020159
_version_ 1782130772136689664
author Liu, Yang
Li, Jianrong
Sam, Lee
Goh, Chern-Sing
Gerstein, Mark
Lussier, Yves A
author_facet Liu, Yang
Li, Jianrong
Sam, Lee
Goh, Chern-Sing
Gerstein, Mark
Lussier, Yves A
author_sort Liu, Yang
collection PubMed
description With mounting availability of genomic and phenotypic databases, data integration and mining become increasingly challenging. While efforts have been put forward to analyze prokaryotic phenotypes, current computational technologies either lack high throughput capacity for genomic scale analysis, or are limited in their capability to integrate and mine data across different scales of biology. Consequently, simultaneous analysis of associations among genomes, phenotypes, and gene functions is prohibited. Here, we developed a high throughput computational approach, and demonstrated for the first time the feasibility of integrating large quantities of prokaryotic phenotypes along with genomic datasets for mining across multiple scales of biology (protein domains, pathways, molecular functions, and cellular processes). Applying this method over 59 fully sequenced prokaryotic species, we identified genetic basis and molecular mechanisms underlying the phenotypes in bacteria. We identified 3,711 significant correlations between 1,499 distinct Pfam and 63 phenotypes, with 2,650 correlations and 1,061 anti-correlations. Manual evaluation of a random sample of these significant correlations showed a minimal precision of 30% (95% confidence interval: 20%–42%; n = 50). We stratified the most significant 478 predictions and subjected 100 to manual evaluation, of which 60 were corroborated in the literature. We furthermore unveiled 10 significant correlations between phenotypes and KEGG pathways, eight of which were corroborated in the evaluation, and 309 significant correlations between phenotypes and 166 GO concepts evaluated using a random sample (minimal precision = 72%; 95% confidence interval: 60%–80%; n = 50). Additionally, we conducted a novel large-scale phenomic visualization analysis to provide insight into the modular nature of common molecular mechanisms spanning multiple biological scales and reused by related phenotypes (metaphenotypes). We propose that this method elucidates which classes of molecular mechanisms are associated with phenotypes or metaphenotypes and holds promise in facilitating a computable systems biology approach to genomic and biomedical research.
format Text
id pubmed-1636675
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-16366752006-11-17 An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits Liu, Yang Li, Jianrong Sam, Lee Goh, Chern-Sing Gerstein, Mark Lussier, Yves A PLoS Comput Biol Research Article With mounting availability of genomic and phenotypic databases, data integration and mining become increasingly challenging. While efforts have been put forward to analyze prokaryotic phenotypes, current computational technologies either lack high throughput capacity for genomic scale analysis, or are limited in their capability to integrate and mine data across different scales of biology. Consequently, simultaneous analysis of associations among genomes, phenotypes, and gene functions is prohibited. Here, we developed a high throughput computational approach, and demonstrated for the first time the feasibility of integrating large quantities of prokaryotic phenotypes along with genomic datasets for mining across multiple scales of biology (protein domains, pathways, molecular functions, and cellular processes). Applying this method over 59 fully sequenced prokaryotic species, we identified genetic basis and molecular mechanisms underlying the phenotypes in bacteria. We identified 3,711 significant correlations between 1,499 distinct Pfam and 63 phenotypes, with 2,650 correlations and 1,061 anti-correlations. Manual evaluation of a random sample of these significant correlations showed a minimal precision of 30% (95% confidence interval: 20%–42%; n = 50). We stratified the most significant 478 predictions and subjected 100 to manual evaluation, of which 60 were corroborated in the literature. We furthermore unveiled 10 significant correlations between phenotypes and KEGG pathways, eight of which were corroborated in the evaluation, and 309 significant correlations between phenotypes and 166 GO concepts evaluated using a random sample (minimal precision = 72%; 95% confidence interval: 60%–80%; n = 50). Additionally, we conducted a novel large-scale phenomic visualization analysis to provide insight into the modular nature of common molecular mechanisms spanning multiple biological scales and reused by related phenotypes (metaphenotypes). We propose that this method elucidates which classes of molecular mechanisms are associated with phenotypes or metaphenotypes and holds promise in facilitating a computable systems biology approach to genomic and biomedical research. Public Library of Science 2006-11 2006-11-17 /pmc/articles/PMC1636675/ /pubmed/17112314 http://dx.doi.org/10.1371/journal.pcbi.0020159 Text en © 2006 Liu et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Liu, Yang
Li, Jianrong
Sam, Lee
Goh, Chern-Sing
Gerstein, Mark
Lussier, Yves A
An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits
title An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits
title_full An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits
title_fullStr An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits
title_full_unstemmed An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits
title_short An Integrative Genomic Approach to Uncover Molecular Mechanisms of Prokaryotic Traits
title_sort integrative genomic approach to uncover molecular mechanisms of prokaryotic traits
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1636675/
https://www.ncbi.nlm.nih.gov/pubmed/17112314
http://dx.doi.org/10.1371/journal.pcbi.0020159
work_keys_str_mv AT liuyang anintegrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits
AT lijianrong anintegrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits
AT samlee anintegrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits
AT gohchernsing anintegrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits
AT gersteinmark anintegrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits
AT lussieryvesa anintegrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits
AT liuyang integrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits
AT lijianrong integrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits
AT samlee integrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits
AT gohchernsing integrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits
AT gersteinmark integrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits
AT lussieryvesa integrativegenomicapproachtouncovermolecularmechanismsofprokaryotictraits