Cargando…
A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data
BACKGROUND: Metagenomics enables the analysis of bacterial population composition and the study of emergent population features, such as shared metabolic pathways. Recently, we have shown that metagenomics datasets can be leveraged to characterize population-wide transcriptional regulatory networks,...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4938975/ https://www.ncbi.nlm.nih.gov/pubmed/27398089 http://dx.doi.org/10.1186/s13015-016-0082-8 |
_version_ | 1782441937225121792 |
---|---|
author | Hobbs, Elizabeth T. Pereira, Talmo O’Neill, Patrick K. Erill, Ivan |
author_facet | Hobbs, Elizabeth T. Pereira, Talmo O’Neill, Patrick K. Erill, Ivan |
author_sort | Hobbs, Elizabeth T. |
collection | PubMed |
description | BACKGROUND: Metagenomics enables the analysis of bacterial population composition and the study of emergent population features, such as shared metabolic pathways. Recently, we have shown that metagenomics datasets can be leveraged to characterize population-wide transcriptional regulatory networks, or meta-regulons, providing insights into how bacterial populations respond collectively to specific triggers. Here we formalize a Bayesian inference framework to analyze the composition of transcriptional regulatory networks in metagenomes by determining the probability of regulation of orthologous gene sequences. We assess the performance of this approach on synthetic datasets and we validate it by analyzing the copper-homeostasis network of Firmicutes species in the human gut microbiome. RESULTS: Assessment on synthetic datasets shows that our method provides a robust and interpretable metric for assessing putative regulation by a transcription factor on sets of promoter sequences mapping to an orthologous gene cluster. The inference framework integrates the regulatory contribution of secondary sites and can discern false positives arising from multiple instances of a clonal sequence. Posterior probabilities for orthologous gene clusters decline sharply when less than 20 % of mapped promoters have binding sites, but we introduce a sensitivity adjustment procedure to speed up computation that enhances regulation assessment in heterogeneous ortholog clusters. Analysis of the copper-homeostasis regulon governed by CsoR in the human gut microbiome Firmicutes reveals that CsoR controls itself and copper-translocating P-type ATPases, but not CopZ-type copper chaperones. Our analysis also indicates that CsoR frequently targets promoters with dual CsoR-binding sites, suggesting that it exploits higher-order binding conformations to fine-tune its activity. CONCLUSIONS: We introduce and validate a method for the analysis of transcriptional regulatory networks from metagenomic data that enables inference of meta-regulons in a systematic and interpretable way. Validation of this method on the CsoR meta-regulon of gut microbiome Firmicutes illustrates the usefulness of the approach, revealing novel properties of the copper-homeostasis network in poorly characterized bacterial species and putting forward evidence of new mechanisms of DNA binding for this transcriptional regulator. Our approach will enable the comparative analysis of regulatory networks across metagenomes, yielding novel insights into the evolution of transcriptional regulatory networks. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13015-016-0082-8) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4938975 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-49389752016-07-10 A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data Hobbs, Elizabeth T. Pereira, Talmo O’Neill, Patrick K. Erill, Ivan Algorithms Mol Biol Research BACKGROUND: Metagenomics enables the analysis of bacterial population composition and the study of emergent population features, such as shared metabolic pathways. Recently, we have shown that metagenomics datasets can be leveraged to characterize population-wide transcriptional regulatory networks, or meta-regulons, providing insights into how bacterial populations respond collectively to specific triggers. Here we formalize a Bayesian inference framework to analyze the composition of transcriptional regulatory networks in metagenomes by determining the probability of regulation of orthologous gene sequences. We assess the performance of this approach on synthetic datasets and we validate it by analyzing the copper-homeostasis network of Firmicutes species in the human gut microbiome. RESULTS: Assessment on synthetic datasets shows that our method provides a robust and interpretable metric for assessing putative regulation by a transcription factor on sets of promoter sequences mapping to an orthologous gene cluster. The inference framework integrates the regulatory contribution of secondary sites and can discern false positives arising from multiple instances of a clonal sequence. Posterior probabilities for orthologous gene clusters decline sharply when less than 20 % of mapped promoters have binding sites, but we introduce a sensitivity adjustment procedure to speed up computation that enhances regulation assessment in heterogeneous ortholog clusters. Analysis of the copper-homeostasis regulon governed by CsoR in the human gut microbiome Firmicutes reveals that CsoR controls itself and copper-translocating P-type ATPases, but not CopZ-type copper chaperones. Our analysis also indicates that CsoR frequently targets promoters with dual CsoR-binding sites, suggesting that it exploits higher-order binding conformations to fine-tune its activity. CONCLUSIONS: We introduce and validate a method for the analysis of transcriptional regulatory networks from metagenomic data that enables inference of meta-regulons in a systematic and interpretable way. Validation of this method on the CsoR meta-regulon of gut microbiome Firmicutes illustrates the usefulness of the approach, revealing novel properties of the copper-homeostasis network in poorly characterized bacterial species and putting forward evidence of new mechanisms of DNA binding for this transcriptional regulator. Our approach will enable the comparative analysis of regulatory networks across metagenomes, yielding novel insights into the evolution of transcriptional regulatory networks. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13015-016-0082-8) contains supplementary material, which is available to authorized users. BioMed Central 2016-07-08 /pmc/articles/PMC4938975/ /pubmed/27398089 http://dx.doi.org/10.1186/s13015-016-0082-8 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Hobbs, Elizabeth T. Pereira, Talmo O’Neill, Patrick K. Erill, Ivan A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data |
title | A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data |
title_full | A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data |
title_fullStr | A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data |
title_full_unstemmed | A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data |
title_short | A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data |
title_sort | bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4938975/ https://www.ncbi.nlm.nih.gov/pubmed/27398089 http://dx.doi.org/10.1186/s13015-016-0082-8 |
work_keys_str_mv | AT hobbselizabetht abayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata AT pereiratalmo abayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata AT oneillpatrickk abayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata AT erillivan abayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata AT hobbselizabetht bayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata AT pereiratalmo bayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata AT oneillpatrickk bayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata AT erillivan bayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata |