Cargando…

A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data

BACKGROUND: Metagenomics enables the analysis of bacterial population composition and the study of emergent population features, such as shared metabolic pathways. Recently, we have shown that metagenomics datasets can be leveraged to characterize population-wide transcriptional regulatory networks,...

Descripción completa

Detalles Bibliográficos
Autores principales: Hobbs, Elizabeth T., Pereira, Talmo, O’Neill, Patrick K., Erill, Ivan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4938975/
https://www.ncbi.nlm.nih.gov/pubmed/27398089
http://dx.doi.org/10.1186/s13015-016-0082-8
_version_ 1782441937225121792
author Hobbs, Elizabeth T.
Pereira, Talmo
O’Neill, Patrick K.
Erill, Ivan
author_facet Hobbs, Elizabeth T.
Pereira, Talmo
O’Neill, Patrick K.
Erill, Ivan
author_sort Hobbs, Elizabeth T.
collection PubMed
description BACKGROUND: Metagenomics enables the analysis of bacterial population composition and the study of emergent population features, such as shared metabolic pathways. Recently, we have shown that metagenomics datasets can be leveraged to characterize population-wide transcriptional regulatory networks, or meta-regulons, providing insights into how bacterial populations respond collectively to specific triggers. Here we formalize a Bayesian inference framework to analyze the composition of transcriptional regulatory networks in metagenomes by determining the probability of regulation of orthologous gene sequences. We assess the performance of this approach on synthetic datasets and we validate it by analyzing the copper-homeostasis network of Firmicutes species in the human gut microbiome. RESULTS: Assessment on synthetic datasets shows that our method provides a robust and interpretable metric for assessing putative regulation by a transcription factor on sets of promoter sequences mapping to an orthologous gene cluster. The inference framework integrates the regulatory contribution of secondary sites and can discern false positives arising from multiple instances of a clonal sequence. Posterior probabilities for orthologous gene clusters decline sharply when less than 20 % of mapped promoters have binding sites, but we introduce a sensitivity adjustment procedure to speed up computation that enhances regulation assessment in heterogeneous ortholog clusters. Analysis of the copper-homeostasis regulon governed by CsoR in the human gut microbiome Firmicutes reveals that CsoR controls itself and copper-translocating P-type ATPases, but not CopZ-type copper chaperones. Our analysis also indicates that CsoR frequently targets promoters with dual CsoR-binding sites, suggesting that it exploits higher-order binding conformations to fine-tune its activity. CONCLUSIONS: We introduce and validate a method for the analysis of transcriptional regulatory networks from metagenomic data that enables inference of meta-regulons in a systematic and interpretable way. Validation of this method on the CsoR meta-regulon of gut microbiome Firmicutes illustrates the usefulness of the approach, revealing novel properties of the copper-homeostasis network in poorly characterized bacterial species and putting forward evidence of new mechanisms of DNA binding for this transcriptional regulator. Our approach will enable the comparative analysis of regulatory networks across metagenomes, yielding novel insights into the evolution of transcriptional regulatory networks. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13015-016-0082-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4938975
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49389752016-07-10 A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data Hobbs, Elizabeth T. Pereira, Talmo O’Neill, Patrick K. Erill, Ivan Algorithms Mol Biol Research BACKGROUND: Metagenomics enables the analysis of bacterial population composition and the study of emergent population features, such as shared metabolic pathways. Recently, we have shown that metagenomics datasets can be leveraged to characterize population-wide transcriptional regulatory networks, or meta-regulons, providing insights into how bacterial populations respond collectively to specific triggers. Here we formalize a Bayesian inference framework to analyze the composition of transcriptional regulatory networks in metagenomes by determining the probability of regulation of orthologous gene sequences. We assess the performance of this approach on synthetic datasets and we validate it by analyzing the copper-homeostasis network of Firmicutes species in the human gut microbiome. RESULTS: Assessment on synthetic datasets shows that our method provides a robust and interpretable metric for assessing putative regulation by a transcription factor on sets of promoter sequences mapping to an orthologous gene cluster. The inference framework integrates the regulatory contribution of secondary sites and can discern false positives arising from multiple instances of a clonal sequence. Posterior probabilities for orthologous gene clusters decline sharply when less than 20 % of mapped promoters have binding sites, but we introduce a sensitivity adjustment procedure to speed up computation that enhances regulation assessment in heterogeneous ortholog clusters. Analysis of the copper-homeostasis regulon governed by CsoR in the human gut microbiome Firmicutes reveals that CsoR controls itself and copper-translocating P-type ATPases, but not CopZ-type copper chaperones. Our analysis also indicates that CsoR frequently targets promoters with dual CsoR-binding sites, suggesting that it exploits higher-order binding conformations to fine-tune its activity. CONCLUSIONS: We introduce and validate a method for the analysis of transcriptional regulatory networks from metagenomic data that enables inference of meta-regulons in a systematic and interpretable way. Validation of this method on the CsoR meta-regulon of gut microbiome Firmicutes illustrates the usefulness of the approach, revealing novel properties of the copper-homeostasis network in poorly characterized bacterial species and putting forward evidence of new mechanisms of DNA binding for this transcriptional regulator. Our approach will enable the comparative analysis of regulatory networks across metagenomes, yielding novel insights into the evolution of transcriptional regulatory networks. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13015-016-0082-8) contains supplementary material, which is available to authorized users. BioMed Central 2016-07-08 /pmc/articles/PMC4938975/ /pubmed/27398089 http://dx.doi.org/10.1186/s13015-016-0082-8 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Hobbs, Elizabeth T.
Pereira, Talmo
O’Neill, Patrick K.
Erill, Ivan
A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data
title A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data
title_full A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data
title_fullStr A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data
title_full_unstemmed A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data
title_short A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data
title_sort bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4938975/
https://www.ncbi.nlm.nih.gov/pubmed/27398089
http://dx.doi.org/10.1186/s13015-016-0082-8
work_keys_str_mv AT hobbselizabetht abayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata
AT pereiratalmo abayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata
AT oneillpatrickk abayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata
AT erillivan abayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata
AT hobbselizabetht bayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata
AT pereiratalmo bayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata
AT oneillpatrickk bayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata
AT erillivan bayesianinferencemethodfortheanalysisoftranscriptionalregulatorynetworksinmetagenomicdata