Cargando…

ChIP-GSM: Inferring active transcription factor modules to predict functional regulatory elements

Transcription factors (TFs) often function as a module including both master factors and mediators binding at cis-regulatory regions to modulate nearby gene transcription. ChIP-seq profiling of multiple TFs makes it feasible to infer functional TF modules. However, when inferring TF modules based on...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Xi, Neuwald, Andrew F., Hilakivi-Clarke, Leena, Clarke, Robert, Xuan, Jianhua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8330942/
https://www.ncbi.nlm.nih.gov/pubmed/34292930
http://dx.doi.org/10.1371/journal.pcbi.1009203
_version_ 1783732829611360256
author Chen, Xi
Neuwald, Andrew F.
Hilakivi-Clarke, Leena
Clarke, Robert
Xuan, Jianhua
author_facet Chen, Xi
Neuwald, Andrew F.
Hilakivi-Clarke, Leena
Clarke, Robert
Xuan, Jianhua
author_sort Chen, Xi
collection PubMed
description Transcription factors (TFs) often function as a module including both master factors and mediators binding at cis-regulatory regions to modulate nearby gene transcription. ChIP-seq profiling of multiple TFs makes it feasible to infer functional TF modules. However, when inferring TF modules based on co-localization of ChIP-seq peaks, often many weak binding events are missed, especially for mediators, resulting in incomplete identification of modules. To address this problem, we develop a ChIP-seq data-driven Gibbs Sampler to infer Modules (ChIP-GSM) using a Bayesian framework that integrates ChIP-seq profiles of multiple TFs. ChIP-GSM samples read counts of module TFs iteratively to estimate the binding potential of a module to each region and, across all regions, estimates the module abundance. Using inferred module-region probabilistic bindings as feature units, ChIP-GSM then employs logistic regression to predict active regulatory elements. Validation of ChIP-GSM predicted regulatory regions on multiple independent datasets sharing the same context confirms the advantage of using TF modules for predicting regulatory activity. In a case study of K562 cells, we demonstrate that the ChIP-GSM inferred modules form as groups, activate gene expression at different time points, and mediate diverse functional cellular processes. Hence, ChIP-GSM infers biologically meaningful TF modules and improves the prediction accuracy of regulatory region activities.
format Online
Article
Text
id pubmed-8330942
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83309422021-08-04 ChIP-GSM: Inferring active transcription factor modules to predict functional regulatory elements Chen, Xi Neuwald, Andrew F. Hilakivi-Clarke, Leena Clarke, Robert Xuan, Jianhua PLoS Comput Biol Research Article Transcription factors (TFs) often function as a module including both master factors and mediators binding at cis-regulatory regions to modulate nearby gene transcription. ChIP-seq profiling of multiple TFs makes it feasible to infer functional TF modules. However, when inferring TF modules based on co-localization of ChIP-seq peaks, often many weak binding events are missed, especially for mediators, resulting in incomplete identification of modules. To address this problem, we develop a ChIP-seq data-driven Gibbs Sampler to infer Modules (ChIP-GSM) using a Bayesian framework that integrates ChIP-seq profiles of multiple TFs. ChIP-GSM samples read counts of module TFs iteratively to estimate the binding potential of a module to each region and, across all regions, estimates the module abundance. Using inferred module-region probabilistic bindings as feature units, ChIP-GSM then employs logistic regression to predict active regulatory elements. Validation of ChIP-GSM predicted regulatory regions on multiple independent datasets sharing the same context confirms the advantage of using TF modules for predicting regulatory activity. In a case study of K562 cells, we demonstrate that the ChIP-GSM inferred modules form as groups, activate gene expression at different time points, and mediate diverse functional cellular processes. Hence, ChIP-GSM infers biologically meaningful TF modules and improves the prediction accuracy of regulatory region activities. Public Library of Science 2021-07-22 /pmc/articles/PMC8330942/ /pubmed/34292930 http://dx.doi.org/10.1371/journal.pcbi.1009203 Text en © 2021 Chen et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Chen, Xi
Neuwald, Andrew F.
Hilakivi-Clarke, Leena
Clarke, Robert
Xuan, Jianhua
ChIP-GSM: Inferring active transcription factor modules to predict functional regulatory elements
title ChIP-GSM: Inferring active transcription factor modules to predict functional regulatory elements
title_full ChIP-GSM: Inferring active transcription factor modules to predict functional regulatory elements
title_fullStr ChIP-GSM: Inferring active transcription factor modules to predict functional regulatory elements
title_full_unstemmed ChIP-GSM: Inferring active transcription factor modules to predict functional regulatory elements
title_short ChIP-GSM: Inferring active transcription factor modules to predict functional regulatory elements
title_sort chip-gsm: inferring active transcription factor modules to predict functional regulatory elements
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8330942/
https://www.ncbi.nlm.nih.gov/pubmed/34292930
http://dx.doi.org/10.1371/journal.pcbi.1009203
work_keys_str_mv AT chenxi chipgsminferringactivetranscriptionfactormodulestopredictfunctionalregulatoryelements
AT neuwaldandrewf chipgsminferringactivetranscriptionfactormodulestopredictfunctionalregulatoryelements
AT hilakiviclarkeleena chipgsminferringactivetranscriptionfactormodulestopredictfunctionalregulatoryelements
AT clarkerobert chipgsminferringactivetranscriptionfactormodulestopredictfunctionalregulatoryelements
AT xuanjianhua chipgsminferringactivetranscriptionfactormodulestopredictfunctionalregulatoryelements