Cargando…
Evidence of widespread, independent sequence signature for transcription factor cobinding
Transcription factors (TFs) are the vocabulary that genomes use to regulate gene expression and phenotypes. The interactions among TFs enrich this vocabulary and orchestrate diverse biological processes. Although simple models identify open chromatin and the presence of TF motifs as the two major co...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7849410/ https://www.ncbi.nlm.nih.gov/pubmed/33303494 http://dx.doi.org/10.1101/gr.267310.120 |
_version_ | 1783645294938816512 |
---|---|
author | Zhou, Manqi Li, Hongyang Wang, Xueqing Guan, Yuanfang |
author_facet | Zhou, Manqi Li, Hongyang Wang, Xueqing Guan, Yuanfang |
author_sort | Zhou, Manqi |
collection | PubMed |
description | Transcription factors (TFs) are the vocabulary that genomes use to regulate gene expression and phenotypes. The interactions among TFs enrich this vocabulary and orchestrate diverse biological processes. Although simple models identify open chromatin and the presence of TF motifs as the two major contributors to TF binding patterns, it remains elusive what contributes to the in vivo TF cobinding landscape. In this study, we developed a machine learning algorithm to explore the contributors of the cobinding patterns. The algorithm substantially outperforms the state-of-the-field models for TF cobinding prediction. Game theory–based feature importance analysis reveals that, for most of the TF pairs we studied, independent motif sequences contribute one or more of the two TFs under investigation to their cobinding patterns. Such independent motif sequences include, but are not limited to, transcription initiation–related proteins and known TF complexes. We found the motif sequence signatures and the TFs are rarely mutual, corroborating a hierarchical and directional organization of the regulatory network and refuting the possibility of artifacts caused by shared sequence similarity with the TFs under investigation. We modeled such regulatory language with directed graphs, which reveal shared, global factors that are related to many binding and cobinding patterns. |
format | Online Article Text |
id | pubmed-7849410 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-78494102021-08-01 Evidence of widespread, independent sequence signature for transcription factor cobinding Zhou, Manqi Li, Hongyang Wang, Xueqing Guan, Yuanfang Genome Res Method Transcription factors (TFs) are the vocabulary that genomes use to regulate gene expression and phenotypes. The interactions among TFs enrich this vocabulary and orchestrate diverse biological processes. Although simple models identify open chromatin and the presence of TF motifs as the two major contributors to TF binding patterns, it remains elusive what contributes to the in vivo TF cobinding landscape. In this study, we developed a machine learning algorithm to explore the contributors of the cobinding patterns. The algorithm substantially outperforms the state-of-the-field models for TF cobinding prediction. Game theory–based feature importance analysis reveals that, for most of the TF pairs we studied, independent motif sequences contribute one or more of the two TFs under investigation to their cobinding patterns. Such independent motif sequences include, but are not limited to, transcription initiation–related proteins and known TF complexes. We found the motif sequence signatures and the TFs are rarely mutual, corroborating a hierarchical and directional organization of the regulatory network and refuting the possibility of artifacts caused by shared sequence similarity with the TFs under investigation. We modeled such regulatory language with directed graphs, which reveal shared, global factors that are related to many binding and cobinding patterns. Cold Spring Harbor Laboratory Press 2021-02 /pmc/articles/PMC7849410/ /pubmed/33303494 http://dx.doi.org/10.1101/gr.267310.120 Text en © 2021 Zhou et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/. |
spellingShingle | Method Zhou, Manqi Li, Hongyang Wang, Xueqing Guan, Yuanfang Evidence of widespread, independent sequence signature for transcription factor cobinding |
title | Evidence of widespread, independent sequence signature for transcription factor cobinding |
title_full | Evidence of widespread, independent sequence signature for transcription factor cobinding |
title_fullStr | Evidence of widespread, independent sequence signature for transcription factor cobinding |
title_full_unstemmed | Evidence of widespread, independent sequence signature for transcription factor cobinding |
title_short | Evidence of widespread, independent sequence signature for transcription factor cobinding |
title_sort | evidence of widespread, independent sequence signature for transcription factor cobinding |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7849410/ https://www.ncbi.nlm.nih.gov/pubmed/33303494 http://dx.doi.org/10.1101/gr.267310.120 |
work_keys_str_mv | AT zhoumanqi evidenceofwidespreadindependentsequencesignaturefortranscriptionfactorcobinding AT lihongyang evidenceofwidespreadindependentsequencesignaturefortranscriptionfactorcobinding AT wangxueqing evidenceofwidespreadindependentsequencesignaturefortranscriptionfactorcobinding AT guanyuanfang evidenceofwidespreadindependentsequencesignaturefortranscriptionfactorcobinding |