Cargando…

Evidence of widespread, independent sequence signature for transcription factor cobinding

Transcription factors (TFs) are the vocabulary that genomes use to regulate gene expression and phenotypes. The interactions among TFs enrich this vocabulary and orchestrate diverse biological processes. Although simple models identify open chromatin and the presence of TF motifs as the two major co...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Manqi, Li, Hongyang, Wang, Xueqing, Guan, Yuanfang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7849410/
https://www.ncbi.nlm.nih.gov/pubmed/33303494
http://dx.doi.org/10.1101/gr.267310.120
_version_ 1783645294938816512
author Zhou, Manqi
Li, Hongyang
Wang, Xueqing
Guan, Yuanfang
author_facet Zhou, Manqi
Li, Hongyang
Wang, Xueqing
Guan, Yuanfang
author_sort Zhou, Manqi
collection PubMed
description Transcription factors (TFs) are the vocabulary that genomes use to regulate gene expression and phenotypes. The interactions among TFs enrich this vocabulary and orchestrate diverse biological processes. Although simple models identify open chromatin and the presence of TF motifs as the two major contributors to TF binding patterns, it remains elusive what contributes to the in vivo TF cobinding landscape. In this study, we developed a machine learning algorithm to explore the contributors of the cobinding patterns. The algorithm substantially outperforms the state-of-the-field models for TF cobinding prediction. Game theory–based feature importance analysis reveals that, for most of the TF pairs we studied, independent motif sequences contribute one or more of the two TFs under investigation to their cobinding patterns. Such independent motif sequences include, but are not limited to, transcription initiation–related proteins and known TF complexes. We found the motif sequence signatures and the TFs are rarely mutual, corroborating a hierarchical and directional organization of the regulatory network and refuting the possibility of artifacts caused by shared sequence similarity with the TFs under investigation. We modeled such regulatory language with directed graphs, which reveal shared, global factors that are related to many binding and cobinding patterns.
format Online
Article
Text
id pubmed-7849410
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-78494102021-08-01 Evidence of widespread, independent sequence signature for transcription factor cobinding Zhou, Manqi Li, Hongyang Wang, Xueqing Guan, Yuanfang Genome Res Method Transcription factors (TFs) are the vocabulary that genomes use to regulate gene expression and phenotypes. The interactions among TFs enrich this vocabulary and orchestrate diverse biological processes. Although simple models identify open chromatin and the presence of TF motifs as the two major contributors to TF binding patterns, it remains elusive what contributes to the in vivo TF cobinding landscape. In this study, we developed a machine learning algorithm to explore the contributors of the cobinding patterns. The algorithm substantially outperforms the state-of-the-field models for TF cobinding prediction. Game theory–based feature importance analysis reveals that, for most of the TF pairs we studied, independent motif sequences contribute one or more of the two TFs under investigation to their cobinding patterns. Such independent motif sequences include, but are not limited to, transcription initiation–related proteins and known TF complexes. We found the motif sequence signatures and the TFs are rarely mutual, corroborating a hierarchical and directional organization of the regulatory network and refuting the possibility of artifacts caused by shared sequence similarity with the TFs under investigation. We modeled such regulatory language with directed graphs, which reveal shared, global factors that are related to many binding and cobinding patterns. Cold Spring Harbor Laboratory Press 2021-02 /pmc/articles/PMC7849410/ /pubmed/33303494 http://dx.doi.org/10.1101/gr.267310.120 Text en © 2021 Zhou et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Zhou, Manqi
Li, Hongyang
Wang, Xueqing
Guan, Yuanfang
Evidence of widespread, independent sequence signature for transcription factor cobinding
title Evidence of widespread, independent sequence signature for transcription factor cobinding
title_full Evidence of widespread, independent sequence signature for transcription factor cobinding
title_fullStr Evidence of widespread, independent sequence signature for transcription factor cobinding
title_full_unstemmed Evidence of widespread, independent sequence signature for transcription factor cobinding
title_short Evidence of widespread, independent sequence signature for transcription factor cobinding
title_sort evidence of widespread, independent sequence signature for transcription factor cobinding
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7849410/
https://www.ncbi.nlm.nih.gov/pubmed/33303494
http://dx.doi.org/10.1101/gr.267310.120
work_keys_str_mv AT zhoumanqi evidenceofwidespreadindependentsequencesignaturefortranscriptionfactorcobinding
AT lihongyang evidenceofwidespreadindependentsequencesignaturefortranscriptionfactorcobinding
AT wangxueqing evidenceofwidespreadindependentsequencesignaturefortranscriptionfactorcobinding
AT guanyuanfang evidenceofwidespreadindependentsequencesignaturefortranscriptionfactorcobinding