Cargando…

Uncovering the Relationship between Tissue-Specific TF-DNA Binding and Chromatin Features through a Transformer-Based Model

Chromatin features can reveal tissue-specific TF-DNA binding, which leads to a better understanding of many critical physiological processes. Accurately identifying TF-DNA bindings and constructing their relationships with chromatin features is a long-standing goal in the bioinformatic field. Howeve...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yongqing, Liu, Yuhang, Wang, Zixuan, Wang, Maocheng, Xiong, Shuwen, Huang, Guo, Gong, Meiqin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9690320/
https://www.ncbi.nlm.nih.gov/pubmed/36360189
http://dx.doi.org/10.3390/genes13111952
_version_ 1784836757914648576
author Zhang, Yongqing
Liu, Yuhang
Wang, Zixuan
Wang, Maocheng
Xiong, Shuwen
Huang, Guo
Gong, Meiqin
author_facet Zhang, Yongqing
Liu, Yuhang
Wang, Zixuan
Wang, Maocheng
Xiong, Shuwen
Huang, Guo
Gong, Meiqin
author_sort Zhang, Yongqing
collection PubMed
description Chromatin features can reveal tissue-specific TF-DNA binding, which leads to a better understanding of many critical physiological processes. Accurately identifying TF-DNA bindings and constructing their relationships with chromatin features is a long-standing goal in the bioinformatic field. However, this has remained elusive due to the complex binding mechanisms and heterogeneity among inputs. Here, we have developed the GHTNet (General Hybrid Transformer Network), a transformer-based model to predict TF-DNA binding specificity. The GHTNet decodes the relationship between tissue-specific TF-DNA binding and chromatin features via a specific input scheme of alternative inputs and reveals important gene regions and tissue-specific motifs. Our experiments show that the GHTNet has excellent performance, achieving about a 5% absolute improvement over existing methods. The TF-DNA binding mechanism analysis shows that the importance of TF-DNA binding features varies across tissues. The best predictor is based on the DNA sequence, followed by epigenomics and shape. In addition, cross-species studies address the limited data, thus providing new ideas in this case. Moreover, the GHTNet is applied to interpret the relationship among TFs, chromatin features, and diseases associated with AD46 tissue. This paper demonstrates that the GHTNet is an accurate and robust framework for deciphering tissue-specific TF-DNA binding and interpreting non-coding regions.
format Online
Article
Text
id pubmed-9690320
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96903202022-11-25 Uncovering the Relationship between Tissue-Specific TF-DNA Binding and Chromatin Features through a Transformer-Based Model Zhang, Yongqing Liu, Yuhang Wang, Zixuan Wang, Maocheng Xiong, Shuwen Huang, Guo Gong, Meiqin Genes (Basel) Article Chromatin features can reveal tissue-specific TF-DNA binding, which leads to a better understanding of many critical physiological processes. Accurately identifying TF-DNA bindings and constructing their relationships with chromatin features is a long-standing goal in the bioinformatic field. However, this has remained elusive due to the complex binding mechanisms and heterogeneity among inputs. Here, we have developed the GHTNet (General Hybrid Transformer Network), a transformer-based model to predict TF-DNA binding specificity. The GHTNet decodes the relationship between tissue-specific TF-DNA binding and chromatin features via a specific input scheme of alternative inputs and reveals important gene regions and tissue-specific motifs. Our experiments show that the GHTNet has excellent performance, achieving about a 5% absolute improvement over existing methods. The TF-DNA binding mechanism analysis shows that the importance of TF-DNA binding features varies across tissues. The best predictor is based on the DNA sequence, followed by epigenomics and shape. In addition, cross-species studies address the limited data, thus providing new ideas in this case. Moreover, the GHTNet is applied to interpret the relationship among TFs, chromatin features, and diseases associated with AD46 tissue. This paper demonstrates that the GHTNet is an accurate and robust framework for deciphering tissue-specific TF-DNA binding and interpreting non-coding regions. MDPI 2022-10-26 /pmc/articles/PMC9690320/ /pubmed/36360189 http://dx.doi.org/10.3390/genes13111952 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhang, Yongqing
Liu, Yuhang
Wang, Zixuan
Wang, Maocheng
Xiong, Shuwen
Huang, Guo
Gong, Meiqin
Uncovering the Relationship between Tissue-Specific TF-DNA Binding and Chromatin Features through a Transformer-Based Model
title Uncovering the Relationship between Tissue-Specific TF-DNA Binding and Chromatin Features through a Transformer-Based Model
title_full Uncovering the Relationship between Tissue-Specific TF-DNA Binding and Chromatin Features through a Transformer-Based Model
title_fullStr Uncovering the Relationship between Tissue-Specific TF-DNA Binding and Chromatin Features through a Transformer-Based Model
title_full_unstemmed Uncovering the Relationship between Tissue-Specific TF-DNA Binding and Chromatin Features through a Transformer-Based Model
title_short Uncovering the Relationship between Tissue-Specific TF-DNA Binding and Chromatin Features through a Transformer-Based Model
title_sort uncovering the relationship between tissue-specific tf-dna binding and chromatin features through a transformer-based model
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9690320/
https://www.ncbi.nlm.nih.gov/pubmed/36360189
http://dx.doi.org/10.3390/genes13111952
work_keys_str_mv AT zhangyongqing uncoveringtherelationshipbetweentissuespecifictfdnabindingandchromatinfeaturesthroughatransformerbasedmodel
AT liuyuhang uncoveringtherelationshipbetweentissuespecifictfdnabindingandchromatinfeaturesthroughatransformerbasedmodel
AT wangzixuan uncoveringtherelationshipbetweentissuespecifictfdnabindingandchromatinfeaturesthroughatransformerbasedmodel
AT wangmaocheng uncoveringtherelationshipbetweentissuespecifictfdnabindingandchromatinfeaturesthroughatransformerbasedmodel
AT xiongshuwen uncoveringtherelationshipbetweentissuespecifictfdnabindingandchromatinfeaturesthroughatransformerbasedmodel
AT huangguo uncoveringtherelationshipbetweentissuespecifictfdnabindingandchromatinfeaturesthroughatransformerbasedmodel
AT gongmeiqin uncoveringtherelationshipbetweentissuespecifictfdnabindingandchromatinfeaturesthroughatransformerbasedmodel