Cargando…

Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity

The majority of the single nucleotide variants (SNVs) identified by genome-wide association studies (GWAS) fall outside of the protein-coding regions. Elucidating the functional implications of these variants has been a major challenge. A possible mechanism for functional non-coding variants is that...

Descripción completa

Detalles Bibliográficos
Autores principales: Jin, Yutong, Jiang, Jiahui, Wang, Ruixuan, Qin, Zhaohui S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8458901/
https://www.ncbi.nlm.nih.gov/pubmed/34567058
http://dx.doi.org/10.3389/fgene.2021.667866
_version_ 1784571401810739200
author Jin, Yutong
Jiang, Jiahui
Wang, Ruixuan
Qin, Zhaohui S.
author_facet Jin, Yutong
Jiang, Jiahui
Wang, Ruixuan
Qin, Zhaohui S.
author_sort Jin, Yutong
collection PubMed
description The majority of the single nucleotide variants (SNVs) identified by genome-wide association studies (GWAS) fall outside of the protein-coding regions. Elucidating the functional implications of these variants has been a major challenge. A possible mechanism for functional non-coding variants is that they disrupted the canonical transcription factor (TF) binding sites that affect the in vivo binding of the TF. However, their impact varies since many positions within a TF binding motif are not well conserved. Therefore, simply annotating all variants located in putative TF binding sites may overestimate the functional impact of these SNVs. We conducted a comprehensive survey to study the effect of SNVs on the TF binding affinity. A sequence-based machine learning method was used to estimate the change in binding affinity for each SNV located inside a putative motif site. From the results obtained on 18 TF binding motifs, we found that there is a substantial variation in terms of a SNV’s impact on TF binding affinity. We found that only about 20% of SNVs located inside putative TF binding sites would likely to have significant impact on the TF-DNA binding.
format Online
Article
Text
id pubmed-8458901
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-84589012021-09-24 Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity Jin, Yutong Jiang, Jiahui Wang, Ruixuan Qin, Zhaohui S. Front Genet Genetics The majority of the single nucleotide variants (SNVs) identified by genome-wide association studies (GWAS) fall outside of the protein-coding regions. Elucidating the functional implications of these variants has been a major challenge. A possible mechanism for functional non-coding variants is that they disrupted the canonical transcription factor (TF) binding sites that affect the in vivo binding of the TF. However, their impact varies since many positions within a TF binding motif are not well conserved. Therefore, simply annotating all variants located in putative TF binding sites may overestimate the functional impact of these SNVs. We conducted a comprehensive survey to study the effect of SNVs on the TF binding affinity. A sequence-based machine learning method was used to estimate the change in binding affinity for each SNV located inside a putative motif site. From the results obtained on 18 TF binding motifs, we found that there is a substantial variation in terms of a SNV’s impact on TF binding affinity. We found that only about 20% of SNVs located inside putative TF binding sites would likely to have significant impact on the TF-DNA binding. Frontiers Media S.A. 2021-09-09 /pmc/articles/PMC8458901/ /pubmed/34567058 http://dx.doi.org/10.3389/fgene.2021.667866 Text en Copyright © 2021 Jin, Jiang, Wang and Qin. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Jin, Yutong
Jiang, Jiahui
Wang, Ruixuan
Qin, Zhaohui S.
Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity
title Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity
title_full Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity
title_fullStr Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity
title_full_unstemmed Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity
title_short Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity
title_sort systematic evaluation of dna sequence variations on in vivo transcription factor binding affinity
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8458901/
https://www.ncbi.nlm.nih.gov/pubmed/34567058
http://dx.doi.org/10.3389/fgene.2021.667866
work_keys_str_mv AT jinyutong systematicevaluationofdnasequencevariationsoninvivotranscriptionfactorbindingaffinity
AT jiangjiahui systematicevaluationofdnasequencevariationsoninvivotranscriptionfactorbindingaffinity
AT wangruixuan systematicevaluationofdnasequencevariationsoninvivotranscriptionfactorbindingaffinity
AT qinzhaohuis systematicevaluationofdnasequencevariationsoninvivotranscriptionfactorbindingaffinity