Cargando…
Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity
The majority of the single nucleotide variants (SNVs) identified by genome-wide association studies (GWAS) fall outside of the protein-coding regions. Elucidating the functional implications of these variants has been a major challenge. A possible mechanism for functional non-coding variants is that...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8458901/ https://www.ncbi.nlm.nih.gov/pubmed/34567058 http://dx.doi.org/10.3389/fgene.2021.667866 |
_version_ | 1784571401810739200 |
---|---|
author | Jin, Yutong Jiang, Jiahui Wang, Ruixuan Qin, Zhaohui S. |
author_facet | Jin, Yutong Jiang, Jiahui Wang, Ruixuan Qin, Zhaohui S. |
author_sort | Jin, Yutong |
collection | PubMed |
description | The majority of the single nucleotide variants (SNVs) identified by genome-wide association studies (GWAS) fall outside of the protein-coding regions. Elucidating the functional implications of these variants has been a major challenge. A possible mechanism for functional non-coding variants is that they disrupted the canonical transcription factor (TF) binding sites that affect the in vivo binding of the TF. However, their impact varies since many positions within a TF binding motif are not well conserved. Therefore, simply annotating all variants located in putative TF binding sites may overestimate the functional impact of these SNVs. We conducted a comprehensive survey to study the effect of SNVs on the TF binding affinity. A sequence-based machine learning method was used to estimate the change in binding affinity for each SNV located inside a putative motif site. From the results obtained on 18 TF binding motifs, we found that there is a substantial variation in terms of a SNV’s impact on TF binding affinity. We found that only about 20% of SNVs located inside putative TF binding sites would likely to have significant impact on the TF-DNA binding. |
format | Online Article Text |
id | pubmed-8458901 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-84589012021-09-24 Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity Jin, Yutong Jiang, Jiahui Wang, Ruixuan Qin, Zhaohui S. Front Genet Genetics The majority of the single nucleotide variants (SNVs) identified by genome-wide association studies (GWAS) fall outside of the protein-coding regions. Elucidating the functional implications of these variants has been a major challenge. A possible mechanism for functional non-coding variants is that they disrupted the canonical transcription factor (TF) binding sites that affect the in vivo binding of the TF. However, their impact varies since many positions within a TF binding motif are not well conserved. Therefore, simply annotating all variants located in putative TF binding sites may overestimate the functional impact of these SNVs. We conducted a comprehensive survey to study the effect of SNVs on the TF binding affinity. A sequence-based machine learning method was used to estimate the change in binding affinity for each SNV located inside a putative motif site. From the results obtained on 18 TF binding motifs, we found that there is a substantial variation in terms of a SNV’s impact on TF binding affinity. We found that only about 20% of SNVs located inside putative TF binding sites would likely to have significant impact on the TF-DNA binding. Frontiers Media S.A. 2021-09-09 /pmc/articles/PMC8458901/ /pubmed/34567058 http://dx.doi.org/10.3389/fgene.2021.667866 Text en Copyright © 2021 Jin, Jiang, Wang and Qin. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Jin, Yutong Jiang, Jiahui Wang, Ruixuan Qin, Zhaohui S. Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity |
title | Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity |
title_full | Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity |
title_fullStr | Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity |
title_full_unstemmed | Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity |
title_short | Systematic Evaluation of DNA Sequence Variations on in vivo Transcription Factor Binding Affinity |
title_sort | systematic evaluation of dna sequence variations on in vivo transcription factor binding affinity |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8458901/ https://www.ncbi.nlm.nih.gov/pubmed/34567058 http://dx.doi.org/10.3389/fgene.2021.667866 |
work_keys_str_mv | AT jinyutong systematicevaluationofdnasequencevariationsoninvivotranscriptionfactorbindingaffinity AT jiangjiahui systematicevaluationofdnasequencevariationsoninvivotranscriptionfactorbindingaffinity AT wangruixuan systematicevaluationofdnasequencevariationsoninvivotranscriptionfactorbindingaffinity AT qinzhaohuis systematicevaluationofdnasequencevariationsoninvivotranscriptionfactorbindingaffinity |