Cargando…

Variable number tandem repeats mediate the expression of proximal genes

Variable number tandem repeats (VNTRs) account for significant genetic variation in many organisms. In humans, VNTRs have been implicated in both Mendelian and complex disorders, but are largely ignored by genomic pipelines due to the complexity of genotyping and the computational expense. We descri...

Descripción completa

Detalles Bibliográficos
Autores principales: Bakhtiari, Mehrdad, Park, Jonghun, Ding, Yuan-Chun, Shleizer-Burko, Sharona, Neuhausen, Susan L., Halldórsson, Bjarni V., Stefánsson, Kári, Gymrek, Melissa, Bafna, Vineet
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8024321/
https://www.ncbi.nlm.nih.gov/pubmed/33824302
http://dx.doi.org/10.1038/s41467-021-22206-z
_version_ 1783675289629360128
author Bakhtiari, Mehrdad
Park, Jonghun
Ding, Yuan-Chun
Shleizer-Burko, Sharona
Neuhausen, Susan L.
Halldórsson, Bjarni V.
Stefánsson, Kári
Gymrek, Melissa
Bafna, Vineet
author_facet Bakhtiari, Mehrdad
Park, Jonghun
Ding, Yuan-Chun
Shleizer-Burko, Sharona
Neuhausen, Susan L.
Halldórsson, Bjarni V.
Stefánsson, Kári
Gymrek, Melissa
Bafna, Vineet
author_sort Bakhtiari, Mehrdad
collection PubMed
description Variable number tandem repeats (VNTRs) account for significant genetic variation in many organisms. In humans, VNTRs have been implicated in both Mendelian and complex disorders, but are largely ignored by genomic pipelines due to the complexity of genotyping and the computational expense. We describe adVNTR-NN, a method that uses shallow neural networks to genotype a VNTR in 18 seconds on 55X whole genome data, while maintaining high accuracy. We use adVNTR-NN to genotype 10,264 VNTRs in 652 GTEx individuals. Associating VNTR length with gene expression in 46 tissues, we identify 163 “eVNTRs”. Of the 22 eVNTRs in blood where independent data is available, 21 (95%) are replicated in terms of significance and direction of association. 49% of the eVNTR loci show a strong and likely causal impact on the expression of genes and 80% have maximum effect size at least 0.3. The impacted genes are involved in diseases including Alzheimer’s, obesity and familial cancers, highlighting the importance of VNTRs for understanding the genetic basis of complex diseases.
format Online
Article
Text
id pubmed-8024321
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-80243212021-04-21 Variable number tandem repeats mediate the expression of proximal genes Bakhtiari, Mehrdad Park, Jonghun Ding, Yuan-Chun Shleizer-Burko, Sharona Neuhausen, Susan L. Halldórsson, Bjarni V. Stefánsson, Kári Gymrek, Melissa Bafna, Vineet Nat Commun Article Variable number tandem repeats (VNTRs) account for significant genetic variation in many organisms. In humans, VNTRs have been implicated in both Mendelian and complex disorders, but are largely ignored by genomic pipelines due to the complexity of genotyping and the computational expense. We describe adVNTR-NN, a method that uses shallow neural networks to genotype a VNTR in 18 seconds on 55X whole genome data, while maintaining high accuracy. We use adVNTR-NN to genotype 10,264 VNTRs in 652 GTEx individuals. Associating VNTR length with gene expression in 46 tissues, we identify 163 “eVNTRs”. Of the 22 eVNTRs in blood where independent data is available, 21 (95%) are replicated in terms of significance and direction of association. 49% of the eVNTR loci show a strong and likely causal impact on the expression of genes and 80% have maximum effect size at least 0.3. The impacted genes are involved in diseases including Alzheimer’s, obesity and familial cancers, highlighting the importance of VNTRs for understanding the genetic basis of complex diseases. Nature Publishing Group UK 2021-04-06 /pmc/articles/PMC8024321/ /pubmed/33824302 http://dx.doi.org/10.1038/s41467-021-22206-z Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Bakhtiari, Mehrdad
Park, Jonghun
Ding, Yuan-Chun
Shleizer-Burko, Sharona
Neuhausen, Susan L.
Halldórsson, Bjarni V.
Stefánsson, Kári
Gymrek, Melissa
Bafna, Vineet
Variable number tandem repeats mediate the expression of proximal genes
title Variable number tandem repeats mediate the expression of proximal genes
title_full Variable number tandem repeats mediate the expression of proximal genes
title_fullStr Variable number tandem repeats mediate the expression of proximal genes
title_full_unstemmed Variable number tandem repeats mediate the expression of proximal genes
title_short Variable number tandem repeats mediate the expression of proximal genes
title_sort variable number tandem repeats mediate the expression of proximal genes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8024321/
https://www.ncbi.nlm.nih.gov/pubmed/33824302
http://dx.doi.org/10.1038/s41467-021-22206-z
work_keys_str_mv AT bakhtiarimehrdad variablenumbertandemrepeatsmediatetheexpressionofproximalgenes
AT parkjonghun variablenumbertandemrepeatsmediatetheexpressionofproximalgenes
AT dingyuanchun variablenumbertandemrepeatsmediatetheexpressionofproximalgenes
AT shleizerburkosharona variablenumbertandemrepeatsmediatetheexpressionofproximalgenes
AT neuhausensusanl variablenumbertandemrepeatsmediatetheexpressionofproximalgenes
AT halldorssonbjarniv variablenumbertandemrepeatsmediatetheexpressionofproximalgenes
AT stefanssonkari variablenumbertandemrepeatsmediatetheexpressionofproximalgenes
AT gymrekmelissa variablenumbertandemrepeatsmediatetheexpressionofproximalgenes
AT bafnavineet variablenumbertandemrepeatsmediatetheexpressionofproximalgenes