Cargando…

Characterizing nucleotide variation and expansion dynamics in human-specific variable number tandem repeats

There are more than 55,000 variable number tandem repeats (VNTRs) in the human genome, notable for both their striking polymorphism and mutability. Despite their role in human evolution and genomic variation, they have yet to be studied collectively and in detail, partially owing to their large size...

Descripción completa

Detalles Bibliográficos
Autores principales: Course, Meredith M., Sulovari, Arvis, Gudsnuk, Kathryn, Eichler, Evan E., Valdmanis, Paul N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8327921/
https://www.ncbi.nlm.nih.gov/pubmed/34244228
http://dx.doi.org/10.1101/gr.275560.121
_version_ 1783732201607659520
author Course, Meredith M.
Sulovari, Arvis
Gudsnuk, Kathryn
Eichler, Evan E.
Valdmanis, Paul N.
author_facet Course, Meredith M.
Sulovari, Arvis
Gudsnuk, Kathryn
Eichler, Evan E.
Valdmanis, Paul N.
author_sort Course, Meredith M.
collection PubMed
description There are more than 55,000 variable number tandem repeats (VNTRs) in the human genome, notable for both their striking polymorphism and mutability. Despite their role in human evolution and genomic variation, they have yet to be studied collectively and in detail, partially owing to their large size, variability, and predominant location in noncoding regions. Here, we examine 467 VNTRs that are human-specific expansions, unique to one location in the genome, and not associated with retrotransposons. We leverage publicly available long-read genomes, including from the Human Genome Structural Variant Consortium, to ascertain the exact nucleotide composition of these VNTRs and compare their composition of alleles. We then confirm repeat unit composition in more than 3000 short-read samples from the 1000 Genomes Project. Our analysis reveals that these VNTRs contain highly structured repeat motif organization, modified by frequent deletion and duplication events. Although overall VNTR compositions tend to remain similar between 1000 Genomes Project superpopulations, we describe a notable exception with substantial differences in repeat composition (in PCBP3), as well as several VNTRs that are significantly different in length between superpopulations (in ART1, PROP1, DYNC2I1, and LOC102723906). We also observe that most of these VNTRs are expanded in archaic human genomes, yet remain stable in length between single generations. Collectively, our findings indicate that repeat motif variability, repeat composition, and repeat length are all informative modalities to consider when characterizing VNTRs and their contribution to genomic variation.
format Online
Article
Text
id pubmed-8327921
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-83279212022-02-01 Characterizing nucleotide variation and expansion dynamics in human-specific variable number tandem repeats Course, Meredith M. Sulovari, Arvis Gudsnuk, Kathryn Eichler, Evan E. Valdmanis, Paul N. Genome Res Research There are more than 55,000 variable number tandem repeats (VNTRs) in the human genome, notable for both their striking polymorphism and mutability. Despite their role in human evolution and genomic variation, they have yet to be studied collectively and in detail, partially owing to their large size, variability, and predominant location in noncoding regions. Here, we examine 467 VNTRs that are human-specific expansions, unique to one location in the genome, and not associated with retrotransposons. We leverage publicly available long-read genomes, including from the Human Genome Structural Variant Consortium, to ascertain the exact nucleotide composition of these VNTRs and compare their composition of alleles. We then confirm repeat unit composition in more than 3000 short-read samples from the 1000 Genomes Project. Our analysis reveals that these VNTRs contain highly structured repeat motif organization, modified by frequent deletion and duplication events. Although overall VNTR compositions tend to remain similar between 1000 Genomes Project superpopulations, we describe a notable exception with substantial differences in repeat composition (in PCBP3), as well as several VNTRs that are significantly different in length between superpopulations (in ART1, PROP1, DYNC2I1, and LOC102723906). We also observe that most of these VNTRs are expanded in archaic human genomes, yet remain stable in length between single generations. Collectively, our findings indicate that repeat motif variability, repeat composition, and repeat length are all informative modalities to consider when characterizing VNTRs and their contribution to genomic variation. Cold Spring Harbor Laboratory Press 2021-08 /pmc/articles/PMC8327921/ /pubmed/34244228 http://dx.doi.org/10.1101/gr.275560.121 Text en © 2021 Course et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Research
Course, Meredith M.
Sulovari, Arvis
Gudsnuk, Kathryn
Eichler, Evan E.
Valdmanis, Paul N.
Characterizing nucleotide variation and expansion dynamics in human-specific variable number tandem repeats
title Characterizing nucleotide variation and expansion dynamics in human-specific variable number tandem repeats
title_full Characterizing nucleotide variation and expansion dynamics in human-specific variable number tandem repeats
title_fullStr Characterizing nucleotide variation and expansion dynamics in human-specific variable number tandem repeats
title_full_unstemmed Characterizing nucleotide variation and expansion dynamics in human-specific variable number tandem repeats
title_short Characterizing nucleotide variation and expansion dynamics in human-specific variable number tandem repeats
title_sort characterizing nucleotide variation and expansion dynamics in human-specific variable number tandem repeats
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8327921/
https://www.ncbi.nlm.nih.gov/pubmed/34244228
http://dx.doi.org/10.1101/gr.275560.121
work_keys_str_mv AT coursemeredithm characterizingnucleotidevariationandexpansiondynamicsinhumanspecificvariablenumbertandemrepeats
AT sulovariarvis characterizingnucleotidevariationandexpansiondynamicsinhumanspecificvariablenumbertandemrepeats
AT gudsnukkathryn characterizingnucleotidevariationandexpansiondynamicsinhumanspecificvariablenumbertandemrepeats
AT eichlerevane characterizingnucleotidevariationandexpansiondynamicsinhumanspecificvariablenumbertandemrepeats
AT valdmanispauln characterizingnucleotidevariationandexpansiondynamicsinhumanspecificvariablenumbertandemrepeats