Cargando…

The Genomic landscape of short tandem repeats across multiple ancestries

Short Tandem Repeats (STRs) have been found to play a role in a myriad of complex traits and genetic diseases. We examined the variability in the lengths of over 850,000 STR loci in 996 children with suspected genetic disorders and 1,178 parents across six separate ancestral groups: Africans, Europe...

Descripción completa

Detalles Bibliográficos
Autores principales: Vijayaraghavan, Prashanth, Batalov, Sergey, Ding, Yan, Sanford, Erica, Kingsmore, Stephen F., Dimmock, David, Hobbs, Charlotte, Bainbridge, Matthew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9879404/
https://www.ncbi.nlm.nih.gov/pubmed/36701310
http://dx.doi.org/10.1371/journal.pone.0279430
_version_ 1784878679892951040
author Vijayaraghavan, Prashanth
Batalov, Sergey
Ding, Yan
Sanford, Erica
Kingsmore, Stephen F.
Dimmock, David
Hobbs, Charlotte
Bainbridge, Matthew
author_facet Vijayaraghavan, Prashanth
Batalov, Sergey
Ding, Yan
Sanford, Erica
Kingsmore, Stephen F.
Dimmock, David
Hobbs, Charlotte
Bainbridge, Matthew
author_sort Vijayaraghavan, Prashanth
collection PubMed
description Short Tandem Repeats (STRs) have been found to play a role in a myriad of complex traits and genetic diseases. We examined the variability in the lengths of over 850,000 STR loci in 996 children with suspected genetic disorders and 1,178 parents across six separate ancestral groups: Africans, Europeans, East Asians, Admixed Americans, Non-admixed Americans, and Pacific Islanders. For each STR locus we compared allele length between and within each ancestry group. In relation to Europeans, admixed Americans had the most similar STR lengths with only 623 positions either significantly expanded or contracted, while the divergence was highest in Africans, with 4,933 chromosomal positions contracted or expanded. We also examined probands to identify STR expansions at known pathogenic loci. The genes TCF4, AR, and DMPK showed significant expansions with lengths 250% greater than their various average allele lengths in 49, 162, and 11 individuals respectively. All 49 individuals containing an expansion in TCF4 and six individuals containing an expansion in DMPK presented with allele lengths longer than the known pathogenic length for these genes. Next, we identified individuals with significant expansions in highly conserved loci across all ancestries. Eighty loci in conserved regions met criteria for divergence. Two of these individuals were found to have exonic STR expansions: one in ZBTB4 and the other in SLC9A7, which is associated with X-linked mental retardation. Finally, we used parent-child trios to detect and analyze de novo mutations. In total, we observed 3,219 de novo expansions, where proband allele lengths are greater than twice the longest parental allele length. This work helps lay the foundation for understanding STR lengths genome-wide across ancestries and may help identify new disease genes and novel mechanisms of pathogenicity in known disease genes.
format Online
Article
Text
id pubmed-9879404
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-98794042023-01-27 The Genomic landscape of short tandem repeats across multiple ancestries Vijayaraghavan, Prashanth Batalov, Sergey Ding, Yan Sanford, Erica Kingsmore, Stephen F. Dimmock, David Hobbs, Charlotte Bainbridge, Matthew PLoS One Research Article Short Tandem Repeats (STRs) have been found to play a role in a myriad of complex traits and genetic diseases. We examined the variability in the lengths of over 850,000 STR loci in 996 children with suspected genetic disorders and 1,178 parents across six separate ancestral groups: Africans, Europeans, East Asians, Admixed Americans, Non-admixed Americans, and Pacific Islanders. For each STR locus we compared allele length between and within each ancestry group. In relation to Europeans, admixed Americans had the most similar STR lengths with only 623 positions either significantly expanded or contracted, while the divergence was highest in Africans, with 4,933 chromosomal positions contracted or expanded. We also examined probands to identify STR expansions at known pathogenic loci. The genes TCF4, AR, and DMPK showed significant expansions with lengths 250% greater than their various average allele lengths in 49, 162, and 11 individuals respectively. All 49 individuals containing an expansion in TCF4 and six individuals containing an expansion in DMPK presented with allele lengths longer than the known pathogenic length for these genes. Next, we identified individuals with significant expansions in highly conserved loci across all ancestries. Eighty loci in conserved regions met criteria for divergence. Two of these individuals were found to have exonic STR expansions: one in ZBTB4 and the other in SLC9A7, which is associated with X-linked mental retardation. Finally, we used parent-child trios to detect and analyze de novo mutations. In total, we observed 3,219 de novo expansions, where proband allele lengths are greater than twice the longest parental allele length. This work helps lay the foundation for understanding STR lengths genome-wide across ancestries and may help identify new disease genes and novel mechanisms of pathogenicity in known disease genes. Public Library of Science 2023-01-26 /pmc/articles/PMC9879404/ /pubmed/36701310 http://dx.doi.org/10.1371/journal.pone.0279430 Text en © 2023 Vijayaraghavan et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Vijayaraghavan, Prashanth
Batalov, Sergey
Ding, Yan
Sanford, Erica
Kingsmore, Stephen F.
Dimmock, David
Hobbs, Charlotte
Bainbridge, Matthew
The Genomic landscape of short tandem repeats across multiple ancestries
title The Genomic landscape of short tandem repeats across multiple ancestries
title_full The Genomic landscape of short tandem repeats across multiple ancestries
title_fullStr The Genomic landscape of short tandem repeats across multiple ancestries
title_full_unstemmed The Genomic landscape of short tandem repeats across multiple ancestries
title_short The Genomic landscape of short tandem repeats across multiple ancestries
title_sort genomic landscape of short tandem repeats across multiple ancestries
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9879404/
https://www.ncbi.nlm.nih.gov/pubmed/36701310
http://dx.doi.org/10.1371/journal.pone.0279430
work_keys_str_mv AT vijayaraghavanprashanth thegenomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT batalovsergey thegenomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT dingyan thegenomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT sanforderica thegenomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT kingsmorestephenf thegenomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT dimmockdavid thegenomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT hobbscharlotte thegenomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT bainbridgematthew thegenomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT vijayaraghavanprashanth genomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT batalovsergey genomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT dingyan genomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT sanforderica genomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT kingsmorestephenf genomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT dimmockdavid genomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT hobbscharlotte genomiclandscapeofshorttandemrepeatsacrossmultipleancestries
AT bainbridgematthew genomiclandscapeofshorttandemrepeatsacrossmultipleancestries