Cargando…
The mutational dynamics of short tandem repeats in large, multigenerational families
BACKGROUND: Short tandem repeats (STRs) compose approximately 3% of the genome, and mutations at STR loci have been linked to dozens of human diseases including amyotrophic lateral sclerosis, Friedreich ataxia, Huntington disease, and fragile X syndrome. Improving our understanding of these mutation...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9743774/ https://www.ncbi.nlm.nih.gov/pubmed/36510265 http://dx.doi.org/10.1186/s13059-022-02818-4 |
_version_ | 1784848796029550592 |
---|---|
author | Steely, Cody J. Watkins, W. Scott Baird, Lisa Jorde, Lynn B. |
author_facet | Steely, Cody J. Watkins, W. Scott Baird, Lisa Jorde, Lynn B. |
author_sort | Steely, Cody J. |
collection | PubMed |
description | BACKGROUND: Short tandem repeats (STRs) compose approximately 3% of the genome, and mutations at STR loci have been linked to dozens of human diseases including amyotrophic lateral sclerosis, Friedreich ataxia, Huntington disease, and fragile X syndrome. Improving our understanding of these mutations would increase our knowledge of the mutational dynamics of the genome and may uncover additional loci that contribute to disease. To estimate the genome-wide pattern of mutations at STR loci, we analyze blood-derived whole-genome sequencing data for 544 individuals from 29 three-generation CEPH pedigrees. These pedigrees contain both sets of grandparents, the parents, and an average of 9 grandchildren per family. RESULTS: We use HipSTR to identify de novo STR mutations in the 2nd generation of these pedigrees and require transmission to the third generation for validation. Analyzing approximately 1.6 million STR loci, we estimate the empirical de novo STR mutation rate to be 5.24 × 10(−5) mutations per locus per generation. Perfect repeats mutate about 2 × more often than imperfect repeats. De novo STRs are significantly enriched in Alu elements. CONCLUSIONS: Approximately 30% of new STR mutations occur within Alu elements, which compose only 11% of the genome, but only 10% are found in LINE-1 insertions, which compose 17% of the genome. Phasing these mutations to the parent of origin shows that parental transmission biases vary among families. We estimate the average number of de novo genome-wide STR mutations per individual to be approximately 85, which is similar to the average number of observed de novo single nucleotide variants. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02818-4. |
format | Online Article Text |
id | pubmed-9743774 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-97437742022-12-13 The mutational dynamics of short tandem repeats in large, multigenerational families Steely, Cody J. Watkins, W. Scott Baird, Lisa Jorde, Lynn B. Genome Biol Research BACKGROUND: Short tandem repeats (STRs) compose approximately 3% of the genome, and mutations at STR loci have been linked to dozens of human diseases including amyotrophic lateral sclerosis, Friedreich ataxia, Huntington disease, and fragile X syndrome. Improving our understanding of these mutations would increase our knowledge of the mutational dynamics of the genome and may uncover additional loci that contribute to disease. To estimate the genome-wide pattern of mutations at STR loci, we analyze blood-derived whole-genome sequencing data for 544 individuals from 29 three-generation CEPH pedigrees. These pedigrees contain both sets of grandparents, the parents, and an average of 9 grandchildren per family. RESULTS: We use HipSTR to identify de novo STR mutations in the 2nd generation of these pedigrees and require transmission to the third generation for validation. Analyzing approximately 1.6 million STR loci, we estimate the empirical de novo STR mutation rate to be 5.24 × 10(−5) mutations per locus per generation. Perfect repeats mutate about 2 × more often than imperfect repeats. De novo STRs are significantly enriched in Alu elements. CONCLUSIONS: Approximately 30% of new STR mutations occur within Alu elements, which compose only 11% of the genome, but only 10% are found in LINE-1 insertions, which compose 17% of the genome. Phasing these mutations to the parent of origin shows that parental transmission biases vary among families. We estimate the average number of de novo genome-wide STR mutations per individual to be approximately 85, which is similar to the average number of observed de novo single nucleotide variants. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02818-4. BioMed Central 2022-12-12 /pmc/articles/PMC9743774/ /pubmed/36510265 http://dx.doi.org/10.1186/s13059-022-02818-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Steely, Cody J. Watkins, W. Scott Baird, Lisa Jorde, Lynn B. The mutational dynamics of short tandem repeats in large, multigenerational families |
title | The mutational dynamics of short tandem repeats in large, multigenerational families |
title_full | The mutational dynamics of short tandem repeats in large, multigenerational families |
title_fullStr | The mutational dynamics of short tandem repeats in large, multigenerational families |
title_full_unstemmed | The mutational dynamics of short tandem repeats in large, multigenerational families |
title_short | The mutational dynamics of short tandem repeats in large, multigenerational families |
title_sort | mutational dynamics of short tandem repeats in large, multigenerational families |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9743774/ https://www.ncbi.nlm.nih.gov/pubmed/36510265 http://dx.doi.org/10.1186/s13059-022-02818-4 |
work_keys_str_mv | AT steelycodyj themutationaldynamicsofshorttandemrepeatsinlargemultigenerationalfamilies AT watkinswscott themutationaldynamicsofshorttandemrepeatsinlargemultigenerationalfamilies AT bairdlisa themutationaldynamicsofshorttandemrepeatsinlargemultigenerationalfamilies AT jordelynnb themutationaldynamicsofshorttandemrepeatsinlargemultigenerationalfamilies AT steelycodyj mutationaldynamicsofshorttandemrepeatsinlargemultigenerationalfamilies AT watkinswscott mutationaldynamicsofshorttandemrepeatsinlargemultigenerationalfamilies AT bairdlisa mutationaldynamicsofshorttandemrepeatsinlargemultigenerationalfamilies AT jordelynnb mutationaldynamicsofshorttandemrepeatsinlargemultigenerationalfamilies |