Cargando…

A landscape of complex tandem repeats within individual human genomes

Markedly expanded tandem repeats (TRs) have been correlated with ~60 diseases. TR diversity has been considered a clue toward understanding missing heritability. However, haplotype-resolved long TRs remain mostly hidden or blacked out because their complex structures (TRs composed of various units a...

Descripción completa

Detalles Bibliográficos
Autores principales: Ichikawa, Kazuki, Kawahara, Riki, Asano, Takeshi, Morishita, Shinichi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10502081/
https://www.ncbi.nlm.nih.gov/pubmed/37709751
http://dx.doi.org/10.1038/s41467-023-41262-1
_version_ 1785106242536996864
author Ichikawa, Kazuki
Kawahara, Riki
Asano, Takeshi
Morishita, Shinichi
author_facet Ichikawa, Kazuki
Kawahara, Riki
Asano, Takeshi
Morishita, Shinichi
author_sort Ichikawa, Kazuki
collection PubMed
description Markedly expanded tandem repeats (TRs) have been correlated with ~60 diseases. TR diversity has been considered a clue toward understanding missing heritability. However, haplotype-resolved long TRs remain mostly hidden or blacked out because their complex structures (TRs composed of various units and minisatellites containing >10-bp units) make them difficult to determine accurately with existing methods. Here, using a high-precision algorithm to determine complex TR structures from long, accurate reads of PacBio HiFi, an investigation of 270 Japanese control samples yields several genome-wide findings. Approximately 322,000 TRs are difficult to impute from the surrounding single-nucleotide variants. Greater genetic divergence of TR loci is significantly correlated with more events of younger replication slippage. Complex TRs are more abundant than single-unit TRs, and a tendency for complex TRs to consist of <10-bp units and single-unit TRs to be minisatellites is statistically significant at loci with ≥500-bp TRs. Of note, 8909 loci with extended TRs (>100b longer than the mode) contain several known disease-associated TRs and are considered candidates for association with disorders. Overall, complex TRs and minisatellites are found to be abundant and diverse, even in genetically small Japanese populations, yielding insights into the landscape of long TRs.
format Online
Article
Text
id pubmed-10502081
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-105020812023-09-16 A landscape of complex tandem repeats within individual human genomes Ichikawa, Kazuki Kawahara, Riki Asano, Takeshi Morishita, Shinichi Nat Commun Article Markedly expanded tandem repeats (TRs) have been correlated with ~60 diseases. TR diversity has been considered a clue toward understanding missing heritability. However, haplotype-resolved long TRs remain mostly hidden or blacked out because their complex structures (TRs composed of various units and minisatellites containing >10-bp units) make them difficult to determine accurately with existing methods. Here, using a high-precision algorithm to determine complex TR structures from long, accurate reads of PacBio HiFi, an investigation of 270 Japanese control samples yields several genome-wide findings. Approximately 322,000 TRs are difficult to impute from the surrounding single-nucleotide variants. Greater genetic divergence of TR loci is significantly correlated with more events of younger replication slippage. Complex TRs are more abundant than single-unit TRs, and a tendency for complex TRs to consist of <10-bp units and single-unit TRs to be minisatellites is statistically significant at loci with ≥500-bp TRs. Of note, 8909 loci with extended TRs (>100b longer than the mode) contain several known disease-associated TRs and are considered candidates for association with disorders. Overall, complex TRs and minisatellites are found to be abundant and diverse, even in genetically small Japanese populations, yielding insights into the landscape of long TRs. Nature Publishing Group UK 2023-09-14 /pmc/articles/PMC10502081/ /pubmed/37709751 http://dx.doi.org/10.1038/s41467-023-41262-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Ichikawa, Kazuki
Kawahara, Riki
Asano, Takeshi
Morishita, Shinichi
A landscape of complex tandem repeats within individual human genomes
title A landscape of complex tandem repeats within individual human genomes
title_full A landscape of complex tandem repeats within individual human genomes
title_fullStr A landscape of complex tandem repeats within individual human genomes
title_full_unstemmed A landscape of complex tandem repeats within individual human genomes
title_short A landscape of complex tandem repeats within individual human genomes
title_sort landscape of complex tandem repeats within individual human genomes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10502081/
https://www.ncbi.nlm.nih.gov/pubmed/37709751
http://dx.doi.org/10.1038/s41467-023-41262-1
work_keys_str_mv AT ichikawakazuki alandscapeofcomplextandemrepeatswithinindividualhumangenomes
AT kawaharariki alandscapeofcomplextandemrepeatswithinindividualhumangenomes
AT asanotakeshi alandscapeofcomplextandemrepeatswithinindividualhumangenomes
AT morishitashinichi alandscapeofcomplextandemrepeatswithinindividualhumangenomes
AT ichikawakazuki landscapeofcomplextandemrepeatswithinindividualhumangenomes
AT kawaharariki landscapeofcomplextandemrepeatswithinindividualhumangenomes
AT asanotakeshi landscapeofcomplextandemrepeatswithinindividualhumangenomes
AT morishitashinichi landscapeofcomplextandemrepeatswithinindividualhumangenomes