Cargando…

Characterization of genome-wide STR variation in 6487 human genomes

Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing dept...

Descripción completa

Detalles Bibliográficos
Autores principales: Shi, Yirong, Niu, Yiwei, Zhang, Peng, Luo, Huaxia, Liu, Shuai, Zhang, Sijia, Wang, Jiajia, Li, Yanyan, Liu, Xinyue, Song, Tingrui, Xu, Tao, He, Shunmin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10097659/
https://www.ncbi.nlm.nih.gov/pubmed/37045857
http://dx.doi.org/10.1038/s41467-023-37690-8
Descripción
Sumario:Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing depth. Here, we depicted a comprehensive map of 366,013 polymorphic STRs (pSTRs) constructed from 6487 deeply sequenced genomes, comprising 3983 Chinese samples (~31.5x, NyuWa) and 2504 samples from the 1000 Genomes Project (~33.3x, 1KGP). We found that STR mutations were affected by motif length, chromosome context and epigenetic features. We identified 3273 and 1117 pSTRs whose repeat numbers were associated with gene expression and 3′UTR alternative polyadenylation, respectively. We also implemented population analysis, investigated population differentiated signatures, and genotyped 60 known disease-causing STRs. Overall, this study further extends the scale of STR variation in humans and propels our understanding of the semantics of STRs.