Cargando…

Profiling the Genome-Wide Landscape of Short Tandem Repeats by Long-Read Sequencing

Background: Short tandem repeats (STRs) are highly variable elements that play a pivotal role in multiple genetic diseases and the regulation of gene expression. Long-read sequencing (LRS) offers a potential solution to genome-wide STR analysis. However, characterizing STRs in human genomes using LR...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Zhenhua, Zhao, Guihu, Xiao, Yuhui, Zeng, Sheng, Yuan, Yanchun, Zhou, Xun, Fang, Zhenghuan, He, Runcheng, Li, Bin, Zhao, Yuwen, Pan, Hongxu, Wang, Yige, Yu, Guoliang, Peng, I-Feng, Wang, Depeng, Meng, Qingtuan, Xu, Qian, Sun, Qiying, Yan, Xinxiang, Shen, Lu, Jiang, Hong, Xia, Kun, Wang, Junling, Guo, Jifeng, Liang, Fan, Li, Jinchen, Tang, Beisha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9117641/
https://www.ncbi.nlm.nih.gov/pubmed/35601492
http://dx.doi.org/10.3389/fgene.2022.810595
_version_ 1784710353591992320
author Liu, Zhenhua
Zhao, Guihu
Xiao, Yuhui
Zeng, Sheng
Yuan, Yanchun
Zhou, Xun
Fang, Zhenghuan
He, Runcheng
Li, Bin
Zhao, Yuwen
Pan, Hongxu
Wang, Yige
Yu, Guoliang
Peng, I-Feng
Wang, Depeng
Meng, Qingtuan
Xu, Qian
Sun, Qiying
Yan, Xinxiang
Shen, Lu
Jiang, Hong
Xia, Kun
Wang, Junling
Guo, Jifeng
Liang, Fan
Li, Jinchen
Tang, Beisha
author_facet Liu, Zhenhua
Zhao, Guihu
Xiao, Yuhui
Zeng, Sheng
Yuan, Yanchun
Zhou, Xun
Fang, Zhenghuan
He, Runcheng
Li, Bin
Zhao, Yuwen
Pan, Hongxu
Wang, Yige
Yu, Guoliang
Peng, I-Feng
Wang, Depeng
Meng, Qingtuan
Xu, Qian
Sun, Qiying
Yan, Xinxiang
Shen, Lu
Jiang, Hong
Xia, Kun
Wang, Junling
Guo, Jifeng
Liang, Fan
Li, Jinchen
Tang, Beisha
author_sort Liu, Zhenhua
collection PubMed
description Background: Short tandem repeats (STRs) are highly variable elements that play a pivotal role in multiple genetic diseases and the regulation of gene expression. Long-read sequencing (LRS) offers a potential solution to genome-wide STR analysis. However, characterizing STRs in human genomes using LRS on a large population scale has not been reported. Methods: We conducted the large LRS-based STR analysis in 193 unrelated samples of the Chinese population and performed genome-wide profiling of STR variation in the human genome. The repeat dynamic index (RDI) was introduced to evaluate the variability of STR. We sourced the expression data from the Genotype-Tissue Expression to explore the tissue specificity of highly variable STRs related genes across tissues. Enrichment analyses were also conducted to identify potential functional roles of the high variable STRs. Results: This study reports the large-scale analysis of human STR variation by LRS and offers a reference STR database based on the LRS dataset. We found that the disease-associated STRs (dSTRs) and STRs associated with the expression of nearby genes (eSTRs) were highly variable in the general population. Moreover, tissue-specific expression analysis showed that those highly variable STRs related genes presented the highest expression level in brain tissues, and enrichment pathways analysis found those STRs are involved in synaptic function-related pathways. Conclusion: Our study profiled the genome-wide landscape of STR using LRS and highlighted the highly variable STRs in the human genome, which provide a valuable resource for studying the role of STRs in human disease and complex traits.
format Online
Article
Text
id pubmed-9117641
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-91176412022-05-20 Profiling the Genome-Wide Landscape of Short Tandem Repeats by Long-Read Sequencing Liu, Zhenhua Zhao, Guihu Xiao, Yuhui Zeng, Sheng Yuan, Yanchun Zhou, Xun Fang, Zhenghuan He, Runcheng Li, Bin Zhao, Yuwen Pan, Hongxu Wang, Yige Yu, Guoliang Peng, I-Feng Wang, Depeng Meng, Qingtuan Xu, Qian Sun, Qiying Yan, Xinxiang Shen, Lu Jiang, Hong Xia, Kun Wang, Junling Guo, Jifeng Liang, Fan Li, Jinchen Tang, Beisha Front Genet Genetics Background: Short tandem repeats (STRs) are highly variable elements that play a pivotal role in multiple genetic diseases and the regulation of gene expression. Long-read sequencing (LRS) offers a potential solution to genome-wide STR analysis. However, characterizing STRs in human genomes using LRS on a large population scale has not been reported. Methods: We conducted the large LRS-based STR analysis in 193 unrelated samples of the Chinese population and performed genome-wide profiling of STR variation in the human genome. The repeat dynamic index (RDI) was introduced to evaluate the variability of STR. We sourced the expression data from the Genotype-Tissue Expression to explore the tissue specificity of highly variable STRs related genes across tissues. Enrichment analyses were also conducted to identify potential functional roles of the high variable STRs. Results: This study reports the large-scale analysis of human STR variation by LRS and offers a reference STR database based on the LRS dataset. We found that the disease-associated STRs (dSTRs) and STRs associated with the expression of nearby genes (eSTRs) were highly variable in the general population. Moreover, tissue-specific expression analysis showed that those highly variable STRs related genes presented the highest expression level in brain tissues, and enrichment pathways analysis found those STRs are involved in synaptic function-related pathways. Conclusion: Our study profiled the genome-wide landscape of STR using LRS and highlighted the highly variable STRs in the human genome, which provide a valuable resource for studying the role of STRs in human disease and complex traits. Frontiers Media S.A. 2022-05-05 /pmc/articles/PMC9117641/ /pubmed/35601492 http://dx.doi.org/10.3389/fgene.2022.810595 Text en Copyright © 2022 Liu, Zhao, Xiao, Zeng, Yuan, Zhou, Fang, He, Li, Zhao, Pan, Wang, Yu, Peng, Wang, Meng, Xu, Sun, Yan, Shen, Jiang, Xia, Wang, Guo, Liang, Li and Tang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Liu, Zhenhua
Zhao, Guihu
Xiao, Yuhui
Zeng, Sheng
Yuan, Yanchun
Zhou, Xun
Fang, Zhenghuan
He, Runcheng
Li, Bin
Zhao, Yuwen
Pan, Hongxu
Wang, Yige
Yu, Guoliang
Peng, I-Feng
Wang, Depeng
Meng, Qingtuan
Xu, Qian
Sun, Qiying
Yan, Xinxiang
Shen, Lu
Jiang, Hong
Xia, Kun
Wang, Junling
Guo, Jifeng
Liang, Fan
Li, Jinchen
Tang, Beisha
Profiling the Genome-Wide Landscape of Short Tandem Repeats by Long-Read Sequencing
title Profiling the Genome-Wide Landscape of Short Tandem Repeats by Long-Read Sequencing
title_full Profiling the Genome-Wide Landscape of Short Tandem Repeats by Long-Read Sequencing
title_fullStr Profiling the Genome-Wide Landscape of Short Tandem Repeats by Long-Read Sequencing
title_full_unstemmed Profiling the Genome-Wide Landscape of Short Tandem Repeats by Long-Read Sequencing
title_short Profiling the Genome-Wide Landscape of Short Tandem Repeats by Long-Read Sequencing
title_sort profiling the genome-wide landscape of short tandem repeats by long-read sequencing
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9117641/
https://www.ncbi.nlm.nih.gov/pubmed/35601492
http://dx.doi.org/10.3389/fgene.2022.810595
work_keys_str_mv AT liuzhenhua profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT zhaoguihu profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT xiaoyuhui profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT zengsheng profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT yuanyanchun profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT zhouxun profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT fangzhenghuan profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT heruncheng profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT libin profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT zhaoyuwen profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT panhongxu profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT wangyige profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT yuguoliang profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT pengifeng profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT wangdepeng profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT mengqingtuan profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT xuqian profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT sunqiying profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT yanxinxiang profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT shenlu profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT jianghong profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT xiakun profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT wangjunling profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT guojifeng profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT liangfan profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT lijinchen profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing
AT tangbeisha profilingthegenomewidelandscapeofshorttandemrepeatsbylongreadsequencing