Profiling the Genome-Wide Landscape of Short Tandem Repeats by Long-Read Sequencing
Background: Short tandem repeats (STRs) are highly variable elements that play a pivotal role in multiple genetic diseases and the regulation of gene expression. Long-read sequencing (LRS) offers a potential solution to genome-wide STR analysis. However, characterizing STRs in human genomes using LR...
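Main authors: | Liu, Zhenhua; Zhao, Guihu; Xiao, Yuhui; Zeng, Sheng; Yuan, Yanchun; Zhou, Xun; Fang, Zhenghuan; He, Runcheng; Li, Bin; Zhao, Yuwen; Pan, Hongxu; Wang, Yige; Yu, Guoliang; Peng, I-Feng; Wang, Depeng; Meng, Qingtuan; Xu, Qian; Sun, Qiying; Yan, Xinxiang; Shen, Lu; Jiang, Hong; Xia, Kun; Wang, Junling; Guo, Jifeng; Liang, Fan; Li, Jinchen; Tang, Beisha |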
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2022 |
Subjects: | Genetics |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9117641/ https://www.ncbi.nlm.nih.gov/pubmed/35601492 http://dx.doi.org/10.3389/fgene.2022.810595 |
_version_ | 1784710353591992320 |
---|---|
author | Liu, Zhenhua Zhao, Guihu Xiao, Yuhui Zeng, Sheng Yuan, Yanchun Zhou, Xun Fang, Zhenghuan He, Runcheng Li, Bin Zhao, Yuwen Pan, Hongxu Wang, Yige Yu, Guoliang Peng, I-Feng Wang, Depeng Meng, Qingtuan Xu, Qian Sun, Qiying Yan, Xinxiang Shen, Lu Jiang, Hong Xia, Kun Wang, Junling Guo, Jifeng Liang, Fan Li, Jinchen Tang, Beisha |
author_sort | Liu, Zhenhua |
collection | PubMed |
description | Background: Short tandem repeats (STRs) are highly variable elements that play a pivotal role in multiple genetic diseases and in the regulation of gene expression. Long-read sequencing (LRS) offers a potential solution for genome-wide STR analysis. However, characterization of STRs in human genomes using LRS at a large population scale has not been reported. Methods: We conducted a large LRS-based STR analysis in 193 unrelated samples from the Chinese population and performed genome-wide profiling of STR variation in the human genome. The repeat dynamic index (RDI) was introduced to evaluate the variability of STRs. We sourced expression data from the Genotype-Tissue Expression (GTEx) project to explore the tissue specificity of genes related to highly variable STRs. Enrichment analyses were also conducted to identify potential functional roles of the highly variable STRs. Results: This study reports a large-scale analysis of human STR variation by LRS and offers a reference STR database based on the LRS dataset. We found that disease-associated STRs (dSTRs) and STRs associated with the expression of nearby genes (eSTRs) were highly variable in the general population. Moreover, tissue-specific expression analysis showed that genes related to highly variable STRs had their highest expression levels in brain tissues, and pathway enrichment analysis found that those STRs are involved in synaptic function-related pathways. Conclusion: Our study profiled the genome-wide landscape of STRs using LRS and highlighted the highly variable STRs in the human genome, providing a valuable resource for studying the role of STRs in human disease and complex traits. |
format | Online Article Text |
id | pubmed-9117641 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-9117641 2022-05-20 Front Genet Genetics Frontiers Media S.A. 2022-05-05 /pmc/articles/PMC9117641/ /pubmed/35601492 http://dx.doi.org/10.3389/fgene.2022.810595 Text en Copyright © 2022 Liu, Zhao, Xiao, Zeng, Yuan, Zhou, Fang, He, Li, Zhao, Pan, Wang, Yu, Peng, Wang, Meng, Xu, Sun, Yan, Shen, Jiang, Xia, Wang, Guo, Liang, Li and Tang. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
title | Profiling the Genome-Wide Landscape of Short Tandem Repeats by Long-Read Sequencing |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9117641/ https://www.ncbi.nlm.nih.gov/pubmed/35601492 http://dx.doi.org/10.3389/fgene.2022.810595 |