Cargando…

Population-wide sampling of retrotransposon insertion polymorphisms using deep sequencing and efficient detection

Active retrotransposons play important roles during evolution and continue to shape our genomes today, especially in genetic polymorphisms underlying a diverse set of diseases. However, studies of human retrotransposon insertion polymorphisms (RIPs) based on whole-genome deep sequencing at the popul...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Qichao, Zhang, Wei, Zhang, Xiaolong, Zeng, Yongli, Wang, Yeming, Wang, Yanhui, Xu, Liqin, Huang, Xiaoyun, Li, Nannan, Zhou, Xinlan, Lu, Jie, Guo, Xiaosen, Li, Guibo, Hou, Yong, Liu, Shiping, Li, Bo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5603766/
https://www.ncbi.nlm.nih.gov/pubmed/28938719
http://dx.doi.org/10.1093/gigascience/gix066
_version_ 1783264766841585664
author Yu, Qichao
Zhang, Wei
Zhang, Xiaolong
Zeng, Yongli
Wang, Yeming
Wang, Yanhui
Xu, Liqin
Huang, Xiaoyun
Li, Nannan
Zhou, Xinlan
Lu, Jie
Guo, Xiaosen
Li, Guibo
Hou, Yong
Liu, Shiping
Li, Bo
author_facet Yu, Qichao
Zhang, Wei
Zhang, Xiaolong
Zeng, Yongli
Wang, Yeming
Wang, Yanhui
Xu, Liqin
Huang, Xiaoyun
Li, Nannan
Zhou, Xinlan
Lu, Jie
Guo, Xiaosen
Li, Guibo
Hou, Yong
Liu, Shiping
Li, Bo
author_sort Yu, Qichao
collection PubMed
description Active retrotransposons play important roles during evolution and continue to shape our genomes today, especially in genetic polymorphisms underlying a diverse set of diseases. However, studies of human retrotransposon insertion polymorphisms (RIPs) based on whole-genome deep sequencing at the population level have not been sufficiently undertaken, despite the obvious need for a thorough characterization of RIPs in the general population. Herein, we present a novel and efficient computational tool called Specific Insertions Detector (SID) for the detection of non-reference RIPs. We demonstrate that SID is suitable for high-depth whole-genome sequencing data using paired-end reads obtained from simulated and real datasets. We construct a comprehensive RIP database using a large population of 90 Han Chinese individuals with a mean ×68 depth per individual. In total, we identify 9342 recent RIPs, and 8433 of these RIPs are novel compared with dbRIP, including 5826 Alu, 2169 long interspersed nuclear element 1 (L1), 383 SVA, and 55 long terminal repeats. Among the 9342 RIPs, 4828 were located in gene regions and 5 were located in protein-coding regions. We demonstrate that RIPs can, in principle, be an informative resource to perform population evolution and phylogenetic analyses. Taking the demographic effects into account, we identify a weak negative selection on SVA and L1 but an approximately neutral selection for Alu elements based on the frequency spectrum of RIPs. SID is a powerful open-source program for the detection of non-reference RIPs. We built a non-reference RIP dataset that greatly enhanced the diversity of RIPs detected in the general population, and it should be invaluable to researchers interested in many aspects of human evolution, genetics, and disease. As a proof of concept, we demonstrate that the RIPs can be used as biomarkers in a similar way as single nucleotide polymorphisms.
format Online
Article
Text
id pubmed-5603766
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-56037662017-09-25 Population-wide sampling of retrotransposon insertion polymorphisms using deep sequencing and efficient detection Yu, Qichao Zhang, Wei Zhang, Xiaolong Zeng, Yongli Wang, Yeming Wang, Yanhui Xu, Liqin Huang, Xiaoyun Li, Nannan Zhou, Xinlan Lu, Jie Guo, Xiaosen Li, Guibo Hou, Yong Liu, Shiping Li, Bo Gigascience Technical Note Active retrotransposons play important roles during evolution and continue to shape our genomes today, especially in genetic polymorphisms underlying a diverse set of diseases. However, studies of human retrotransposon insertion polymorphisms (RIPs) based on whole-genome deep sequencing at the population level have not been sufficiently undertaken, despite the obvious need for a thorough characterization of RIPs in the general population. Herein, we present a novel and efficient computational tool called Specific Insertions Detector (SID) for the detection of non-reference RIPs. We demonstrate that SID is suitable for high-depth whole-genome sequencing data using paired-end reads obtained from simulated and real datasets. We construct a comprehensive RIP database using a large population of 90 Han Chinese individuals with a mean ×68 depth per individual. In total, we identify 9342 recent RIPs, and 8433 of these RIPs are novel compared with dbRIP, including 5826 Alu, 2169 long interspersed nuclear element 1 (L1), 383 SVA, and 55 long terminal repeats. Among the 9342 RIPs, 4828 were located in gene regions and 5 were located in protein-coding regions. We demonstrate that RIPs can, in principle, be an informative resource to perform population evolution and phylogenetic analyses. Taking the demographic effects into account, we identify a weak negative selection on SVA and L1 but an approximately neutral selection for Alu elements based on the frequency spectrum of RIPs. SID is a powerful open-source program for the detection of non-reference RIPs. We built a non-reference RIP dataset that greatly enhanced the diversity of RIPs detected in the general population, and it should be invaluable to researchers interested in many aspects of human evolution, genetics, and disease. As a proof of concept, we demonstrate that the RIPs can be used as biomarkers in a similar way as single nucleotide polymorphisms. Oxford University Press 2017-07-31 /pmc/articles/PMC5603766/ /pubmed/28938719 http://dx.doi.org/10.1093/gigascience/gix066 Text en © The Authors 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Yu, Qichao
Zhang, Wei
Zhang, Xiaolong
Zeng, Yongli
Wang, Yeming
Wang, Yanhui
Xu, Liqin
Huang, Xiaoyun
Li, Nannan
Zhou, Xinlan
Lu, Jie
Guo, Xiaosen
Li, Guibo
Hou, Yong
Liu, Shiping
Li, Bo
Population-wide sampling of retrotransposon insertion polymorphisms using deep sequencing and efficient detection
title Population-wide sampling of retrotransposon insertion polymorphisms using deep sequencing and efficient detection
title_full Population-wide sampling of retrotransposon insertion polymorphisms using deep sequencing and efficient detection
title_fullStr Population-wide sampling of retrotransposon insertion polymorphisms using deep sequencing and efficient detection
title_full_unstemmed Population-wide sampling of retrotransposon insertion polymorphisms using deep sequencing and efficient detection
title_short Population-wide sampling of retrotransposon insertion polymorphisms using deep sequencing and efficient detection
title_sort population-wide sampling of retrotransposon insertion polymorphisms using deep sequencing and efficient detection
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5603766/
https://www.ncbi.nlm.nih.gov/pubmed/28938719
http://dx.doi.org/10.1093/gigascience/gix066
work_keys_str_mv AT yuqichao populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT zhangwei populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT zhangxiaolong populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT zengyongli populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT wangyeming populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT wangyanhui populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT xuliqin populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT huangxiaoyun populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT linannan populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT zhouxinlan populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT lujie populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT guoxiaosen populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT liguibo populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT houyong populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT liushiping populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection
AT libo populationwidesamplingofretrotransposoninsertionpolymorphismsusingdeepsequencingandefficientdetection