Cargando…

Exome sequencing generates high quality data in non-target regions

BACKGROUND: Exome sequencing using next-generation sequencing technologies is a cost efficient approach to selectively sequencing coding regions of human genome for detection of disease variants. A significant amount of DNA fragments from the capture process fall outside target regions, and sequence...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Yan, Long, Jirong, He, Jing, Li, Chung-I, Cai, Qiuyin, Shu, Xiao-Ou, Zheng, Wei, Li, Chun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3416685/
https://www.ncbi.nlm.nih.gov/pubmed/22607156
http://dx.doi.org/10.1186/1471-2164-13-194
_version_ 1782240422719913984
author Guo, Yan
Long, Jirong
He, Jing
Li, Chung-I
Cai, Qiuyin
Shu, Xiao-Ou
Zheng, Wei
Li, Chun
author_facet Guo, Yan
Long, Jirong
He, Jing
Li, Chung-I
Cai, Qiuyin
Shu, Xiao-Ou
Zheng, Wei
Li, Chun
author_sort Guo, Yan
collection PubMed
description BACKGROUND: Exome sequencing using next-generation sequencing technologies is a cost efficient approach to selectively sequencing coding regions of human genome for detection of disease variants. A significant amount of DNA fragments from the capture process fall outside target regions, and sequence data for positions outside target regions have been mostly ignored after alignment. RESULT: We performed whole exome sequencing on 22 subjects using Agilent SureSelect capture reagent and 6 subjects using Illumina TrueSeq capture reagent. We also downloaded sequencing data for 6 subjects from the 1000 Genomes Project Pilot 3 study. Using these data, we examined the quality of SNPs detected outside target regions by computing consistency rate with genotypes obtained from SNP chips or the Hapmap database, transition-transversion (Ti/Tv) ratio, and percentage of SNPs inside dbSNP. For all three platforms, we obtained high-quality SNPs outside target regions, and some far from target regions. In our Agilent SureSelect data, we obtained 84,049 high-quality SNPs outside target regions compared to 65,231 SNPs inside target regions (a 129% increase). For our Illumina TrueSeq data, we obtained 222,171 high-quality SNPs outside target regions compared to 95,818 SNPs inside target regions (a 232% increase). For the data from the 1000 Genomes Project, we obtained 7,139 high-quality SNPs outside target regions compared to 1,548 SNPs inside target regions (a 461% increase). CONCLUSIONS: These results demonstrate that a significant amount of high quality genotypes outside target regions can be obtained from exome sequencing data. These data should not be ignored in genetic epidemiology studies.
format Online
Article
Text
id pubmed-3416685
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34166852012-08-11 Exome sequencing generates high quality data in non-target regions Guo, Yan Long, Jirong He, Jing Li, Chung-I Cai, Qiuyin Shu, Xiao-Ou Zheng, Wei Li, Chun BMC Genomics Methodology Article BACKGROUND: Exome sequencing using next-generation sequencing technologies is a cost efficient approach to selectively sequencing coding regions of human genome for detection of disease variants. A significant amount of DNA fragments from the capture process fall outside target regions, and sequence data for positions outside target regions have been mostly ignored after alignment. RESULT: We performed whole exome sequencing on 22 subjects using Agilent SureSelect capture reagent and 6 subjects using Illumina TrueSeq capture reagent. We also downloaded sequencing data for 6 subjects from the 1000 Genomes Project Pilot 3 study. Using these data, we examined the quality of SNPs detected outside target regions by computing consistency rate with genotypes obtained from SNP chips or the Hapmap database, transition-transversion (Ti/Tv) ratio, and percentage of SNPs inside dbSNP. For all three platforms, we obtained high-quality SNPs outside target regions, and some far from target regions. In our Agilent SureSelect data, we obtained 84,049 high-quality SNPs outside target regions compared to 65,231 SNPs inside target regions (a 129% increase). For our Illumina TrueSeq data, we obtained 222,171 high-quality SNPs outside target regions compared to 95,818 SNPs inside target regions (a 232% increase). For the data from the 1000 Genomes Project, we obtained 7,139 high-quality SNPs outside target regions compared to 1,548 SNPs inside target regions (a 461% increase). CONCLUSIONS: These results demonstrate that a significant amount of high quality genotypes outside target regions can be obtained from exome sequencing data. These data should not be ignored in genetic epidemiology studies. BioMed Central 2012-05-20 /pmc/articles/PMC3416685/ /pubmed/22607156 http://dx.doi.org/10.1186/1471-2164-13-194 Text en Copyright ©2012 Guo et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Guo, Yan
Long, Jirong
He, Jing
Li, Chung-I
Cai, Qiuyin
Shu, Xiao-Ou
Zheng, Wei
Li, Chun
Exome sequencing generates high quality data in non-target regions
title Exome sequencing generates high quality data in non-target regions
title_full Exome sequencing generates high quality data in non-target regions
title_fullStr Exome sequencing generates high quality data in non-target regions
title_full_unstemmed Exome sequencing generates high quality data in non-target regions
title_short Exome sequencing generates high quality data in non-target regions
title_sort exome sequencing generates high quality data in non-target regions
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3416685/
https://www.ncbi.nlm.nih.gov/pubmed/22607156
http://dx.doi.org/10.1186/1471-2164-13-194
work_keys_str_mv AT guoyan exomesequencinggenerateshighqualitydatainnontargetregions
AT longjirong exomesequencinggenerateshighqualitydatainnontargetregions
AT hejing exomesequencinggenerateshighqualitydatainnontargetregions
AT lichungi exomesequencinggenerateshighqualitydatainnontargetregions
AT caiqiuyin exomesequencinggenerateshighqualitydatainnontargetregions
AT shuxiaoou exomesequencinggenerateshighqualitydatainnontargetregions
AT zhengwei exomesequencinggenerateshighqualitydatainnontargetregions
AT lichun exomesequencinggenerateshighqualitydatainnontargetregions