Cargando…
Exome sequencing generates high quality data in non-target regions
BACKGROUND: Exome sequencing using next-generation sequencing technologies is a cost efficient approach to selectively sequencing coding regions of human genome for detection of disease variants. A significant amount of DNA fragments from the capture process fall outside target regions, and sequence...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3416685/ https://www.ncbi.nlm.nih.gov/pubmed/22607156 http://dx.doi.org/10.1186/1471-2164-13-194 |
_version_ | 1782240422719913984 |
---|---|
author | Guo, Yan Long, Jirong He, Jing Li, Chung-I Cai, Qiuyin Shu, Xiao-Ou Zheng, Wei Li, Chun |
author_facet | Guo, Yan Long, Jirong He, Jing Li, Chung-I Cai, Qiuyin Shu, Xiao-Ou Zheng, Wei Li, Chun |
author_sort | Guo, Yan |
collection | PubMed |
description | BACKGROUND: Exome sequencing using next-generation sequencing technologies is a cost efficient approach to selectively sequencing coding regions of human genome for detection of disease variants. A significant amount of DNA fragments from the capture process fall outside target regions, and sequence data for positions outside target regions have been mostly ignored after alignment. RESULT: We performed whole exome sequencing on 22 subjects using Agilent SureSelect capture reagent and 6 subjects using Illumina TrueSeq capture reagent. We also downloaded sequencing data for 6 subjects from the 1000 Genomes Project Pilot 3 study. Using these data, we examined the quality of SNPs detected outside target regions by computing consistency rate with genotypes obtained from SNP chips or the Hapmap database, transition-transversion (Ti/Tv) ratio, and percentage of SNPs inside dbSNP. For all three platforms, we obtained high-quality SNPs outside target regions, and some far from target regions. In our Agilent SureSelect data, we obtained 84,049 high-quality SNPs outside target regions compared to 65,231 SNPs inside target regions (a 129% increase). For our Illumina TrueSeq data, we obtained 222,171 high-quality SNPs outside target regions compared to 95,818 SNPs inside target regions (a 232% increase). For the data from the 1000 Genomes Project, we obtained 7,139 high-quality SNPs outside target regions compared to 1,548 SNPs inside target regions (a 461% increase). CONCLUSIONS: These results demonstrate that a significant amount of high quality genotypes outside target regions can be obtained from exome sequencing data. These data should not be ignored in genetic epidemiology studies. |
format | Online Article Text |
id | pubmed-3416685 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-34166852012-08-11 Exome sequencing generates high quality data in non-target regions Guo, Yan Long, Jirong He, Jing Li, Chung-I Cai, Qiuyin Shu, Xiao-Ou Zheng, Wei Li, Chun BMC Genomics Methodology Article BACKGROUND: Exome sequencing using next-generation sequencing technologies is a cost efficient approach to selectively sequencing coding regions of human genome for detection of disease variants. A significant amount of DNA fragments from the capture process fall outside target regions, and sequence data for positions outside target regions have been mostly ignored after alignment. RESULT: We performed whole exome sequencing on 22 subjects using Agilent SureSelect capture reagent and 6 subjects using Illumina TrueSeq capture reagent. We also downloaded sequencing data for 6 subjects from the 1000 Genomes Project Pilot 3 study. Using these data, we examined the quality of SNPs detected outside target regions by computing consistency rate with genotypes obtained from SNP chips or the Hapmap database, transition-transversion (Ti/Tv) ratio, and percentage of SNPs inside dbSNP. For all three platforms, we obtained high-quality SNPs outside target regions, and some far from target regions. In our Agilent SureSelect data, we obtained 84,049 high-quality SNPs outside target regions compared to 65,231 SNPs inside target regions (a 129% increase). For our Illumina TrueSeq data, we obtained 222,171 high-quality SNPs outside target regions compared to 95,818 SNPs inside target regions (a 232% increase). For the data from the 1000 Genomes Project, we obtained 7,139 high-quality SNPs outside target regions compared to 1,548 SNPs inside target regions (a 461% increase). CONCLUSIONS: These results demonstrate that a significant amount of high quality genotypes outside target regions can be obtained from exome sequencing data. These data should not be ignored in genetic epidemiology studies. BioMed Central 2012-05-20 /pmc/articles/PMC3416685/ /pubmed/22607156 http://dx.doi.org/10.1186/1471-2164-13-194 Text en Copyright ©2012 Guo et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Guo, Yan Long, Jirong He, Jing Li, Chung-I Cai, Qiuyin Shu, Xiao-Ou Zheng, Wei Li, Chun Exome sequencing generates high quality data in non-target regions |
title | Exome sequencing generates high quality data in non-target regions |
title_full | Exome sequencing generates high quality data in non-target regions |
title_fullStr | Exome sequencing generates high quality data in non-target regions |
title_full_unstemmed | Exome sequencing generates high quality data in non-target regions |
title_short | Exome sequencing generates high quality data in non-target regions |
title_sort | exome sequencing generates high quality data in non-target regions |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3416685/ https://www.ncbi.nlm.nih.gov/pubmed/22607156 http://dx.doi.org/10.1186/1471-2164-13-194 |
work_keys_str_mv | AT guoyan exomesequencinggenerateshighqualitydatainnontargetregions AT longjirong exomesequencinggenerateshighqualitydatainnontargetregions AT hejing exomesequencinggenerateshighqualitydatainnontargetregions AT lichungi exomesequencinggenerateshighqualitydatainnontargetregions AT caiqiuyin exomesequencinggenerateshighqualitydatainnontargetregions AT shuxiaoou exomesequencinggenerateshighqualitydatainnontargetregions AT zhengwei exomesequencinggenerateshighqualitydatainnontargetregions AT lichun exomesequencinggenerateshighqualitydatainnontargetregions |