Cargando…

Comparison of solution-based exome capture methods for next generation sequencing

BACKGROUND: Techniques enabling targeted re-sequencing of the protein coding sequences of the human genome on next generation sequencing instruments are of great interest. We conducted a systematic comparison of the solution-based exome capture kits provided by Agilent and Roche NimbleGen. A control...

Descripción completa

Detalles Bibliográficos
Autores principales: Sulonen, Anna-Maija, Ellonen, Pekka, Almusa, Henrikki, Lepistö, Maija, Eldfors, Samuli, Hannula, Sari, Miettinen, Timo, Tyynismaa, Henna, Salo, Perttu, Heckman, Caroline, Joensuu, Heikki, Raivio, Taneli, Suomalainen, Anu, Saarela, Janna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3308057/
https://www.ncbi.nlm.nih.gov/pubmed/21955854
http://dx.doi.org/10.1186/gb-2011-12-9-r94
_version_ 1782227386869219328
author Sulonen, Anna-Maija
Ellonen, Pekka
Almusa, Henrikki
Lepistö, Maija
Eldfors, Samuli
Hannula, Sari
Miettinen, Timo
Tyynismaa, Henna
Salo, Perttu
Heckman, Caroline
Joensuu, Heikki
Raivio, Taneli
Suomalainen, Anu
Saarela, Janna
author_facet Sulonen, Anna-Maija
Ellonen, Pekka
Almusa, Henrikki
Lepistö, Maija
Eldfors, Samuli
Hannula, Sari
Miettinen, Timo
Tyynismaa, Henna
Salo, Perttu
Heckman, Caroline
Joensuu, Heikki
Raivio, Taneli
Suomalainen, Anu
Saarela, Janna
author_sort Sulonen, Anna-Maija
collection PubMed
description BACKGROUND: Techniques enabling targeted re-sequencing of the protein coding sequences of the human genome on next generation sequencing instruments are of great interest. We conducted a systematic comparison of the solution-based exome capture kits provided by Agilent and Roche NimbleGen. A control DNA sample was captured with all four capture methods and prepared for Illumina GAII sequencing. Sequence data from additional samples prepared with the same protocols were also used in the comparison. RESULTS: We developed a bioinformatics pipeline for quality control, short read alignment, variant identification and annotation of the sequence data. In our analysis, a larger percentage of the high quality reads from the NimbleGen captures than from the Agilent captures aligned to the capture target regions. High GC content of the target sequence was associated with poor capture success in all exome enrichment methods. Comparison of mean allele balances for heterozygous variants indicated a tendency to have more reference bases than variant bases in the heterozygous variant positions within the target regions in all methods. There was virtually no difference in the genotype concordance compared to genotypes derived from SNP arrays. A minimum of 11× coverage was required to make a heterozygote genotype call with 99% accuracy when compared to common SNPs on genome-wide association arrays. CONCLUSIONS: Libraries captured with NimbleGen kits aligned more accurately to the target regions. The updated NimbleGen kit most efficiently covered the exome with a minimum coverage of 20×, yet none of the kits captured all the Consensus Coding Sequence annotated exons.
format Online
Article
Text
id pubmed-3308057
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33080572012-03-20 Comparison of solution-based exome capture methods for next generation sequencing Sulonen, Anna-Maija Ellonen, Pekka Almusa, Henrikki Lepistö, Maija Eldfors, Samuli Hannula, Sari Miettinen, Timo Tyynismaa, Henna Salo, Perttu Heckman, Caroline Joensuu, Heikki Raivio, Taneli Suomalainen, Anu Saarela, Janna Genome Biol Research BACKGROUND: Techniques enabling targeted re-sequencing of the protein coding sequences of the human genome on next generation sequencing instruments are of great interest. We conducted a systematic comparison of the solution-based exome capture kits provided by Agilent and Roche NimbleGen. A control DNA sample was captured with all four capture methods and prepared for Illumina GAII sequencing. Sequence data from additional samples prepared with the same protocols were also used in the comparison. RESULTS: We developed a bioinformatics pipeline for quality control, short read alignment, variant identification and annotation of the sequence data. In our analysis, a larger percentage of the high quality reads from the NimbleGen captures than from the Agilent captures aligned to the capture target regions. High GC content of the target sequence was associated with poor capture success in all exome enrichment methods. Comparison of mean allele balances for heterozygous variants indicated a tendency to have more reference bases than variant bases in the heterozygous variant positions within the target regions in all methods. There was virtually no difference in the genotype concordance compared to genotypes derived from SNP arrays. A minimum of 11× coverage was required to make a heterozygote genotype call with 99% accuracy when compared to common SNPs on genome-wide association arrays. CONCLUSIONS: Libraries captured with NimbleGen kits aligned more accurately to the target regions. The updated NimbleGen kit most efficiently covered the exome with a minimum coverage of 20×, yet none of the kits captured all the Consensus Coding Sequence annotated exons. BioMed Central 2011 2011-09-28 /pmc/articles/PMC3308057/ /pubmed/21955854 http://dx.doi.org/10.1186/gb-2011-12-9-r94 Text en Copyright ©2011 Sulonen et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Sulonen, Anna-Maija
Ellonen, Pekka
Almusa, Henrikki
Lepistö, Maija
Eldfors, Samuli
Hannula, Sari
Miettinen, Timo
Tyynismaa, Henna
Salo, Perttu
Heckman, Caroline
Joensuu, Heikki
Raivio, Taneli
Suomalainen, Anu
Saarela, Janna
Comparison of solution-based exome capture methods for next generation sequencing
title Comparison of solution-based exome capture methods for next generation sequencing
title_full Comparison of solution-based exome capture methods for next generation sequencing
title_fullStr Comparison of solution-based exome capture methods for next generation sequencing
title_full_unstemmed Comparison of solution-based exome capture methods for next generation sequencing
title_short Comparison of solution-based exome capture methods for next generation sequencing
title_sort comparison of solution-based exome capture methods for next generation sequencing
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3308057/
https://www.ncbi.nlm.nih.gov/pubmed/21955854
http://dx.doi.org/10.1186/gb-2011-12-9-r94
work_keys_str_mv AT sulonenannamaija comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing
AT ellonenpekka comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing
AT almusahenrikki comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing
AT lepistomaija comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing
AT eldforssamuli comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing
AT hannulasari comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing
AT miettinentimo comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing
AT tyynismaahenna comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing
AT saloperttu comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing
AT heckmancaroline comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing
AT joensuuheikki comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing
AT raiviotaneli comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing
AT suomalainenanu comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing
AT saarelajanna comparisonofsolutionbasedexomecapturemethodsfornextgenerationsequencing