Cargando…

Analytical validation of whole exome and whole genome sequencing for clinical applications

BACKGROUND: Whole exome and genome sequencing (WES/WGS) is now routinely offered as a clinical test by a growing number of laboratories. As part of the test design process each laboratory must determine the performance characteristics of the platform, test and informatics pipeline. This report docum...

Descripción completa

Detalles Bibliográficos
Autores principales: Linderman, Michael D, Brandt, Tracy, Edelmann, Lisa, Jabado, Omar, Kasai, Yumi, Kornreich, Ruth, Mahajan, Milind, Shah, Hardik, Kasarskis, Andrew, Schadt, Eric E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4022392/
https://www.ncbi.nlm.nih.gov/pubmed/24758382
http://dx.doi.org/10.1186/1755-8794-7-20
_version_ 1782316396254855168
author Linderman, Michael D
Brandt, Tracy
Edelmann, Lisa
Jabado, Omar
Kasai, Yumi
Kornreich, Ruth
Mahajan, Milind
Shah, Hardik
Kasarskis, Andrew
Schadt, Eric E
author_facet Linderman, Michael D
Brandt, Tracy
Edelmann, Lisa
Jabado, Omar
Kasai, Yumi
Kornreich, Ruth
Mahajan, Milind
Shah, Hardik
Kasarskis, Andrew
Schadt, Eric E
author_sort Linderman, Michael D
collection PubMed
description BACKGROUND: Whole exome and genome sequencing (WES/WGS) is now routinely offered as a clinical test by a growing number of laboratories. As part of the test design process each laboratory must determine the performance characteristics of the platform, test and informatics pipeline. This report documents one such characterization of WES/WGS. METHODS: Whole exome and whole genome sequencing was performed on multiple technical replicates of five reference samples using the Illumina HiSeq 2000/2500. The sequencing data was processed with a GATK-based genome analysis pipeline to evaluate: intra-run, inter-run, inter-mode, inter-machine and inter-library consistency, concordance with orthogonal technologies (microarray, Sanger) and sensitivity and accuracy relative to known variant sets. RESULTS: Concordance to high-density microarrays consistently exceeds 97% (and typically exceeds 99%) and concordance between sequencing replicates also exceeds 97%, with no observable differences between different flow cells, runs, machines or modes. Sensitivity relative to high-density microarray variants exceeds 95%. In a detailed study of a 129 kb region, sensitivity was lower with some validated single-base insertions and deletions “not called”. Different variants are "not called" in each replicate: of all variants identified in WES data from the NA12878 reference sample 74% of indels and 89% of SNVs were called in all seven replicates, in NA12878 WGS 52% of indels and 88% of SNVs were called in all six replicates. Key sources of non-uniformity are variance in depth of coverage, artifactual variants resulting from repetitive regions and larger structural variants.
format Online
Article
Text
id pubmed-4022392
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40223922014-05-16 Analytical validation of whole exome and whole genome sequencing for clinical applications Linderman, Michael D Brandt, Tracy Edelmann, Lisa Jabado, Omar Kasai, Yumi Kornreich, Ruth Mahajan, Milind Shah, Hardik Kasarskis, Andrew Schadt, Eric E BMC Med Genomics Research Article BACKGROUND: Whole exome and genome sequencing (WES/WGS) is now routinely offered as a clinical test by a growing number of laboratories. As part of the test design process each laboratory must determine the performance characteristics of the platform, test and informatics pipeline. This report documents one such characterization of WES/WGS. METHODS: Whole exome and whole genome sequencing was performed on multiple technical replicates of five reference samples using the Illumina HiSeq 2000/2500. The sequencing data was processed with a GATK-based genome analysis pipeline to evaluate: intra-run, inter-run, inter-mode, inter-machine and inter-library consistency, concordance with orthogonal technologies (microarray, Sanger) and sensitivity and accuracy relative to known variant sets. RESULTS: Concordance to high-density microarrays consistently exceeds 97% (and typically exceeds 99%) and concordance between sequencing replicates also exceeds 97%, with no observable differences between different flow cells, runs, machines or modes. Sensitivity relative to high-density microarray variants exceeds 95%. In a detailed study of a 129 kb region, sensitivity was lower with some validated single-base insertions and deletions “not called”. Different variants are "not called" in each replicate: of all variants identified in WES data from the NA12878 reference sample 74% of indels and 89% of SNVs were called in all seven replicates, in NA12878 WGS 52% of indels and 88% of SNVs were called in all six replicates. Key sources of non-uniformity are variance in depth of coverage, artifactual variants resulting from repetitive regions and larger structural variants. BioMed Central 2014-04-23 /pmc/articles/PMC4022392/ /pubmed/24758382 http://dx.doi.org/10.1186/1755-8794-7-20 Text en Copyright © 2014 Linderman et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Linderman, Michael D
Brandt, Tracy
Edelmann, Lisa
Jabado, Omar
Kasai, Yumi
Kornreich, Ruth
Mahajan, Milind
Shah, Hardik
Kasarskis, Andrew
Schadt, Eric E
Analytical validation of whole exome and whole genome sequencing for clinical applications
title Analytical validation of whole exome and whole genome sequencing for clinical applications
title_full Analytical validation of whole exome and whole genome sequencing for clinical applications
title_fullStr Analytical validation of whole exome and whole genome sequencing for clinical applications
title_full_unstemmed Analytical validation of whole exome and whole genome sequencing for clinical applications
title_short Analytical validation of whole exome and whole genome sequencing for clinical applications
title_sort analytical validation of whole exome and whole genome sequencing for clinical applications
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4022392/
https://www.ncbi.nlm.nih.gov/pubmed/24758382
http://dx.doi.org/10.1186/1755-8794-7-20
work_keys_str_mv AT lindermanmichaeld analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications
AT brandttracy analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications
AT edelmannlisa analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications
AT jabadoomar analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications
AT kasaiyumi analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications
AT kornreichruth analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications
AT mahajanmilind analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications
AT shahhardik analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications
AT kasarskisandrew analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications
AT schadterice analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications