Cargando…
Analytical validation of whole exome and whole genome sequencing for clinical applications
BACKGROUND: Whole exome and genome sequencing (WES/WGS) is now routinely offered as a clinical test by a growing number of laboratories. As part of the test design process each laboratory must determine the performance characteristics of the platform, test and informatics pipeline. This report docum...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4022392/ https://www.ncbi.nlm.nih.gov/pubmed/24758382 http://dx.doi.org/10.1186/1755-8794-7-20 |
_version_ | 1782316396254855168 |
---|---|
author | Linderman, Michael D Brandt, Tracy Edelmann, Lisa Jabado, Omar Kasai, Yumi Kornreich, Ruth Mahajan, Milind Shah, Hardik Kasarskis, Andrew Schadt, Eric E |
author_facet | Linderman, Michael D Brandt, Tracy Edelmann, Lisa Jabado, Omar Kasai, Yumi Kornreich, Ruth Mahajan, Milind Shah, Hardik Kasarskis, Andrew Schadt, Eric E |
author_sort | Linderman, Michael D |
collection | PubMed |
description | BACKGROUND: Whole exome and genome sequencing (WES/WGS) is now routinely offered as a clinical test by a growing number of laboratories. As part of the test design process each laboratory must determine the performance characteristics of the platform, test and informatics pipeline. This report documents one such characterization of WES/WGS. METHODS: Whole exome and whole genome sequencing was performed on multiple technical replicates of five reference samples using the Illumina HiSeq 2000/2500. The sequencing data was processed with a GATK-based genome analysis pipeline to evaluate: intra-run, inter-run, inter-mode, inter-machine and inter-library consistency, concordance with orthogonal technologies (microarray, Sanger) and sensitivity and accuracy relative to known variant sets. RESULTS: Concordance to high-density microarrays consistently exceeds 97% (and typically exceeds 99%) and concordance between sequencing replicates also exceeds 97%, with no observable differences between different flow cells, runs, machines or modes. Sensitivity relative to high-density microarray variants exceeds 95%. In a detailed study of a 129 kb region, sensitivity was lower with some validated single-base insertions and deletions “not called”. Different variants are "not called" in each replicate: of all variants identified in WES data from the NA12878 reference sample 74% of indels and 89% of SNVs were called in all seven replicates, in NA12878 WGS 52% of indels and 88% of SNVs were called in all six replicates. Key sources of non-uniformity are variance in depth of coverage, artifactual variants resulting from repetitive regions and larger structural variants. |
format | Online Article Text |
id | pubmed-4022392 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40223922014-05-16 Analytical validation of whole exome and whole genome sequencing for clinical applications Linderman, Michael D Brandt, Tracy Edelmann, Lisa Jabado, Omar Kasai, Yumi Kornreich, Ruth Mahajan, Milind Shah, Hardik Kasarskis, Andrew Schadt, Eric E BMC Med Genomics Research Article BACKGROUND: Whole exome and genome sequencing (WES/WGS) is now routinely offered as a clinical test by a growing number of laboratories. As part of the test design process each laboratory must determine the performance characteristics of the platform, test and informatics pipeline. This report documents one such characterization of WES/WGS. METHODS: Whole exome and whole genome sequencing was performed on multiple technical replicates of five reference samples using the Illumina HiSeq 2000/2500. The sequencing data was processed with a GATK-based genome analysis pipeline to evaluate: intra-run, inter-run, inter-mode, inter-machine and inter-library consistency, concordance with orthogonal technologies (microarray, Sanger) and sensitivity and accuracy relative to known variant sets. RESULTS: Concordance to high-density microarrays consistently exceeds 97% (and typically exceeds 99%) and concordance between sequencing replicates also exceeds 97%, with no observable differences between different flow cells, runs, machines or modes. Sensitivity relative to high-density microarray variants exceeds 95%. In a detailed study of a 129 kb region, sensitivity was lower with some validated single-base insertions and deletions “not called”. Different variants are "not called" in each replicate: of all variants identified in WES data from the NA12878 reference sample 74% of indels and 89% of SNVs were called in all seven replicates, in NA12878 WGS 52% of indels and 88% of SNVs were called in all six replicates. Key sources of non-uniformity are variance in depth of coverage, artifactual variants resulting from repetitive regions and larger structural variants. BioMed Central 2014-04-23 /pmc/articles/PMC4022392/ /pubmed/24758382 http://dx.doi.org/10.1186/1755-8794-7-20 Text en Copyright © 2014 Linderman et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Linderman, Michael D Brandt, Tracy Edelmann, Lisa Jabado, Omar Kasai, Yumi Kornreich, Ruth Mahajan, Milind Shah, Hardik Kasarskis, Andrew Schadt, Eric E Analytical validation of whole exome and whole genome sequencing for clinical applications |
title | Analytical validation of whole exome and whole genome sequencing for clinical applications |
title_full | Analytical validation of whole exome and whole genome sequencing for clinical applications |
title_fullStr | Analytical validation of whole exome and whole genome sequencing for clinical applications |
title_full_unstemmed | Analytical validation of whole exome and whole genome sequencing for clinical applications |
title_short | Analytical validation of whole exome and whole genome sequencing for clinical applications |
title_sort | analytical validation of whole exome and whole genome sequencing for clinical applications |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4022392/ https://www.ncbi.nlm.nih.gov/pubmed/24758382 http://dx.doi.org/10.1186/1755-8794-7-20 |
work_keys_str_mv | AT lindermanmichaeld analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications AT brandttracy analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications AT edelmannlisa analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications AT jabadoomar analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications AT kasaiyumi analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications AT kornreichruth analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications AT mahajanmilind analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications AT shahhardik analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications AT kasarskisandrew analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications AT schadterice analyticalvalidationofwholeexomeandwholegenomesequencingforclinicalapplications |