Cargando…

Discriminating somatic and germline mutations in tumor DNA samples without matching normals

Tumor analyses commonly employ a correction with a matched normal (MN), a sample from healthy tissue of the same individual, in order to distinguish germline mutations from somatic mutations. Since the majority of variants found in an individual are thought to be common within the population, we con...

Descripción completa

Detalles Bibliográficos
Autores principales: Hiltemann, Saskia, Jenster, Guido, Trapman, Jan, van der Spek, Peter, Stubbs, Andrew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4561496/
https://www.ncbi.nlm.nih.gov/pubmed/26209359
http://dx.doi.org/10.1101/gr.183053.114
_version_ 1782389047234134016
author Hiltemann, Saskia
Jenster, Guido
Trapman, Jan
van der Spek, Peter
Stubbs, Andrew
author_facet Hiltemann, Saskia
Jenster, Guido
Trapman, Jan
van der Spek, Peter
Stubbs, Andrew
author_sort Hiltemann, Saskia
collection PubMed
description Tumor analyses commonly employ a correction with a matched normal (MN), a sample from healthy tissue of the same individual, in order to distinguish germline mutations from somatic mutations. Since the majority of variants found in an individual are thought to be common within the population, we constructed a set of 931 samples from healthy, unrelated individuals, originating from two different sequencing platforms, to serve as a virtual normal (VN) in the absence of such an associated normal sample. Our approach removed (1) >96% of the germline variants also removed by the MN sample and (2) a large number (2%–8%) of additional variants not corrected for by the associated normal. The combination of the VN with the MN improved the correction for polymorphisms significantly, with up to ∼30% compared with MN and ∼15% compared with VN only. We determined the number of unrelated genomes needed in order to correct at least as efficiently as the MN is about 200 for structural variations (SVs) and about 400 for single-nucleotide variants (SNVs) and indels. In addition, we propose that the removal of common variants with purely position-based methods is inaccurate and incurs additional false-positive somatic variants, and more sophisticated algorithms, which are capable of leveraging information about the area surrounding variants, are needed for optimal accuracy. Our VN correction method can be used to analyze any list of variants, regardless of sequencing platform of origin. This VN methodology is available for use on our public Galaxy server.
format Online
Article
Text
id pubmed-4561496
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-45614962016-03-01 Discriminating somatic and germline mutations in tumor DNA samples without matching normals Hiltemann, Saskia Jenster, Guido Trapman, Jan van der Spek, Peter Stubbs, Andrew Genome Res Method Tumor analyses commonly employ a correction with a matched normal (MN), a sample from healthy tissue of the same individual, in order to distinguish germline mutations from somatic mutations. Since the majority of variants found in an individual are thought to be common within the population, we constructed a set of 931 samples from healthy, unrelated individuals, originating from two different sequencing platforms, to serve as a virtual normal (VN) in the absence of such an associated normal sample. Our approach removed (1) >96% of the germline variants also removed by the MN sample and (2) a large number (2%–8%) of additional variants not corrected for by the associated normal. The combination of the VN with the MN improved the correction for polymorphisms significantly, with up to ∼30% compared with MN and ∼15% compared with VN only. We determined the number of unrelated genomes needed in order to correct at least as efficiently as the MN is about 200 for structural variations (SVs) and about 400 for single-nucleotide variants (SNVs) and indels. In addition, we propose that the removal of common variants with purely position-based methods is inaccurate and incurs additional false-positive somatic variants, and more sophisticated algorithms, which are capable of leveraging information about the area surrounding variants, are needed for optimal accuracy. Our VN correction method can be used to analyze any list of variants, regardless of sequencing platform of origin. This VN methodology is available for use on our public Galaxy server. Cold Spring Harbor Laboratory Press 2015-09 /pmc/articles/PMC4561496/ /pubmed/26209359 http://dx.doi.org/10.1101/gr.183053.114 Text en © 2015 Hiltemann et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Hiltemann, Saskia
Jenster, Guido
Trapman, Jan
van der Spek, Peter
Stubbs, Andrew
Discriminating somatic and germline mutations in tumor DNA samples without matching normals
title Discriminating somatic and germline mutations in tumor DNA samples without matching normals
title_full Discriminating somatic and germline mutations in tumor DNA samples without matching normals
title_fullStr Discriminating somatic and germline mutations in tumor DNA samples without matching normals
title_full_unstemmed Discriminating somatic and germline mutations in tumor DNA samples without matching normals
title_short Discriminating somatic and germline mutations in tumor DNA samples without matching normals
title_sort discriminating somatic and germline mutations in tumor dna samples without matching normals
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4561496/
https://www.ncbi.nlm.nih.gov/pubmed/26209359
http://dx.doi.org/10.1101/gr.183053.114
work_keys_str_mv AT hiltemannsaskia discriminatingsomaticandgermlinemutationsintumordnasampleswithoutmatchingnormals
AT jensterguido discriminatingsomaticandgermlinemutationsintumordnasampleswithoutmatchingnormals
AT trapmanjan discriminatingsomaticandgermlinemutationsintumordnasampleswithoutmatchingnormals
AT vanderspekpeter discriminatingsomaticandgermlinemutationsintumordnasampleswithoutmatchingnormals
AT stubbsandrew discriminatingsomaticandgermlinemutationsintumordnasampleswithoutmatchingnormals