Cargando…

Reproducibility of SNV-calling in multiple sequencing runs from single tumors

We examined 55 technical sequencing replicates of Glioblastoma multiforme (GBM) tumors from The Cancer Genome Atlas (TCGA) to ascertain the degree of repeatability in calling single-nucleotide variants (SNVs). We used the same mutation-calling pipeline on all pairs of samples, and we measured the ex...

Descripción completa

Detalles Bibliográficos
Autores principales: Derryberry, Dakota Z., Cowperthwaite, Matthew C., Wilke, Claus O.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4741064/
https://www.ncbi.nlm.nih.gov/pubmed/26855855
http://dx.doi.org/10.7717/peerj.1508
_version_ 1782413940095975424
author Derryberry, Dakota Z.
Cowperthwaite, Matthew C.
Wilke, Claus O.
author_facet Derryberry, Dakota Z.
Cowperthwaite, Matthew C.
Wilke, Claus O.
author_sort Derryberry, Dakota Z.
collection PubMed
description We examined 55 technical sequencing replicates of Glioblastoma multiforme (GBM) tumors from The Cancer Genome Atlas (TCGA) to ascertain the degree of repeatability in calling single-nucleotide variants (SNVs). We used the same mutation-calling pipeline on all pairs of samples, and we measured the extent of the overlap between two replicates; that is, how many specific point mutations were found in both replicates. We further tested whether additional filtering increased or decreased the size of the overlap. We found that about half of the putative mutations identified in one sequencing run of a given sample were also identified in the second, and that this percentage remained steady throughout orders of magnitude of variation in the total number of mutations identified (from 23 to 10,966). We further found that using filtering after SNV-calling removed the overlap completely. We concluded that there is variation in the frequency of mutations in GBMs, and that while some filtering approaches preferentially removed putative mutations found in only one replicate, others removed a large fraction of putative mutations found in both.
format Online
Article
Text
id pubmed-4741064
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-47410642016-02-05 Reproducibility of SNV-calling in multiple sequencing runs from single tumors Derryberry, Dakota Z. Cowperthwaite, Matthew C. Wilke, Claus O. PeerJ Bioinformatics We examined 55 technical sequencing replicates of Glioblastoma multiforme (GBM) tumors from The Cancer Genome Atlas (TCGA) to ascertain the degree of repeatability in calling single-nucleotide variants (SNVs). We used the same mutation-calling pipeline on all pairs of samples, and we measured the extent of the overlap between two replicates; that is, how many specific point mutations were found in both replicates. We further tested whether additional filtering increased or decreased the size of the overlap. We found that about half of the putative mutations identified in one sequencing run of a given sample were also identified in the second, and that this percentage remained steady throughout orders of magnitude of variation in the total number of mutations identified (from 23 to 10,966). We further found that using filtering after SNV-calling removed the overlap completely. We concluded that there is variation in the frequency of mutations in GBMs, and that while some filtering approaches preferentially removed putative mutations found in only one replicate, others removed a large fraction of putative mutations found in both. PeerJ Inc. 2016-01-04 /pmc/articles/PMC4741064/ /pubmed/26855855 http://dx.doi.org/10.7717/peerj.1508 Text en ©2016 Derryberry et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Derryberry, Dakota Z.
Cowperthwaite, Matthew C.
Wilke, Claus O.
Reproducibility of SNV-calling in multiple sequencing runs from single tumors
title Reproducibility of SNV-calling in multiple sequencing runs from single tumors
title_full Reproducibility of SNV-calling in multiple sequencing runs from single tumors
title_fullStr Reproducibility of SNV-calling in multiple sequencing runs from single tumors
title_full_unstemmed Reproducibility of SNV-calling in multiple sequencing runs from single tumors
title_short Reproducibility of SNV-calling in multiple sequencing runs from single tumors
title_sort reproducibility of snv-calling in multiple sequencing runs from single tumors
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4741064/
https://www.ncbi.nlm.nih.gov/pubmed/26855855
http://dx.doi.org/10.7717/peerj.1508
work_keys_str_mv AT derryberrydakotaz reproducibilityofsnvcallinginmultiplesequencingrunsfromsingletumors
AT cowperthwaitematthewc reproducibilityofsnvcallinginmultiplesequencingrunsfromsingletumors
AT wilkeclauso reproducibilityofsnvcallinginmultiplesequencingrunsfromsingletumors