Cargando…

Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection

The detection of somatic mutations from cancer genome sequences is key to understanding the genetic basis of disease progression, patient survival and response to therapy. Benchmarking is needed for tool assessment and improvement but is complicated by a lack of gold standards, by extensive resource...

Descripción completa

Detalles Bibliográficos
Autores principales: Ewing, Adam D, Houlahan, Kathleen E, Hu, Yin, Ellrott, Kyle, Caloian, Cristian, Yamaguchi, Takafumi N, Bare, J Christopher, P’ng, Christine, Waggott, Daryl, Sabelnykova, Veronica Y, Kellen, Michael R, Norman, Thea C, Haussler, David, Friend, Stephen H, Stolovitzky, Gustavo, Margolin, Adam A, Stuart, Joshua M, Boutros, Paul C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4856034/
https://www.ncbi.nlm.nih.gov/pubmed/25984700
http://dx.doi.org/10.1038/nmeth.3407
_version_ 1782430449164877824
author Ewing, Adam D
Houlahan, Kathleen E
Hu, Yin
Ellrott, Kyle
Caloian, Cristian
Yamaguchi, Takafumi N
Bare, J Christopher
P’ng, Christine
Waggott, Daryl
Sabelnykova, Veronica Y
Kellen, Michael R
Norman, Thea C
Haussler, David
Friend, Stephen H
Stolovitzky, Gustavo
Margolin, Adam A
Stuart, Joshua M
Boutros, Paul C
author_facet Ewing, Adam D
Houlahan, Kathleen E
Hu, Yin
Ellrott, Kyle
Caloian, Cristian
Yamaguchi, Takafumi N
Bare, J Christopher
P’ng, Christine
Waggott, Daryl
Sabelnykova, Veronica Y
Kellen, Michael R
Norman, Thea C
Haussler, David
Friend, Stephen H
Stolovitzky, Gustavo
Margolin, Adam A
Stuart, Joshua M
Boutros, Paul C
author_sort Ewing, Adam D
collection PubMed
description The detection of somatic mutations from cancer genome sequences is key to understanding the genetic basis of disease progression, patient survival and response to therapy. Benchmarking is needed for tool assessment and improvement but is complicated by a lack of gold standards, by extensive resource requirements and by difficulties in sharing personal genomic information. To resolve these issues, we launched the ICGC-TCGA DREAM Somatic Mutation Calling Challenge, a crowdsourced benchmark of somatic mutation detection algorithms. Here we report the BAMSurgeon tool for simulating cancer genomes and the results of 248 analyses of three in silico tumors created with it. Different algorithms exhibit characteristic error profiles, and, intriguingly, false positives show a trinucleotide profile very similar to one found in human tumors. Although the three simulated tumors differ in sequence contamination (deviation from normal cell sequence) and in subclonality, an ensemble of pipelines outperforms the best individual pipeline in all cases. BAMSurgeon is available at https://github.com/adamewing/bamsurgeon/.
format Online
Article
Text
id pubmed-4856034
institution National Center for Biotechnology Information
language English
publishDate 2015
record_format MEDLINE/PubMed
spelling pubmed-48560342016-05-04 Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection Ewing, Adam D Houlahan, Kathleen E Hu, Yin Ellrott, Kyle Caloian, Cristian Yamaguchi, Takafumi N Bare, J Christopher P’ng, Christine Waggott, Daryl Sabelnykova, Veronica Y Kellen, Michael R Norman, Thea C Haussler, David Friend, Stephen H Stolovitzky, Gustavo Margolin, Adam A Stuart, Joshua M Boutros, Paul C Nat Methods Article The detection of somatic mutations from cancer genome sequences is key to understanding the genetic basis of disease progression, patient survival and response to therapy. Benchmarking is needed for tool assessment and improvement but is complicated by a lack of gold standards, by extensive resource requirements and by difficulties in sharing personal genomic information. To resolve these issues, we launched the ICGC-TCGA DREAM Somatic Mutation Calling Challenge, a crowdsourced benchmark of somatic mutation detection algorithms. Here we report the BAMSurgeon tool for simulating cancer genomes and the results of 248 analyses of three in silico tumors created with it. Different algorithms exhibit characteristic error profiles, and, intriguingly, false positives show a trinucleotide profile very similar to one found in human tumors. Although the three simulated tumors differ in sequence contamination (deviation from normal cell sequence) and in subclonality, an ensemble of pipelines outperforms the best individual pipeline in all cases. BAMSurgeon is available at https://github.com/adamewing/bamsurgeon/. 2015-05-18 2015-07 /pmc/articles/PMC4856034/ /pubmed/25984700 http://dx.doi.org/10.1038/nmeth.3407 Text en Reprints and permissions information is available online at http://www.nature.com/reprints/index.html. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/.
spellingShingle Article
Ewing, Adam D
Houlahan, Kathleen E
Hu, Yin
Ellrott, Kyle
Caloian, Cristian
Yamaguchi, Takafumi N
Bare, J Christopher
P’ng, Christine
Waggott, Daryl
Sabelnykova, Veronica Y
Kellen, Michael R
Norman, Thea C
Haussler, David
Friend, Stephen H
Stolovitzky, Gustavo
Margolin, Adam A
Stuart, Joshua M
Boutros, Paul C
Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection
title Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection
title_full Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection
title_fullStr Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection
title_full_unstemmed Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection
title_short Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection
title_sort combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4856034/
https://www.ncbi.nlm.nih.gov/pubmed/25984700
http://dx.doi.org/10.1038/nmeth.3407
work_keys_str_mv AT ewingadamd combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT houlahankathleene combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT huyin combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT ellrottkyle combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT caloiancristian combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT yamaguchitakafumin combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT barejchristopher combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT pngchristine combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT waggottdaryl combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT sabelnykovaveronicay combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT kellenmichaelr combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT normantheac combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT hausslerdavid combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT friendstephenh combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT stolovitzkygustavo combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT margolinadama combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT stuartjoshuam combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection
AT boutrospaulc combiningtumorgenomesimulationwithcrowdsourcingtobenchmarksomaticsinglenucleotidevariantdetection