Cargando…

Combining calls from multiple somatic mutation-callers

BACKGROUND: Accurate somatic mutation-calling is essential for insightful mutation analyses in cancer studies. Several mutation-callers are publicly available and more are likely to appear. Nonetheless, mutation-calling is still challenging and there is unlikely to be one established caller that sys...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Su Yeon, Jacob, Laurent, Speed, Terence P
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4035752/
https://www.ncbi.nlm.nih.gov/pubmed/24885750
http://dx.doi.org/10.1186/1471-2105-15-154
_version_ 1782318098622185472
author Kim, Su Yeon
Jacob, Laurent
Speed, Terence P
author_facet Kim, Su Yeon
Jacob, Laurent
Speed, Terence P
author_sort Kim, Su Yeon
collection PubMed
description BACKGROUND: Accurate somatic mutation-calling is essential for insightful mutation analyses in cancer studies. Several mutation-callers are publicly available and more are likely to appear. Nonetheless, mutation-calling is still challenging and there is unlikely to be one established caller that systematically outperforms all others. Therefore, fully utilizing multiple callers can be a powerful way to construct a list of final calls for one’s research. RESULTS: Using a set of mutations from multiple callers that are impartially validated, we present a statistical approach for building a combined caller, which can be applied to combine calls in a wider dataset generated using a similar protocol. Using the mutation outputs and the validation data from The Cancer Genome Atlas endometrial study (6,746 sites), we demonstrate how to build a statistical model that predicts the probability of each call being a somatic mutation, based on the detection status of multiple callers and a few associated features. CONCLUSION: The approach allows us to build a combined caller across the full range of stringency levels, which outperforms all of the individual callers.
format Online
Article
Text
id pubmed-4035752
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40357522014-06-11 Combining calls from multiple somatic mutation-callers Kim, Su Yeon Jacob, Laurent Speed, Terence P BMC Bioinformatics Research Article BACKGROUND: Accurate somatic mutation-calling is essential for insightful mutation analyses in cancer studies. Several mutation-callers are publicly available and more are likely to appear. Nonetheless, mutation-calling is still challenging and there is unlikely to be one established caller that systematically outperforms all others. Therefore, fully utilizing multiple callers can be a powerful way to construct a list of final calls for one’s research. RESULTS: Using a set of mutations from multiple callers that are impartially validated, we present a statistical approach for building a combined caller, which can be applied to combine calls in a wider dataset generated using a similar protocol. Using the mutation outputs and the validation data from The Cancer Genome Atlas endometrial study (6,746 sites), we demonstrate how to build a statistical model that predicts the probability of each call being a somatic mutation, based on the detection status of multiple callers and a few associated features. CONCLUSION: The approach allows us to build a combined caller across the full range of stringency levels, which outperforms all of the individual callers. BioMed Central 2014-05-21 /pmc/articles/PMC4035752/ /pubmed/24885750 http://dx.doi.org/10.1186/1471-2105-15-154 Text en Copyright © 2014 Kim et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Kim, Su Yeon
Jacob, Laurent
Speed, Terence P
Combining calls from multiple somatic mutation-callers
title Combining calls from multiple somatic mutation-callers
title_full Combining calls from multiple somatic mutation-callers
title_fullStr Combining calls from multiple somatic mutation-callers
title_full_unstemmed Combining calls from multiple somatic mutation-callers
title_short Combining calls from multiple somatic mutation-callers
title_sort combining calls from multiple somatic mutation-callers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4035752/
https://www.ncbi.nlm.nih.gov/pubmed/24885750
http://dx.doi.org/10.1186/1471-2105-15-154
work_keys_str_mv AT kimsuyeon combiningcallsfrommultiplesomaticmutationcallers
AT jacoblaurent combiningcallsfrommultiplesomaticmutationcallers
AT speedterencep combiningcallsfrommultiplesomaticmutationcallers