Cargando…
Combining calls from multiple somatic mutation-callers
BACKGROUND: Accurate somatic mutation-calling is essential for insightful mutation analyses in cancer studies. Several mutation-callers are publicly available and more are likely to appear. Nonetheless, mutation-calling is still challenging and there is unlikely to be one established caller that sys...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4035752/ https://www.ncbi.nlm.nih.gov/pubmed/24885750 http://dx.doi.org/10.1186/1471-2105-15-154 |
_version_ | 1782318098622185472 |
---|---|
author | Kim, Su Yeon Jacob, Laurent Speed, Terence P |
author_facet | Kim, Su Yeon Jacob, Laurent Speed, Terence P |
author_sort | Kim, Su Yeon |
collection | PubMed |
description | BACKGROUND: Accurate somatic mutation-calling is essential for insightful mutation analyses in cancer studies. Several mutation-callers are publicly available and more are likely to appear. Nonetheless, mutation-calling is still challenging and there is unlikely to be one established caller that systematically outperforms all others. Therefore, fully utilizing multiple callers can be a powerful way to construct a list of final calls for one’s research. RESULTS: Using a set of mutations from multiple callers that are impartially validated, we present a statistical approach for building a combined caller, which can be applied to combine calls in a wider dataset generated using a similar protocol. Using the mutation outputs and the validation data from The Cancer Genome Atlas endometrial study (6,746 sites), we demonstrate how to build a statistical model that predicts the probability of each call being a somatic mutation, based on the detection status of multiple callers and a few associated features. CONCLUSION: The approach allows us to build a combined caller across the full range of stringency levels, which outperforms all of the individual callers. |
format | Online Article Text |
id | pubmed-4035752 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40357522014-06-11 Combining calls from multiple somatic mutation-callers Kim, Su Yeon Jacob, Laurent Speed, Terence P BMC Bioinformatics Research Article BACKGROUND: Accurate somatic mutation-calling is essential for insightful mutation analyses in cancer studies. Several mutation-callers are publicly available and more are likely to appear. Nonetheless, mutation-calling is still challenging and there is unlikely to be one established caller that systematically outperforms all others. Therefore, fully utilizing multiple callers can be a powerful way to construct a list of final calls for one’s research. RESULTS: Using a set of mutations from multiple callers that are impartially validated, we present a statistical approach for building a combined caller, which can be applied to combine calls in a wider dataset generated using a similar protocol. Using the mutation outputs and the validation data from The Cancer Genome Atlas endometrial study (6,746 sites), we demonstrate how to build a statistical model that predicts the probability of each call being a somatic mutation, based on the detection status of multiple callers and a few associated features. CONCLUSION: The approach allows us to build a combined caller across the full range of stringency levels, which outperforms all of the individual callers. BioMed Central 2014-05-21 /pmc/articles/PMC4035752/ /pubmed/24885750 http://dx.doi.org/10.1186/1471-2105-15-154 Text en Copyright © 2014 Kim et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Kim, Su Yeon Jacob, Laurent Speed, Terence P Combining calls from multiple somatic mutation-callers |
title | Combining calls from multiple somatic mutation-callers |
title_full | Combining calls from multiple somatic mutation-callers |
title_fullStr | Combining calls from multiple somatic mutation-callers |
title_full_unstemmed | Combining calls from multiple somatic mutation-callers |
title_short | Combining calls from multiple somatic mutation-callers |
title_sort | combining calls from multiple somatic mutation-callers |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4035752/ https://www.ncbi.nlm.nih.gov/pubmed/24885750 http://dx.doi.org/10.1186/1471-2105-15-154 |
work_keys_str_mv | AT kimsuyeon combiningcallsfrommultiplesomaticmutationcallers AT jacoblaurent combiningcallsfrommultiplesomaticmutationcallers AT speedterencep combiningcallsfrommultiplesomaticmutationcallers |