Cargando…

HapMuC: somatic mutation calling using heterozygous germ line variants near candidate mutations

Motivation: Identifying somatic changes from tumor and matched normal sequences has become a standard approach in cancer research. More specifically, this requires accurate detection of somatic point mutations with low allele frequencies in impure and heterogeneous cancer samples. Although haplotype...

Descripción completa

Detalles Bibliográficos
Autores principales: Usuyama, Naoto, Shiraishi, Yuichi, Sato, Yusuke, Kume, Haruki, Homma, Yukio, Ogawa, Seishi, Miyano, Satoru, Imoto, Seiya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4816033/
https://www.ncbi.nlm.nih.gov/pubmed/25123903
http://dx.doi.org/10.1093/bioinformatics/btu537
_version_ 1782424655969124352
author Usuyama, Naoto
Shiraishi, Yuichi
Sato, Yusuke
Kume, Haruki
Homma, Yukio
Ogawa, Seishi
Miyano, Satoru
Imoto, Seiya
author_facet Usuyama, Naoto
Shiraishi, Yuichi
Sato, Yusuke
Kume, Haruki
Homma, Yukio
Ogawa, Seishi
Miyano, Satoru
Imoto, Seiya
author_sort Usuyama, Naoto
collection PubMed
description Motivation: Identifying somatic changes from tumor and matched normal sequences has become a standard approach in cancer research. More specifically, this requires accurate detection of somatic point mutations with low allele frequencies in impure and heterogeneous cancer samples. Although haplotype phasing information derived by using heterozygous germ line variants near candidate mutations would improve accuracy, no somatic mutation caller that uses such information is currently available. Results: We propose a Bayesian hierarchical method, termed HapMuC, in which power is increased by using available information on heterozygous germ line variants located near candidate mutations. We first constructed two generative models (the mutation model and the error model). In the generative models, we prepared candidate haplotypes, considering a heterozygous germ line variant if available, and the observed reads were realigned to the haplotypes. We then inferred the haplotype frequencies and computed the marginal likelihoods using a variational Bayesian algorithm. Finally, we derived a Bayes factor for evaluating the possibility of the existence of somatic mutations. We also demonstrated that our algorithm has superior specificity and sensitivity compared with existing methods, as determined based on a simulation, the TCGA Mutation Calling Benchmark 4 datasets and data from the COLO-829 cell line. Availability and implementation: The HapMuC source code is available from http://github.com/usuyama/hapmuc. Contact: imoto@ims.u-tokyo.ac.jp Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4816033
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-48160332016-04-04 HapMuC: somatic mutation calling using heterozygous germ line variants near candidate mutations Usuyama, Naoto Shiraishi, Yuichi Sato, Yusuke Kume, Haruki Homma, Yukio Ogawa, Seishi Miyano, Satoru Imoto, Seiya Bioinformatics Original Papers Motivation: Identifying somatic changes from tumor and matched normal sequences has become a standard approach in cancer research. More specifically, this requires accurate detection of somatic point mutations with low allele frequencies in impure and heterogeneous cancer samples. Although haplotype phasing information derived by using heterozygous germ line variants near candidate mutations would improve accuracy, no somatic mutation caller that uses such information is currently available. Results: We propose a Bayesian hierarchical method, termed HapMuC, in which power is increased by using available information on heterozygous germ line variants located near candidate mutations. We first constructed two generative models (the mutation model and the error model). In the generative models, we prepared candidate haplotypes, considering a heterozygous germ line variant if available, and the observed reads were realigned to the haplotypes. We then inferred the haplotype frequencies and computed the marginal likelihoods using a variational Bayesian algorithm. Finally, we derived a Bayes factor for evaluating the possibility of the existence of somatic mutations. We also demonstrated that our algorithm has superior specificity and sensitivity compared with existing methods, as determined based on a simulation, the TCGA Mutation Calling Benchmark 4 datasets and data from the COLO-829 cell line. Availability and implementation: The HapMuC source code is available from http://github.com/usuyama/hapmuc. Contact: imoto@ims.u-tokyo.ac.jp Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-12-01 2014-08-14 /pmc/articles/PMC4816033/ /pubmed/25123903 http://dx.doi.org/10.1093/bioinformatics/btu537 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Usuyama, Naoto
Shiraishi, Yuichi
Sato, Yusuke
Kume, Haruki
Homma, Yukio
Ogawa, Seishi
Miyano, Satoru
Imoto, Seiya
HapMuC: somatic mutation calling using heterozygous germ line variants near candidate mutations
title HapMuC: somatic mutation calling using heterozygous germ line variants near candidate mutations
title_full HapMuC: somatic mutation calling using heterozygous germ line variants near candidate mutations
title_fullStr HapMuC: somatic mutation calling using heterozygous germ line variants near candidate mutations
title_full_unstemmed HapMuC: somatic mutation calling using heterozygous germ line variants near candidate mutations
title_short HapMuC: somatic mutation calling using heterozygous germ line variants near candidate mutations
title_sort hapmuc: somatic mutation calling using heterozygous germ line variants near candidate mutations
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4816033/
https://www.ncbi.nlm.nih.gov/pubmed/25123903
http://dx.doi.org/10.1093/bioinformatics/btu537
work_keys_str_mv AT usuyamanaoto hapmucsomaticmutationcallingusingheterozygousgermlinevariantsnearcandidatemutations
AT shiraishiyuichi hapmucsomaticmutationcallingusingheterozygousgermlinevariantsnearcandidatemutations
AT satoyusuke hapmucsomaticmutationcallingusingheterozygousgermlinevariantsnearcandidatemutations
AT kumeharuki hapmucsomaticmutationcallingusingheterozygousgermlinevariantsnearcandidatemutations
AT hommayukio hapmucsomaticmutationcallingusingheterozygousgermlinevariantsnearcandidatemutations
AT ogawaseishi hapmucsomaticmutationcallingusingheterozygousgermlinevariantsnearcandidatemutations
AT miyanosatoru hapmucsomaticmutationcallingusingheterozygousgermlinevariantsnearcandidatemutations
AT imotoseiya hapmucsomaticmutationcallingusingheterozygousgermlinevariantsnearcandidatemutations