Cargando…
A Bayesian model integration for mutation calling through data partitioning
MOTIVATION: Detection of somatic mutations from tumor and matched normal sequencing data has become among the most important analysis methods in cancer research. Some existing mutation callers have focused on additional information, e.g. heterozygous single-nucleotide polymorphisms (SNPs) nearby mut...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6821361/ https://www.ncbi.nlm.nih.gov/pubmed/30924874 http://dx.doi.org/10.1093/bioinformatics/btz233 |
_version_ | 1783464128588808192 |
---|---|
author | Moriyama, Takuya Imoto, Seiya Hayashi, Shuto Shiraishi, Yuichi Miyano, Satoru Yamaguchi, Rui |
author_facet | Moriyama, Takuya Imoto, Seiya Hayashi, Shuto Shiraishi, Yuichi Miyano, Satoru Yamaguchi, Rui |
author_sort | Moriyama, Takuya |
collection | PubMed |
description | MOTIVATION: Detection of somatic mutations from tumor and matched normal sequencing data has become among the most important analysis methods in cancer research. Some existing mutation callers have focused on additional information, e.g. heterozygous single-nucleotide polymorphisms (SNPs) nearby mutation candidates or overlapping paired-end read information. However, existing methods cannot take multiple information sources into account simultaneously. Existing Bayesian hierarchical model-based methods construct two generative models, the tumor model and error model, and limited information sources have been modeled. RESULTS: We proposed a Bayesian model integration framework named as partitioning-based model integration. In this framework, through introducing partitions for paired-end reads based on given information sources, we integrate existing generative models and utilize multiple information sources. Based on that, we constructed a novel Bayesian hierarchical model-based method named as OHVarfinDer. In both the tumor model and error model, we introduced partitions for a set of paired-end reads that cover a mutation candidate position, and applied a different generative model for each category of paired-end reads. We demonstrated that our method can utilize both heterozygous SNP information and overlapping paired-end read information effectively in simulation datasets and real datasets. AVAILABILITY AND IMPLEMENTATION: https://github.com/takumorizo/OHVarfinDer. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6821361 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-68213612019-11-04 A Bayesian model integration for mutation calling through data partitioning Moriyama, Takuya Imoto, Seiya Hayashi, Shuto Shiraishi, Yuichi Miyano, Satoru Yamaguchi, Rui Bioinformatics Original Papers MOTIVATION: Detection of somatic mutations from tumor and matched normal sequencing data has become among the most important analysis methods in cancer research. Some existing mutation callers have focused on additional information, e.g. heterozygous single-nucleotide polymorphisms (SNPs) nearby mutation candidates or overlapping paired-end read information. However, existing methods cannot take multiple information sources into account simultaneously. Existing Bayesian hierarchical model-based methods construct two generative models, the tumor model and error model, and limited information sources have been modeled. RESULTS: We proposed a Bayesian model integration framework named as partitioning-based model integration. In this framework, through introducing partitions for paired-end reads based on given information sources, we integrate existing generative models and utilize multiple information sources. Based on that, we constructed a novel Bayesian hierarchical model-based method named as OHVarfinDer. In both the tumor model and error model, we introduced partitions for a set of paired-end reads that cover a mutation candidate position, and applied a different generative model for each category of paired-end reads. We demonstrated that our method can utilize both heterozygous SNP information and overlapping paired-end read information effectively in simulation datasets and real datasets. AVAILABILITY AND IMPLEMENTATION: https://github.com/takumorizo/OHVarfinDer. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-11-01 2019-03-29 /pmc/articles/PMC6821361/ /pubmed/30924874 http://dx.doi.org/10.1093/bioinformatics/btz233 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Moriyama, Takuya Imoto, Seiya Hayashi, Shuto Shiraishi, Yuichi Miyano, Satoru Yamaguchi, Rui A Bayesian model integration for mutation calling through data partitioning |
title | A Bayesian model integration for mutation calling through data partitioning |
title_full | A Bayesian model integration for mutation calling through data partitioning |
title_fullStr | A Bayesian model integration for mutation calling through data partitioning |
title_full_unstemmed | A Bayesian model integration for mutation calling through data partitioning |
title_short | A Bayesian model integration for mutation calling through data partitioning |
title_sort | bayesian model integration for mutation calling through data partitioning |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6821361/ https://www.ncbi.nlm.nih.gov/pubmed/30924874 http://dx.doi.org/10.1093/bioinformatics/btz233 |
work_keys_str_mv | AT moriyamatakuya abayesianmodelintegrationformutationcallingthroughdatapartitioning AT imotoseiya abayesianmodelintegrationformutationcallingthroughdatapartitioning AT hayashishuto abayesianmodelintegrationformutationcallingthroughdatapartitioning AT shiraishiyuichi abayesianmodelintegrationformutationcallingthroughdatapartitioning AT miyanosatoru abayesianmodelintegrationformutationcallingthroughdatapartitioning AT yamaguchirui abayesianmodelintegrationformutationcallingthroughdatapartitioning AT moriyamatakuya bayesianmodelintegrationformutationcallingthroughdatapartitioning AT imotoseiya bayesianmodelintegrationformutationcallingthroughdatapartitioning AT hayashishuto bayesianmodelintegrationformutationcallingthroughdatapartitioning AT shiraishiyuichi bayesianmodelintegrationformutationcallingthroughdatapartitioning AT miyanosatoru bayesianmodelintegrationformutationcallingthroughdatapartitioning AT yamaguchirui bayesianmodelintegrationformutationcallingthroughdatapartitioning |