Cargando…

A Bayesian model integration for mutation calling through data partitioning

MOTIVATION: Detection of somatic mutations from tumor and matched normal sequencing data has become among the most important analysis methods in cancer research. Some existing mutation callers have focused on additional information, e.g. heterozygous single-nucleotide polymorphisms (SNPs) nearby mut...

Descripción completa

Detalles Bibliográficos
Autores principales: Moriyama, Takuya, Imoto, Seiya, Hayashi, Shuto, Shiraishi, Yuichi, Miyano, Satoru, Yamaguchi, Rui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6821361/
https://www.ncbi.nlm.nih.gov/pubmed/30924874
http://dx.doi.org/10.1093/bioinformatics/btz233
_version_ 1783464128588808192
author Moriyama, Takuya
Imoto, Seiya
Hayashi, Shuto
Shiraishi, Yuichi
Miyano, Satoru
Yamaguchi, Rui
author_facet Moriyama, Takuya
Imoto, Seiya
Hayashi, Shuto
Shiraishi, Yuichi
Miyano, Satoru
Yamaguchi, Rui
author_sort Moriyama, Takuya
collection PubMed
description MOTIVATION: Detection of somatic mutations from tumor and matched normal sequencing data has become among the most important analysis methods in cancer research. Some existing mutation callers have focused on additional information, e.g. heterozygous single-nucleotide polymorphisms (SNPs) nearby mutation candidates or overlapping paired-end read information. However, existing methods cannot take multiple information sources into account simultaneously. Existing Bayesian hierarchical model-based methods construct two generative models, the tumor model and error model, and limited information sources have been modeled. RESULTS: We proposed a Bayesian model integration framework named as partitioning-based model integration. In this framework, through introducing partitions for paired-end reads based on given information sources, we integrate existing generative models and utilize multiple information sources. Based on that, we constructed a novel Bayesian hierarchical model-based method named as OHVarfinDer. In both the tumor model and error model, we introduced partitions for a set of paired-end reads that cover a mutation candidate position, and applied a different generative model for each category of paired-end reads. We demonstrated that our method can utilize both heterozygous SNP information and overlapping paired-end read information effectively in simulation datasets and real datasets. AVAILABILITY AND IMPLEMENTATION: https://github.com/takumorizo/OHVarfinDer. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6821361
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-68213612019-11-04 A Bayesian model integration for mutation calling through data partitioning Moriyama, Takuya Imoto, Seiya Hayashi, Shuto Shiraishi, Yuichi Miyano, Satoru Yamaguchi, Rui Bioinformatics Original Papers MOTIVATION: Detection of somatic mutations from tumor and matched normal sequencing data has become among the most important analysis methods in cancer research. Some existing mutation callers have focused on additional information, e.g. heterozygous single-nucleotide polymorphisms (SNPs) nearby mutation candidates or overlapping paired-end read information. However, existing methods cannot take multiple information sources into account simultaneously. Existing Bayesian hierarchical model-based methods construct two generative models, the tumor model and error model, and limited information sources have been modeled. RESULTS: We proposed a Bayesian model integration framework named as partitioning-based model integration. In this framework, through introducing partitions for paired-end reads based on given information sources, we integrate existing generative models and utilize multiple information sources. Based on that, we constructed a novel Bayesian hierarchical model-based method named as OHVarfinDer. In both the tumor model and error model, we introduced partitions for a set of paired-end reads that cover a mutation candidate position, and applied a different generative model for each category of paired-end reads. We demonstrated that our method can utilize both heterozygous SNP information and overlapping paired-end read information effectively in simulation datasets and real datasets. AVAILABILITY AND IMPLEMENTATION: https://github.com/takumorizo/OHVarfinDer. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-11-01 2019-03-29 /pmc/articles/PMC6821361/ /pubmed/30924874 http://dx.doi.org/10.1093/bioinformatics/btz233 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Moriyama, Takuya
Imoto, Seiya
Hayashi, Shuto
Shiraishi, Yuichi
Miyano, Satoru
Yamaguchi, Rui
A Bayesian model integration for mutation calling through data partitioning
title A Bayesian model integration for mutation calling through data partitioning
title_full A Bayesian model integration for mutation calling through data partitioning
title_fullStr A Bayesian model integration for mutation calling through data partitioning
title_full_unstemmed A Bayesian model integration for mutation calling through data partitioning
title_short A Bayesian model integration for mutation calling through data partitioning
title_sort bayesian model integration for mutation calling through data partitioning
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6821361/
https://www.ncbi.nlm.nih.gov/pubmed/30924874
http://dx.doi.org/10.1093/bioinformatics/btz233
work_keys_str_mv AT moriyamatakuya abayesianmodelintegrationformutationcallingthroughdatapartitioning
AT imotoseiya abayesianmodelintegrationformutationcallingthroughdatapartitioning
AT hayashishuto abayesianmodelintegrationformutationcallingthroughdatapartitioning
AT shiraishiyuichi abayesianmodelintegrationformutationcallingthroughdatapartitioning
AT miyanosatoru abayesianmodelintegrationformutationcallingthroughdatapartitioning
AT yamaguchirui abayesianmodelintegrationformutationcallingthroughdatapartitioning
AT moriyamatakuya bayesianmodelintegrationformutationcallingthroughdatapartitioning
AT imotoseiya bayesianmodelintegrationformutationcallingthroughdatapartitioning
AT hayashishuto bayesianmodelintegrationformutationcallingthroughdatapartitioning
AT shiraishiyuichi bayesianmodelintegrationformutationcallingthroughdatapartitioning
AT miyanosatoru bayesianmodelintegrationformutationcallingthroughdatapartitioning
AT yamaguchirui bayesianmodelintegrationformutationcallingthroughdatapartitioning