Cargando…

A full Bayesian partition model for identifying hypo- and hyper-methylated loci from single nucleotide resolution sequencing data

BACKGROUD: DNA methylation is an epigenetic modification that plays important roles on gene regulation. Study of whole-genome bisulfite sequencing and reduced representation bisulfite sequencing brings the availability of DNA methylation at single CpG resolution. The main interest of study on DNA me...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Henan, He, Chong, Kushwaha, Garima, Xu, Dong, Qiu, Jing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895387/
https://www.ncbi.nlm.nih.gov/pubmed/26818685
http://dx.doi.org/10.1186/s12859-015-0850-3
_version_ 1782435836971712512
author Wang, Henan
He, Chong
Kushwaha, Garima
Xu, Dong
Qiu, Jing
author_facet Wang, Henan
He, Chong
Kushwaha, Garima
Xu, Dong
Qiu, Jing
author_sort Wang, Henan
collection PubMed
description BACKGROUD: DNA methylation is an epigenetic modification that plays important roles on gene regulation. Study of whole-genome bisulfite sequencing and reduced representation bisulfite sequencing brings the availability of DNA methylation at single CpG resolution. The main interest of study on DNA methylation data is to test the methylation difference under two conditions of biological samples. However, the high cost and complexity of this sequencing experiment limits the number of biological replicates, which brings challenges to the development of statistical methods. RESULTS: Bayesian modeling is well known to be able to borrow strength across the genome, and hence is a powerful tool for high-dimensional- low-sample- size data. In order to provide accurate identification of methylation loci, especially for low coverage data, we propose a full Bayesian partition model to detect differentially methylated loci under two conditions of scientific study. Since hypo-methylation and hyper-methylation have distinct biological implication, it is desirable to differentiate these two types of differential methylation. The advantage of our Bayesian model is that it can produce one-step output of each locus being either equal-, hypo- or hyper-methylated locus without further post-hoc analysis. An R package named as MethyBayes implementing the proposed full Bayesian partition model will be submitted to the bioconductor website upon publication of the manuscript. CONCLUSIONS: The proposed full Bayesian partition model outperforms existing methods in terms of power while maintaining a low false discovery rate based on simulation studies and real data analysis including bioinformatics analysis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0850-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4895387
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48953872016-06-10 A full Bayesian partition model for identifying hypo- and hyper-methylated loci from single nucleotide resolution sequencing data Wang, Henan He, Chong Kushwaha, Garima Xu, Dong Qiu, Jing BMC Bioinformatics Proceedings BACKGROUD: DNA methylation is an epigenetic modification that plays important roles on gene regulation. Study of whole-genome bisulfite sequencing and reduced representation bisulfite sequencing brings the availability of DNA methylation at single CpG resolution. The main interest of study on DNA methylation data is to test the methylation difference under two conditions of biological samples. However, the high cost and complexity of this sequencing experiment limits the number of biological replicates, which brings challenges to the development of statistical methods. RESULTS: Bayesian modeling is well known to be able to borrow strength across the genome, and hence is a powerful tool for high-dimensional- low-sample- size data. In order to provide accurate identification of methylation loci, especially for low coverage data, we propose a full Bayesian partition model to detect differentially methylated loci under two conditions of scientific study. Since hypo-methylation and hyper-methylation have distinct biological implication, it is desirable to differentiate these two types of differential methylation. The advantage of our Bayesian model is that it can produce one-step output of each locus being either equal-, hypo- or hyper-methylated locus without further post-hoc analysis. An R package named as MethyBayes implementing the proposed full Bayesian partition model will be submitted to the bioconductor website upon publication of the manuscript. CONCLUSIONS: The proposed full Bayesian partition model outperforms existing methods in terms of power while maintaining a low false discovery rate based on simulation studies and real data analysis including bioinformatics analysis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0850-3) contains supplementary material, which is available to authorized users. BioMed Central 2016-01-11 /pmc/articles/PMC4895387/ /pubmed/26818685 http://dx.doi.org/10.1186/s12859-015-0850-3 Text en © Wang et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Wang, Henan
He, Chong
Kushwaha, Garima
Xu, Dong
Qiu, Jing
A full Bayesian partition model for identifying hypo- and hyper-methylated loci from single nucleotide resolution sequencing data
title A full Bayesian partition model for identifying hypo- and hyper-methylated loci from single nucleotide resolution sequencing data
title_full A full Bayesian partition model for identifying hypo- and hyper-methylated loci from single nucleotide resolution sequencing data
title_fullStr A full Bayesian partition model for identifying hypo- and hyper-methylated loci from single nucleotide resolution sequencing data
title_full_unstemmed A full Bayesian partition model for identifying hypo- and hyper-methylated loci from single nucleotide resolution sequencing data
title_short A full Bayesian partition model for identifying hypo- and hyper-methylated loci from single nucleotide resolution sequencing data
title_sort full bayesian partition model for identifying hypo- and hyper-methylated loci from single nucleotide resolution sequencing data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895387/
https://www.ncbi.nlm.nih.gov/pubmed/26818685
http://dx.doi.org/10.1186/s12859-015-0850-3
work_keys_str_mv AT wanghenan afullbayesianpartitionmodelforidentifyinghypoandhypermethylatedlocifromsinglenucleotideresolutionsequencingdata
AT hechong afullbayesianpartitionmodelforidentifyinghypoandhypermethylatedlocifromsinglenucleotideresolutionsequencingdata
AT kushwahagarima afullbayesianpartitionmodelforidentifyinghypoandhypermethylatedlocifromsinglenucleotideresolutionsequencingdata
AT xudong afullbayesianpartitionmodelforidentifyinghypoandhypermethylatedlocifromsinglenucleotideresolutionsequencingdata
AT qiujing afullbayesianpartitionmodelforidentifyinghypoandhypermethylatedlocifromsinglenucleotideresolutionsequencingdata
AT wanghenan fullbayesianpartitionmodelforidentifyinghypoandhypermethylatedlocifromsinglenucleotideresolutionsequencingdata
AT hechong fullbayesianpartitionmodelforidentifyinghypoandhypermethylatedlocifromsinglenucleotideresolutionsequencingdata
AT kushwahagarima fullbayesianpartitionmodelforidentifyinghypoandhypermethylatedlocifromsinglenucleotideresolutionsequencingdata
AT xudong fullbayesianpartitionmodelforidentifyinghypoandhypermethylatedlocifromsinglenucleotideresolutionsequencingdata
AT qiujing fullbayesianpartitionmodelforidentifyinghypoandhypermethylatedlocifromsinglenucleotideresolutionsequencingdata