Cargando…

A statistical model for the analysis of beta values in DNA methylation studies

BACKGROUND: The analysis of DNA methylation is a key component in the development of personalized treatment approaches. A common way to measure DNA methylation is the calculation of beta values, which are bounded variables of the form M/(M+U) that are generated by Illumina’s 450k BeadChip array. The...

Descripción completa

Detalles Bibliográficos
Autores principales: Weinhold, Leonie, Wahl, Simone, Pechlivanis, Sonali, Hoffmann, Per, Schmid, Matthias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5120494/
https://www.ncbi.nlm.nih.gov/pubmed/27875981
http://dx.doi.org/10.1186/s12859-016-1347-4
_version_ 1782469251895918592
author Weinhold, Leonie
Wahl, Simone
Pechlivanis, Sonali
Hoffmann, Per
Schmid, Matthias
author_facet Weinhold, Leonie
Wahl, Simone
Pechlivanis, Sonali
Hoffmann, Per
Schmid, Matthias
author_sort Weinhold, Leonie
collection PubMed
description BACKGROUND: The analysis of DNA methylation is a key component in the development of personalized treatment approaches. A common way to measure DNA methylation is the calculation of beta values, which are bounded variables of the form M/(M+U) that are generated by Illumina’s 450k BeadChip array. The statistical analysis of beta values is considered to be challenging, as traditional methods for the analysis of bounded variables, such as M-value regression and beta regression, are based on regularity assumptions that are often too strong to adequately describe the distribution of beta values. RESULTS: We develop a statistical model for the analysis of beta values that is derived from a bivariate gamma distribution for the signal intensities M and U. By allowing for possible correlations between M and U, the proposed model explicitly takes into account the data-generating process underlying the calculation of beta values. Using simulated data and a real sample of DNA methylation data from the Heinz Nixdorf Recall cohort study, we demonstrate that the proposed model fits our data significantly better than beta regression and M-value regression. CONCLUSION: The proposed model contributes to an improved identification of associations between beta values and covariates such as clinical variables and lifestyle factors in epigenome-wide association studies. It is as easy to apply to a sample of beta values as beta regression and M-value regression. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1347-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5120494
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-51204942016-11-28 A statistical model for the analysis of beta values in DNA methylation studies Weinhold, Leonie Wahl, Simone Pechlivanis, Sonali Hoffmann, Per Schmid, Matthias BMC Bioinformatics Methodology Article BACKGROUND: The analysis of DNA methylation is a key component in the development of personalized treatment approaches. A common way to measure DNA methylation is the calculation of beta values, which are bounded variables of the form M/(M+U) that are generated by Illumina’s 450k BeadChip array. The statistical analysis of beta values is considered to be challenging, as traditional methods for the analysis of bounded variables, such as M-value regression and beta regression, are based on regularity assumptions that are often too strong to adequately describe the distribution of beta values. RESULTS: We develop a statistical model for the analysis of beta values that is derived from a bivariate gamma distribution for the signal intensities M and U. By allowing for possible correlations between M and U, the proposed model explicitly takes into account the data-generating process underlying the calculation of beta values. Using simulated data and a real sample of DNA methylation data from the Heinz Nixdorf Recall cohort study, we demonstrate that the proposed model fits our data significantly better than beta regression and M-value regression. CONCLUSION: The proposed model contributes to an improved identification of associations between beta values and covariates such as clinical variables and lifestyle factors in epigenome-wide association studies. It is as easy to apply to a sample of beta values as beta regression and M-value regression. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1347-4) contains supplementary material, which is available to authorized users. BioMed Central 2016-11-22 /pmc/articles/PMC5120494/ /pubmed/27875981 http://dx.doi.org/10.1186/s12859-016-1347-4 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Weinhold, Leonie
Wahl, Simone
Pechlivanis, Sonali
Hoffmann, Per
Schmid, Matthias
A statistical model for the analysis of beta values in DNA methylation studies
title A statistical model for the analysis of beta values in DNA methylation studies
title_full A statistical model for the analysis of beta values in DNA methylation studies
title_fullStr A statistical model for the analysis of beta values in DNA methylation studies
title_full_unstemmed A statistical model for the analysis of beta values in DNA methylation studies
title_short A statistical model for the analysis of beta values in DNA methylation studies
title_sort statistical model for the analysis of beta values in dna methylation studies
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5120494/
https://www.ncbi.nlm.nih.gov/pubmed/27875981
http://dx.doi.org/10.1186/s12859-016-1347-4
work_keys_str_mv AT weinholdleonie astatisticalmodelfortheanalysisofbetavaluesindnamethylationstudies
AT wahlsimone astatisticalmodelfortheanalysisofbetavaluesindnamethylationstudies
AT pechlivanissonali astatisticalmodelfortheanalysisofbetavaluesindnamethylationstudies
AT hoffmannper astatisticalmodelfortheanalysisofbetavaluesindnamethylationstudies
AT schmidmatthias astatisticalmodelfortheanalysisofbetavaluesindnamethylationstudies
AT weinholdleonie statisticalmodelfortheanalysisofbetavaluesindnamethylationstudies
AT wahlsimone statisticalmodelfortheanalysisofbetavaluesindnamethylationstudies
AT pechlivanissonali statisticalmodelfortheanalysisofbetavaluesindnamethylationstudies
AT hoffmannper statisticalmodelfortheanalysisofbetavaluesindnamethylationstudies
AT schmidmatthias statisticalmodelfortheanalysisofbetavaluesindnamethylationstudies