Cargando…
A statistical model for the analysis of beta values in DNA methylation studies
BACKGROUND: The analysis of DNA methylation is a key component in the development of personalized treatment approaches. A common way to measure DNA methylation is the calculation of beta values, which are bounded variables of the form M/(M+U) that are generated by Illumina’s 450k BeadChip array. The...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5120494/ https://www.ncbi.nlm.nih.gov/pubmed/27875981 http://dx.doi.org/10.1186/s12859-016-1347-4 |
_version_ | 1782469251895918592 |
---|---|
author | Weinhold, Leonie Wahl, Simone Pechlivanis, Sonali Hoffmann, Per Schmid, Matthias |
author_facet | Weinhold, Leonie Wahl, Simone Pechlivanis, Sonali Hoffmann, Per Schmid, Matthias |
author_sort | Weinhold, Leonie |
collection | PubMed |
description | BACKGROUND: The analysis of DNA methylation is a key component in the development of personalized treatment approaches. A common way to measure DNA methylation is the calculation of beta values, which are bounded variables of the form M/(M+U) that are generated by Illumina’s 450k BeadChip array. The statistical analysis of beta values is considered to be challenging, as traditional methods for the analysis of bounded variables, such as M-value regression and beta regression, are based on regularity assumptions that are often too strong to adequately describe the distribution of beta values. RESULTS: We develop a statistical model for the analysis of beta values that is derived from a bivariate gamma distribution for the signal intensities M and U. By allowing for possible correlations between M and U, the proposed model explicitly takes into account the data-generating process underlying the calculation of beta values. Using simulated data and a real sample of DNA methylation data from the Heinz Nixdorf Recall cohort study, we demonstrate that the proposed model fits our data significantly better than beta regression and M-value regression. CONCLUSION: The proposed model contributes to an improved identification of associations between beta values and covariates such as clinical variables and lifestyle factors in epigenome-wide association studies. It is as easy to apply to a sample of beta values as beta regression and M-value regression. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1347-4) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5120494 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-51204942016-11-28 A statistical model for the analysis of beta values in DNA methylation studies Weinhold, Leonie Wahl, Simone Pechlivanis, Sonali Hoffmann, Per Schmid, Matthias BMC Bioinformatics Methodology Article BACKGROUND: The analysis of DNA methylation is a key component in the development of personalized treatment approaches. A common way to measure DNA methylation is the calculation of beta values, which are bounded variables of the form M/(M+U) that are generated by Illumina’s 450k BeadChip array. The statistical analysis of beta values is considered to be challenging, as traditional methods for the analysis of bounded variables, such as M-value regression and beta regression, are based on regularity assumptions that are often too strong to adequately describe the distribution of beta values. RESULTS: We develop a statistical model for the analysis of beta values that is derived from a bivariate gamma distribution for the signal intensities M and U. By allowing for possible correlations between M and U, the proposed model explicitly takes into account the data-generating process underlying the calculation of beta values. Using simulated data and a real sample of DNA methylation data from the Heinz Nixdorf Recall cohort study, we demonstrate that the proposed model fits our data significantly better than beta regression and M-value regression. CONCLUSION: The proposed model contributes to an improved identification of associations between beta values and covariates such as clinical variables and lifestyle factors in epigenome-wide association studies. It is as easy to apply to a sample of beta values as beta regression and M-value regression. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1347-4) contains supplementary material, which is available to authorized users. BioMed Central 2016-11-22 /pmc/articles/PMC5120494/ /pubmed/27875981 http://dx.doi.org/10.1186/s12859-016-1347-4 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Weinhold, Leonie Wahl, Simone Pechlivanis, Sonali Hoffmann, Per Schmid, Matthias A statistical model for the analysis of beta values in DNA methylation studies |
title | A statistical model for the analysis of beta values in DNA methylation studies |
title_full | A statistical model for the analysis of beta values in DNA methylation studies |
title_fullStr | A statistical model for the analysis of beta values in DNA methylation studies |
title_full_unstemmed | A statistical model for the analysis of beta values in DNA methylation studies |
title_short | A statistical model for the analysis of beta values in DNA methylation studies |
title_sort | statistical model for the analysis of beta values in dna methylation studies |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5120494/ https://www.ncbi.nlm.nih.gov/pubmed/27875981 http://dx.doi.org/10.1186/s12859-016-1347-4 |
work_keys_str_mv | AT weinholdleonie astatisticalmodelfortheanalysisofbetavaluesindnamethylationstudies AT wahlsimone astatisticalmodelfortheanalysisofbetavaluesindnamethylationstudies AT pechlivanissonali astatisticalmodelfortheanalysisofbetavaluesindnamethylationstudies AT hoffmannper astatisticalmodelfortheanalysisofbetavaluesindnamethylationstudies AT schmidmatthias astatisticalmodelfortheanalysisofbetavaluesindnamethylationstudies AT weinholdleonie statisticalmodelfortheanalysisofbetavaluesindnamethylationstudies AT wahlsimone statisticalmodelfortheanalysisofbetavaluesindnamethylationstudies AT pechlivanissonali statisticalmodelfortheanalysisofbetavaluesindnamethylationstudies AT hoffmannper statisticalmodelfortheanalysisofbetavaluesindnamethylationstudies AT schmidmatthias statisticalmodelfortheanalysisofbetavaluesindnamethylationstudies |