Cargando…

pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS)

BACKGROUND: When designing an epigenome-wide association study (EWAS) to investigate the relationship between DNA methylation (DNAm) and some exposure(s) or phenotype(s), it is critically important to assess the sample size needed to detect a hypothesized difference with adequate statistical power....

Descripción completa

Detalles Bibliográficos
Autores principales: Graw, Stefan, Henn, Rosalyn, Thompson, Jeffrey A., Koestler, Devin C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6489300/
https://www.ncbi.nlm.nih.gov/pubmed/31035919
http://dx.doi.org/10.1186/s12859-019-2804-7
_version_ 1783414796507414528
author Graw, Stefan
Henn, Rosalyn
Thompson, Jeffrey A.
Koestler, Devin C.
author_facet Graw, Stefan
Henn, Rosalyn
Thompson, Jeffrey A.
Koestler, Devin C.
author_sort Graw, Stefan
collection PubMed
description BACKGROUND: When designing an epigenome-wide association study (EWAS) to investigate the relationship between DNA methylation (DNAm) and some exposure(s) or phenotype(s), it is critically important to assess the sample size needed to detect a hypothesized difference with adequate statistical power. However, the complex and nuanced nature of DNAm data makes direct assessment of statistical power challenging. To circumvent these challenges and to address the outstanding need for a user-friendly interface for EWAS power evaluation, we have developed pwrEWAS. RESULTS: The current implementation of pwrEWAS accommodates power estimation for two-group comparisons of DNAm (e.g. case vs control, exposed vs non-exposed, etc.), where methylation assessment is carried out using the Illumina Human Methylation BeadChip technology. Power is calculated using a semi-parametric simulation-based approach in which DNAm data is randomly generated from beta-distributions using CpG-specific means and variances estimated from one of several different existing DNAm data sets, chosen to cover the most common tissue-types used in EWAS. In addition to specifying the tissue type to be used for DNAm profiling, users are required to specify the sample size, number of differentially methylated CpGs, effect size(s) (Δ(β)), target false discovery rate (FDR) and the number of simulated data sets, and have the option of selecting from several different statistical methods to perform differential methylation analyses. pwrEWAS reports the marginal power, marginal type I error rate, marginal FDR, and false discovery cost (FDC). Here, we demonstrate how pwrEWAS can be applied in practice using a hypothetical EWAS. In addition, we report its computational efficiency across a variety of user settings. CONCLUSION: Both under- and overpowered studies unnecessarily deplete resources and even risk failure of a study. With pwrEWAS, we provide a user-friendly tool to help researchers circumvent these risks and to assist in the design and planning of EWAS. AVAILABILITY: The web interface is written in the R statistical programming language using Shiny (RStudio Inc., 2016) and is available at https://biostats-shinyr.kumc.edu/pwrEWAS/. The R package for pwrEWAS is publicly available at GitHub (https://github.com/stefangraw/pwrEWAS). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2804-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6489300
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64893002019-06-04 pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS) Graw, Stefan Henn, Rosalyn Thompson, Jeffrey A. Koestler, Devin C. BMC Bioinformatics Software BACKGROUND: When designing an epigenome-wide association study (EWAS) to investigate the relationship between DNA methylation (DNAm) and some exposure(s) or phenotype(s), it is critically important to assess the sample size needed to detect a hypothesized difference with adequate statistical power. However, the complex and nuanced nature of DNAm data makes direct assessment of statistical power challenging. To circumvent these challenges and to address the outstanding need for a user-friendly interface for EWAS power evaluation, we have developed pwrEWAS. RESULTS: The current implementation of pwrEWAS accommodates power estimation for two-group comparisons of DNAm (e.g. case vs control, exposed vs non-exposed, etc.), where methylation assessment is carried out using the Illumina Human Methylation BeadChip technology. Power is calculated using a semi-parametric simulation-based approach in which DNAm data is randomly generated from beta-distributions using CpG-specific means and variances estimated from one of several different existing DNAm data sets, chosen to cover the most common tissue-types used in EWAS. In addition to specifying the tissue type to be used for DNAm profiling, users are required to specify the sample size, number of differentially methylated CpGs, effect size(s) (Δ(β)), target false discovery rate (FDR) and the number of simulated data sets, and have the option of selecting from several different statistical methods to perform differential methylation analyses. pwrEWAS reports the marginal power, marginal type I error rate, marginal FDR, and false discovery cost (FDC). Here, we demonstrate how pwrEWAS can be applied in practice using a hypothetical EWAS. In addition, we report its computational efficiency across a variety of user settings. CONCLUSION: Both under- and overpowered studies unnecessarily deplete resources and even risk failure of a study. With pwrEWAS, we provide a user-friendly tool to help researchers circumvent these risks and to assist in the design and planning of EWAS. AVAILABILITY: The web interface is written in the R statistical programming language using Shiny (RStudio Inc., 2016) and is available at https://biostats-shinyr.kumc.edu/pwrEWAS/. The R package for pwrEWAS is publicly available at GitHub (https://github.com/stefangraw/pwrEWAS). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2804-7) contains supplementary material, which is available to authorized users. BioMed Central 2019-04-29 /pmc/articles/PMC6489300/ /pubmed/31035919 http://dx.doi.org/10.1186/s12859-019-2804-7 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Graw, Stefan
Henn, Rosalyn
Thompson, Jeffrey A.
Koestler, Devin C.
pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS)
title pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS)
title_full pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS)
title_fullStr pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS)
title_full_unstemmed pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS)
title_short pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS)
title_sort pwrewas: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (ewas)
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6489300/
https://www.ncbi.nlm.nih.gov/pubmed/31035919
http://dx.doi.org/10.1186/s12859-019-2804-7
work_keys_str_mv AT grawstefan pwrewasauserfriendlytoolforcomprehensivepowerestimationforepigenomewideassociationstudiesewas
AT hennrosalyn pwrewasauserfriendlytoolforcomprehensivepowerestimationforepigenomewideassociationstudiesewas
AT thompsonjeffreya pwrewasauserfriendlytoolforcomprehensivepowerestimationforepigenomewideassociationstudiesewas
AT koestlerdevinc pwrewasauserfriendlytoolforcomprehensivepowerestimationforepigenomewideassociationstudiesewas