Cargando…
pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS)
BACKGROUND: When designing an epigenome-wide association study (EWAS) to investigate the relationship between DNA methylation (DNAm) and some exposure(s) or phenotype(s), it is critically important to assess the sample size needed to detect a hypothesized difference with adequate statistical power....
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6489300/ https://www.ncbi.nlm.nih.gov/pubmed/31035919 http://dx.doi.org/10.1186/s12859-019-2804-7 |
_version_ | 1783414796507414528 |
---|---|
author | Graw, Stefan Henn, Rosalyn Thompson, Jeffrey A. Koestler, Devin C. |
author_facet | Graw, Stefan Henn, Rosalyn Thompson, Jeffrey A. Koestler, Devin C. |
author_sort | Graw, Stefan |
collection | PubMed |
description | BACKGROUND: When designing an epigenome-wide association study (EWAS) to investigate the relationship between DNA methylation (DNAm) and some exposure(s) or phenotype(s), it is critically important to assess the sample size needed to detect a hypothesized difference with adequate statistical power. However, the complex and nuanced nature of DNAm data makes direct assessment of statistical power challenging. To circumvent these challenges and to address the outstanding need for a user-friendly interface for EWAS power evaluation, we have developed pwrEWAS. RESULTS: The current implementation of pwrEWAS accommodates power estimation for two-group comparisons of DNAm (e.g. case vs control, exposed vs non-exposed, etc.), where methylation assessment is carried out using the Illumina Human Methylation BeadChip technology. Power is calculated using a semi-parametric simulation-based approach in which DNAm data is randomly generated from beta-distributions using CpG-specific means and variances estimated from one of several different existing DNAm data sets, chosen to cover the most common tissue-types used in EWAS. In addition to specifying the tissue type to be used for DNAm profiling, users are required to specify the sample size, number of differentially methylated CpGs, effect size(s) (Δ(β)), target false discovery rate (FDR) and the number of simulated data sets, and have the option of selecting from several different statistical methods to perform differential methylation analyses. pwrEWAS reports the marginal power, marginal type I error rate, marginal FDR, and false discovery cost (FDC). Here, we demonstrate how pwrEWAS can be applied in practice using a hypothetical EWAS. In addition, we report its computational efficiency across a variety of user settings. CONCLUSION: Both under- and overpowered studies unnecessarily deplete resources and even risk failure of a study. With pwrEWAS, we provide a user-friendly tool to help researchers circumvent these risks and to assist in the design and planning of EWAS. AVAILABILITY: The web interface is written in the R statistical programming language using Shiny (RStudio Inc., 2016) and is available at https://biostats-shinyr.kumc.edu/pwrEWAS/. The R package for pwrEWAS is publicly available at GitHub (https://github.com/stefangraw/pwrEWAS). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2804-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6489300 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-64893002019-06-04 pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS) Graw, Stefan Henn, Rosalyn Thompson, Jeffrey A. Koestler, Devin C. BMC Bioinformatics Software BACKGROUND: When designing an epigenome-wide association study (EWAS) to investigate the relationship between DNA methylation (DNAm) and some exposure(s) or phenotype(s), it is critically important to assess the sample size needed to detect a hypothesized difference with adequate statistical power. However, the complex and nuanced nature of DNAm data makes direct assessment of statistical power challenging. To circumvent these challenges and to address the outstanding need for a user-friendly interface for EWAS power evaluation, we have developed pwrEWAS. RESULTS: The current implementation of pwrEWAS accommodates power estimation for two-group comparisons of DNAm (e.g. case vs control, exposed vs non-exposed, etc.), where methylation assessment is carried out using the Illumina Human Methylation BeadChip technology. Power is calculated using a semi-parametric simulation-based approach in which DNAm data is randomly generated from beta-distributions using CpG-specific means and variances estimated from one of several different existing DNAm data sets, chosen to cover the most common tissue-types used in EWAS. In addition to specifying the tissue type to be used for DNAm profiling, users are required to specify the sample size, number of differentially methylated CpGs, effect size(s) (Δ(β)), target false discovery rate (FDR) and the number of simulated data sets, and have the option of selecting from several different statistical methods to perform differential methylation analyses. pwrEWAS reports the marginal power, marginal type I error rate, marginal FDR, and false discovery cost (FDC). Here, we demonstrate how pwrEWAS can be applied in practice using a hypothetical EWAS. In addition, we report its computational efficiency across a variety of user settings. CONCLUSION: Both under- and overpowered studies unnecessarily deplete resources and even risk failure of a study. With pwrEWAS, we provide a user-friendly tool to help researchers circumvent these risks and to assist in the design and planning of EWAS. AVAILABILITY: The web interface is written in the R statistical programming language using Shiny (RStudio Inc., 2016) and is available at https://biostats-shinyr.kumc.edu/pwrEWAS/. The R package for pwrEWAS is publicly available at GitHub (https://github.com/stefangraw/pwrEWAS). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2804-7) contains supplementary material, which is available to authorized users. BioMed Central 2019-04-29 /pmc/articles/PMC6489300/ /pubmed/31035919 http://dx.doi.org/10.1186/s12859-019-2804-7 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Graw, Stefan Henn, Rosalyn Thompson, Jeffrey A. Koestler, Devin C. pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS) |
title | pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS) |
title_full | pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS) |
title_fullStr | pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS) |
title_full_unstemmed | pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS) |
title_short | pwrEWAS: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (EWAS) |
title_sort | pwrewas: a user-friendly tool for comprehensive power estimation for epigenome wide association studies (ewas) |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6489300/ https://www.ncbi.nlm.nih.gov/pubmed/31035919 http://dx.doi.org/10.1186/s12859-019-2804-7 |
work_keys_str_mv | AT grawstefan pwrewasauserfriendlytoolforcomprehensivepowerestimationforepigenomewideassociationstudiesewas AT hennrosalyn pwrewasauserfriendlytoolforcomprehensivepowerestimationforepigenomewideassociationstudiesewas AT thompsonjeffreya pwrewasauserfriendlytoolforcomprehensivepowerestimationforepigenomewideassociationstudiesewas AT koestlerdevinc pwrewasauserfriendlytoolforcomprehensivepowerestimationforepigenomewideassociationstudiesewas |