Cargando…
Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing
BACKGROUND: Epigenome-wide association studies (EWAS), which seek the association between epigenetic marks and an outcome or exposure, involve multiple hypothesis testing. False discovery rate (FDR) control has been widely used for multiple testing correction. However, traditional FDR control method...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7132874/ https://www.ncbi.nlm.nih.gov/pubmed/32252795 http://dx.doi.org/10.1186/s13059-020-02001-7 |
_version_ | 1783517522371280896 |
---|---|
author | Huang, Jinyan Bai, Ling Cui, Bowen Wu, Liang Wang, Liwen An, Zhiyin Ruan, Shulin Yu, Yue Zhang, Xianyang Chen, Jun |
author_facet | Huang, Jinyan Bai, Ling Cui, Bowen Wu, Liang Wang, Liwen An, Zhiyin Ruan, Shulin Yu, Yue Zhang, Xianyang Chen, Jun |
author_sort | Huang, Jinyan |
collection | PubMed |
description | BACKGROUND: Epigenome-wide association studies (EWAS), which seek the association between epigenetic marks and an outcome or exposure, involve multiple hypothesis testing. False discovery rate (FDR) control has been widely used for multiple testing correction. However, traditional FDR control methods do not use auxiliary covariates, and they could be less powerful if the covariates could inform the likelihood of the null hypothesis. Recently, many covariate-adaptive FDR control methods have been developed, but application of these methods to EWAS data has not yet been explored. It is not clear whether these methods can significantly improve detection power, and if so, which covariates are more relevant for EWAS data. RESULTS: In this study, we evaluate the performance of five covariate-adaptive FDR control methods with EWAS-related covariates using simulated as well as real EWAS datasets. We develop an omnibus test to assess the informativeness of the covariates. We find that statistical covariates are generally more informative than biological covariates, and the covariates of methylation mean and variance are almost universally informative. In contrast, the informativeness of biological covariates depends on specific datasets. We show that the independent hypothesis weighting (IHW) and covariate adaptive multiple testing (CAMT) method are overall more powerful, especially for sparse signals, and could improve the detection power by a median of 25% and 68% on real datasets, compared to the ST procedure. We further validate the findings in various biological contexts. CONCLUSIONS: Covariate-adaptive FDR control methods with informative covariates can significantly increase the detection power for EWAS. For sparse signals, IHW and CAMT are recommended. |
format | Online Article Text |
id | pubmed-7132874 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-71328742020-04-11 Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing Huang, Jinyan Bai, Ling Cui, Bowen Wu, Liang Wang, Liwen An, Zhiyin Ruan, Shulin Yu, Yue Zhang, Xianyang Chen, Jun Genome Biol Research BACKGROUND: Epigenome-wide association studies (EWAS), which seek the association between epigenetic marks and an outcome or exposure, involve multiple hypothesis testing. False discovery rate (FDR) control has been widely used for multiple testing correction. However, traditional FDR control methods do not use auxiliary covariates, and they could be less powerful if the covariates could inform the likelihood of the null hypothesis. Recently, many covariate-adaptive FDR control methods have been developed, but application of these methods to EWAS data has not yet been explored. It is not clear whether these methods can significantly improve detection power, and if so, which covariates are more relevant for EWAS data. RESULTS: In this study, we evaluate the performance of five covariate-adaptive FDR control methods with EWAS-related covariates using simulated as well as real EWAS datasets. We develop an omnibus test to assess the informativeness of the covariates. We find that statistical covariates are generally more informative than biological covariates, and the covariates of methylation mean and variance are almost universally informative. In contrast, the informativeness of biological covariates depends on specific datasets. We show that the independent hypothesis weighting (IHW) and covariate adaptive multiple testing (CAMT) method are overall more powerful, especially for sparse signals, and could improve the detection power by a median of 25% and 68% on real datasets, compared to the ST procedure. We further validate the findings in various biological contexts. CONCLUSIONS: Covariate-adaptive FDR control methods with informative covariates can significantly increase the detection power for EWAS. For sparse signals, IHW and CAMT are recommended. BioMed Central 2020-04-06 /pmc/articles/PMC7132874/ /pubmed/32252795 http://dx.doi.org/10.1186/s13059-020-02001-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Huang, Jinyan Bai, Ling Cui, Bowen Wu, Liang Wang, Liwen An, Zhiyin Ruan, Shulin Yu, Yue Zhang, Xianyang Chen, Jun Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing |
title | Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing |
title_full | Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing |
title_fullStr | Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing |
title_full_unstemmed | Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing |
title_short | Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing |
title_sort | leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7132874/ https://www.ncbi.nlm.nih.gov/pubmed/32252795 http://dx.doi.org/10.1186/s13059-020-02001-7 |
work_keys_str_mv | AT huangjinyan leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting AT bailing leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting AT cuibowen leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting AT wuliang leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting AT wangliwen leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting AT anzhiyin leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting AT ruanshulin leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting AT yuyue leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting AT zhangxianyang leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting AT chenjun leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting |