Cargando…

Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing

BACKGROUND: Epigenome-wide association studies (EWAS), which seek the association between epigenetic marks and an outcome or exposure, involve multiple hypothesis testing. False discovery rate (FDR) control has been widely used for multiple testing correction. However, traditional FDR control method...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Jinyan, Bai, Ling, Cui, Bowen, Wu, Liang, Wang, Liwen, An, Zhiyin, Ruan, Shulin, Yu, Yue, Zhang, Xianyang, Chen, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7132874/
https://www.ncbi.nlm.nih.gov/pubmed/32252795
http://dx.doi.org/10.1186/s13059-020-02001-7
_version_ 1783517522371280896
author Huang, Jinyan
Bai, Ling
Cui, Bowen
Wu, Liang
Wang, Liwen
An, Zhiyin
Ruan, Shulin
Yu, Yue
Zhang, Xianyang
Chen, Jun
author_facet Huang, Jinyan
Bai, Ling
Cui, Bowen
Wu, Liang
Wang, Liwen
An, Zhiyin
Ruan, Shulin
Yu, Yue
Zhang, Xianyang
Chen, Jun
author_sort Huang, Jinyan
collection PubMed
description BACKGROUND: Epigenome-wide association studies (EWAS), which seek the association between epigenetic marks and an outcome or exposure, involve multiple hypothesis testing. False discovery rate (FDR) control has been widely used for multiple testing correction. However, traditional FDR control methods do not use auxiliary covariates, and they could be less powerful if the covariates could inform the likelihood of the null hypothesis. Recently, many covariate-adaptive FDR control methods have been developed, but application of these methods to EWAS data has not yet been explored. It is not clear whether these methods can significantly improve detection power, and if so, which covariates are more relevant for EWAS data. RESULTS: In this study, we evaluate the performance of five covariate-adaptive FDR control methods with EWAS-related covariates using simulated as well as real EWAS datasets. We develop an omnibus test to assess the informativeness of the covariates. We find that statistical covariates are generally more informative than biological covariates, and the covariates of methylation mean and variance are almost universally informative. In contrast, the informativeness of biological covariates depends on specific datasets. We show that the independent hypothesis weighting (IHW) and covariate adaptive multiple testing (CAMT) method are overall more powerful, especially for sparse signals, and could improve the detection power by a median of 25% and 68% on real datasets, compared to the ST procedure. We further validate the findings in various biological contexts. CONCLUSIONS: Covariate-adaptive FDR control methods with informative covariates can significantly increase the detection power for EWAS. For sparse signals, IHW and CAMT are recommended.
format Online
Article
Text
id pubmed-7132874
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-71328742020-04-11 Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing Huang, Jinyan Bai, Ling Cui, Bowen Wu, Liang Wang, Liwen An, Zhiyin Ruan, Shulin Yu, Yue Zhang, Xianyang Chen, Jun Genome Biol Research BACKGROUND: Epigenome-wide association studies (EWAS), which seek the association between epigenetic marks and an outcome or exposure, involve multiple hypothesis testing. False discovery rate (FDR) control has been widely used for multiple testing correction. However, traditional FDR control methods do not use auxiliary covariates, and they could be less powerful if the covariates could inform the likelihood of the null hypothesis. Recently, many covariate-adaptive FDR control methods have been developed, but application of these methods to EWAS data has not yet been explored. It is not clear whether these methods can significantly improve detection power, and if so, which covariates are more relevant for EWAS data. RESULTS: In this study, we evaluate the performance of five covariate-adaptive FDR control methods with EWAS-related covariates using simulated as well as real EWAS datasets. We develop an omnibus test to assess the informativeness of the covariates. We find that statistical covariates are generally more informative than biological covariates, and the covariates of methylation mean and variance are almost universally informative. In contrast, the informativeness of biological covariates depends on specific datasets. We show that the independent hypothesis weighting (IHW) and covariate adaptive multiple testing (CAMT) method are overall more powerful, especially for sparse signals, and could improve the detection power by a median of 25% and 68% on real datasets, compared to the ST procedure. We further validate the findings in various biological contexts. CONCLUSIONS: Covariate-adaptive FDR control methods with informative covariates can significantly increase the detection power for EWAS. For sparse signals, IHW and CAMT are recommended. BioMed Central 2020-04-06 /pmc/articles/PMC7132874/ /pubmed/32252795 http://dx.doi.org/10.1186/s13059-020-02001-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Huang, Jinyan
Bai, Ling
Cui, Bowen
Wu, Liang
Wang, Liwen
An, Zhiyin
Ruan, Shulin
Yu, Yue
Zhang, Xianyang
Chen, Jun
Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing
title Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing
title_full Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing
title_fullStr Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing
title_full_unstemmed Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing
title_short Leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing
title_sort leveraging biological and statistical covariates improves the detection power in epigenome-wide association testing
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7132874/
https://www.ncbi.nlm.nih.gov/pubmed/32252795
http://dx.doi.org/10.1186/s13059-020-02001-7
work_keys_str_mv AT huangjinyan leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting
AT bailing leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting
AT cuibowen leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting
AT wuliang leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting
AT wangliwen leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting
AT anzhiyin leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting
AT ruanshulin leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting
AT yuyue leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting
AT zhangxianyang leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting
AT chenjun leveragingbiologicalandstatisticalcovariatesimprovesthedetectionpowerinepigenomewideassociationtesting