Cargando…
A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets
Technological advances and reduced costs of high-density methylation arrays have led to an increasing number of association studies on the possible relationship between human disease and epigenetic variability. DNA samples from peripheral blood or other tissue types are analyzed in epigenome-wide as...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954449/ https://www.ncbi.nlm.nih.gov/pubmed/30099476 http://dx.doi.org/10.1093/bib/bby068 |
_version_ | 1783486798398226432 |
---|---|
author | Brägelmann, Johannes Lorenzo Bermejo, Justo |
author_facet | Brägelmann, Johannes Lorenzo Bermejo, Justo |
author_sort | Brägelmann, Johannes |
collection | PubMed |
description | Technological advances and reduced costs of high-density methylation arrays have led to an increasing number of association studies on the possible relationship between human disease and epigenetic variability. DNA samples from peripheral blood or other tissue types are analyzed in epigenome-wide association studies (EWAS) to detect methylation differences related to a particular phenotype. Since information on the cell-type composition of the sample is generally not available and methylation profiles are cell-type specific, statistical methods have been developed for adjustment of cell-type heterogeneity in EWAS. In this study we systematically compared five popular adjustment methods: the factored spectrally transformed linear mixed model (FaST-LMM-EWASher), the sparse principal component analysis algorithm ReFACTor, surrogate variable analysis (SVA), independent SVA (ISVA) and an optimized version of SVA (SmartSVA). We used real data and applied a multilayered simulation framework to assess the type I error rate, the statistical power and the quality of estimated methylation differences according to major study characteristics. While all five adjustment methods improved false-positive rates compared with unadjusted analyses, FaST-LMM-EWASher resulted in the lowest type I error rate at the expense of low statistical power. SVA efficiently corrected for cell-type heterogeneity in EWAS up to 200 cases and 200 controls, but did not control type I error rates in larger studies. Results based on real data sets confirmed simulation findings with the strongest control of type I error rates by FaST-LMM-EWASher and SmartSVA. Overall, ReFACTor, ISVA and SmartSVA showed the best comparable statistical power, quality of estimated methylation differences and runtime. |
format | Online Article Text |
id | pubmed-6954449 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-69544492020-01-16 A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets Brägelmann, Johannes Lorenzo Bermejo, Justo Brief Bioinform Review Article Technological advances and reduced costs of high-density methylation arrays have led to an increasing number of association studies on the possible relationship between human disease and epigenetic variability. DNA samples from peripheral blood or other tissue types are analyzed in epigenome-wide association studies (EWAS) to detect methylation differences related to a particular phenotype. Since information on the cell-type composition of the sample is generally not available and methylation profiles are cell-type specific, statistical methods have been developed for adjustment of cell-type heterogeneity in EWAS. In this study we systematically compared five popular adjustment methods: the factored spectrally transformed linear mixed model (FaST-LMM-EWASher), the sparse principal component analysis algorithm ReFACTor, surrogate variable analysis (SVA), independent SVA (ISVA) and an optimized version of SVA (SmartSVA). We used real data and applied a multilayered simulation framework to assess the type I error rate, the statistical power and the quality of estimated methylation differences according to major study characteristics. While all five adjustment methods improved false-positive rates compared with unadjusted analyses, FaST-LMM-EWASher resulted in the lowest type I error rate at the expense of low statistical power. SVA efficiently corrected for cell-type heterogeneity in EWAS up to 200 cases and 200 controls, but did not control type I error rates in larger studies. Results based on real data sets confirmed simulation findings with the strongest control of type I error rates by FaST-LMM-EWASher and SmartSVA. Overall, ReFACTor, ISVA and SmartSVA showed the best comparable statistical power, quality of estimated methylation differences and runtime. Oxford University Press 2018-08-06 /pmc/articles/PMC6954449/ /pubmed/30099476 http://dx.doi.org/10.1093/bib/bby068 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Review Article Brägelmann, Johannes Lorenzo Bermejo, Justo A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets |
title | A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets |
title_full | A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets |
title_fullStr | A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets |
title_full_unstemmed | A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets |
title_short | A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets |
title_sort | comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets |
topic | Review Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954449/ https://www.ncbi.nlm.nih.gov/pubmed/30099476 http://dx.doi.org/10.1093/bib/bby068 |
work_keys_str_mv | AT bragelmannjohannes acomparativeanalysisofcelltypeadjustmentmethodsforepigenomewideassociationstudiesbasedonsimulatedandrealdatasets AT lorenzobermejojusto acomparativeanalysisofcelltypeadjustmentmethodsforepigenomewideassociationstudiesbasedonsimulatedandrealdatasets AT bragelmannjohannes comparativeanalysisofcelltypeadjustmentmethodsforepigenomewideassociationstudiesbasedonsimulatedandrealdatasets AT lorenzobermejojusto comparativeanalysisofcelltypeadjustmentmethodsforepigenomewideassociationstudiesbasedonsimulatedandrealdatasets |