Cargando…

A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets

Technological advances and reduced costs of high-density methylation arrays have led to an increasing number of association studies on the possible relationship between human disease and epigenetic variability. DNA samples from peripheral blood or other tissue types are analyzed in epigenome-wide as...

Descripción completa

Detalles Bibliográficos
Autores principales: Brägelmann, Johannes, Lorenzo Bermejo, Justo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954449/
https://www.ncbi.nlm.nih.gov/pubmed/30099476
http://dx.doi.org/10.1093/bib/bby068
_version_ 1783486798398226432
author Brägelmann, Johannes
Lorenzo Bermejo, Justo
author_facet Brägelmann, Johannes
Lorenzo Bermejo, Justo
author_sort Brägelmann, Johannes
collection PubMed
description Technological advances and reduced costs of high-density methylation arrays have led to an increasing number of association studies on the possible relationship between human disease and epigenetic variability. DNA samples from peripheral blood or other tissue types are analyzed in epigenome-wide association studies (EWAS) to detect methylation differences related to a particular phenotype. Since information on the cell-type composition of the sample is generally not available and methylation profiles are cell-type specific, statistical methods have been developed for adjustment of cell-type heterogeneity in EWAS. In this study we systematically compared five popular adjustment methods: the factored spectrally transformed linear mixed model (FaST-LMM-EWASher), the sparse principal component analysis algorithm ReFACTor, surrogate variable analysis (SVA), independent SVA (ISVA) and an optimized version of SVA (SmartSVA). We used real data and applied a multilayered simulation framework to assess the type I error rate, the statistical power and the quality of estimated methylation differences according to major study characteristics. While all five adjustment methods improved false-positive rates compared with unadjusted analyses, FaST-LMM-EWASher resulted in the lowest type I error rate at the expense of low statistical power. SVA efficiently corrected for cell-type heterogeneity in EWAS up to 200 cases and 200 controls, but did not control type I error rates in larger studies. Results based on real data sets confirmed simulation findings with the strongest control of type I error rates by FaST-LMM-EWASher and SmartSVA. Overall, ReFACTor, ISVA and SmartSVA showed the best comparable statistical power, quality of estimated methylation differences and runtime.
format Online
Article
Text
id pubmed-6954449
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-69544492020-01-16 A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets Brägelmann, Johannes Lorenzo Bermejo, Justo Brief Bioinform Review Article Technological advances and reduced costs of high-density methylation arrays have led to an increasing number of association studies on the possible relationship between human disease and epigenetic variability. DNA samples from peripheral blood or other tissue types are analyzed in epigenome-wide association studies (EWAS) to detect methylation differences related to a particular phenotype. Since information on the cell-type composition of the sample is generally not available and methylation profiles are cell-type specific, statistical methods have been developed for adjustment of cell-type heterogeneity in EWAS. In this study we systematically compared five popular adjustment methods: the factored spectrally transformed linear mixed model (FaST-LMM-EWASher), the sparse principal component analysis algorithm ReFACTor, surrogate variable analysis (SVA), independent SVA (ISVA) and an optimized version of SVA (SmartSVA). We used real data and applied a multilayered simulation framework to assess the type I error rate, the statistical power and the quality of estimated methylation differences according to major study characteristics. While all five adjustment methods improved false-positive rates compared with unadjusted analyses, FaST-LMM-EWASher resulted in the lowest type I error rate at the expense of low statistical power. SVA efficiently corrected for cell-type heterogeneity in EWAS up to 200 cases and 200 controls, but did not control type I error rates in larger studies. Results based on real data sets confirmed simulation findings with the strongest control of type I error rates by FaST-LMM-EWASher and SmartSVA. Overall, ReFACTor, ISVA and SmartSVA showed the best comparable statistical power, quality of estimated methylation differences and runtime. Oxford University Press 2018-08-06 /pmc/articles/PMC6954449/ /pubmed/30099476 http://dx.doi.org/10.1093/bib/bby068 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Review Article
Brägelmann, Johannes
Lorenzo Bermejo, Justo
A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets
title A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets
title_full A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets
title_fullStr A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets
title_full_unstemmed A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets
title_short A comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets
title_sort comparative analysis of cell-type adjustment methods for epigenome-wide association studies based on simulated and real data sets
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954449/
https://www.ncbi.nlm.nih.gov/pubmed/30099476
http://dx.doi.org/10.1093/bib/bby068
work_keys_str_mv AT bragelmannjohannes acomparativeanalysisofcelltypeadjustmentmethodsforepigenomewideassociationstudiesbasedonsimulatedandrealdatasets
AT lorenzobermejojusto acomparativeanalysisofcelltypeadjustmentmethodsforepigenomewideassociationstudiesbasedonsimulatedandrealdatasets
AT bragelmannjohannes comparativeanalysisofcelltypeadjustmentmethodsforepigenomewideassociationstudiesbasedonsimulatedandrealdatasets
AT lorenzobermejojusto comparativeanalysisofcelltypeadjustmentmethodsforepigenomewideassociationstudiesbasedonsimulatedandrealdatasets