Cargando…

Imputation of missing covariate values in epigenome-wide analysis of DNA methylation data

DNA methylation is a widely studied epigenetic mechanism and alterations in methylation patterns may be involved in the development of common diseases. Unlike inherited changes in genetic sequence, variation in site-specific methylation varies by tissue, developmental stage, and disease status, and...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Chong, Demerath, Ellen W., Pankow, James S., Bressler, Jan, Fornage, Myriam, Grove, Megan L., Chen, Wei, Guan, Weihua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4846117/
https://www.ncbi.nlm.nih.gov/pubmed/26890800
http://dx.doi.org/10.1080/15592294.2016.1145328
_version_ 1782429038415970304
author Wu, Chong
Demerath, Ellen W.
Pankow, James S.
Bressler, Jan
Fornage, Myriam
Grove, Megan L.
Chen, Wei
Guan, Weihua
author_facet Wu, Chong
Demerath, Ellen W.
Pankow, James S.
Bressler, Jan
Fornage, Myriam
Grove, Megan L.
Chen, Wei
Guan, Weihua
author_sort Wu, Chong
collection PubMed
description DNA methylation is a widely studied epigenetic mechanism and alterations in methylation patterns may be involved in the development of common diseases. Unlike inherited changes in genetic sequence, variation in site-specific methylation varies by tissue, developmental stage, and disease status, and may be impacted by aging and exposure to environmental factors, such as diet or smoking. These non-genetic factors are typically included in epigenome-wide association studies (EWAS) because they may be confounding factors to the association between methylation and disease. However, missing values in these variables can lead to reduced sample size and decrease the statistical power of EWAS. We propose a site selection and multiple imputation (MI) method to impute missing covariate values and to perform association tests in EWAS. Then, we compare this method to an alternative projection-based method. Through simulations, we show that the MI-based method is slightly conservative, but provides consistent estimates for effect size. We also illustrate these methods with data from the Atherosclerosis Risk in Communities (ARIC) study to carry out an EWAS between methylation levels and smoking status, in which missing cell type compositions and white blood cell counts are imputed.
format Online
Article
Text
id pubmed-4846117
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-48461172016-05-09 Imputation of missing covariate values in epigenome-wide analysis of DNA methylation data Wu, Chong Demerath, Ellen W. Pankow, James S. Bressler, Jan Fornage, Myriam Grove, Megan L. Chen, Wei Guan, Weihua Epigenetics Research Paper DNA methylation is a widely studied epigenetic mechanism and alterations in methylation patterns may be involved in the development of common diseases. Unlike inherited changes in genetic sequence, variation in site-specific methylation varies by tissue, developmental stage, and disease status, and may be impacted by aging and exposure to environmental factors, such as diet or smoking. These non-genetic factors are typically included in epigenome-wide association studies (EWAS) because they may be confounding factors to the association between methylation and disease. However, missing values in these variables can lead to reduced sample size and decrease the statistical power of EWAS. We propose a site selection and multiple imputation (MI) method to impute missing covariate values and to perform association tests in EWAS. Then, we compare this method to an alternative projection-based method. Through simulations, we show that the MI-based method is slightly conservative, but provides consistent estimates for effect size. We also illustrate these methods with data from the Atherosclerosis Risk in Communities (ARIC) study to carry out an EWAS between methylation levels and smoking status, in which missing cell type compositions and white blood cell counts are imputed. Taylor & Francis 2016-02-18 /pmc/articles/PMC4846117/ /pubmed/26890800 http://dx.doi.org/10.1080/15592294.2016.1145328 Text en © 2016 The Author(s). Published with license by Taylor & Francis Group, LLC http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. The moral rights of the named author(s) have been asserted.
spellingShingle Research Paper
Wu, Chong
Demerath, Ellen W.
Pankow, James S.
Bressler, Jan
Fornage, Myriam
Grove, Megan L.
Chen, Wei
Guan, Weihua
Imputation of missing covariate values in epigenome-wide analysis of DNA methylation data
title Imputation of missing covariate values in epigenome-wide analysis of DNA methylation data
title_full Imputation of missing covariate values in epigenome-wide analysis of DNA methylation data
title_fullStr Imputation of missing covariate values in epigenome-wide analysis of DNA methylation data
title_full_unstemmed Imputation of missing covariate values in epigenome-wide analysis of DNA methylation data
title_short Imputation of missing covariate values in epigenome-wide analysis of DNA methylation data
title_sort imputation of missing covariate values in epigenome-wide analysis of dna methylation data
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4846117/
https://www.ncbi.nlm.nih.gov/pubmed/26890800
http://dx.doi.org/10.1080/15592294.2016.1145328
work_keys_str_mv AT wuchong imputationofmissingcovariatevaluesinepigenomewideanalysisofdnamethylationdata
AT demerathellenw imputationofmissingcovariatevaluesinepigenomewideanalysisofdnamethylationdata
AT pankowjamess imputationofmissingcovariatevaluesinepigenomewideanalysisofdnamethylationdata
AT bresslerjan imputationofmissingcovariatevaluesinepigenomewideanalysisofdnamethylationdata
AT fornagemyriam imputationofmissingcovariatevaluesinepigenomewideanalysisofdnamethylationdata
AT grovemeganl imputationofmissingcovariatevaluesinepigenomewideanalysisofdnamethylationdata
AT chenwei imputationofmissingcovariatevaluesinepigenomewideanalysisofdnamethylationdata
AT guanweihua imputationofmissingcovariatevaluesinepigenomewideanalysisofdnamethylationdata