Cargando…

Reference-free cell mixture adjustments in analysis of DNA methylation data

Motivation: Recently there has been increasing interest in the effects of cell mixture on the measurement of DNA methylation, specifically the extent to which small perturbations in cell mixture proportions can register as changes in DNA methylation. A recently published set of statistical methods e...

Descripción completa

Detalles Bibliográficos
Autores principales: Houseman, Eugene Andres, Molitor, John, Marsit, Carmen J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4016702/
https://www.ncbi.nlm.nih.gov/pubmed/24451622
http://dx.doi.org/10.1093/bioinformatics/btu029
_version_ 1782315552684900352
author Houseman, Eugene Andres
Molitor, John
Marsit, Carmen J.
author_facet Houseman, Eugene Andres
Molitor, John
Marsit, Carmen J.
author_sort Houseman, Eugene Andres
collection PubMed
description Motivation: Recently there has been increasing interest in the effects of cell mixture on the measurement of DNA methylation, specifically the extent to which small perturbations in cell mixture proportions can register as changes in DNA methylation. A recently published set of statistical methods exploits this association to infer changes in cell mixture proportions, and these methods are presently being applied to adjust for cell mixture effect in the context of epigenome-wide association studies. However, these adjustments require the existence of reference datasets, which may be laborious or expensive to collect. For some tissues such as placenta, saliva, adipose or tumor tissue, the relevant underlying cell types may not be known. Results: We propose a method for conducting epigenome-wide association studies analysis when a reference dataset is unavailable, including a bootstrap method for estimating standard errors. We demonstrate via simulation study and several real data analyses that our proposed method can perform as well as or better than methods that make explicit use of reference datasets. In particular, it may adjust for detailed cell type differences that may be unavailable even in existing reference datasets. Availability and implementation: Software is available in the R package RefFreeEWAS. Data for three of four examples were obtained from Gene Expression Omnibus (GEO), accession numbers GSE37008, GSE42861 and GSE30601, while reference data were obtained from GEO accession number GSE39981. Contact: andres.houseman@oregonstate.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4016702
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40167022014-05-12 Reference-free cell mixture adjustments in analysis of DNA methylation data Houseman, Eugene Andres Molitor, John Marsit, Carmen J. Bioinformatics Original Papers Motivation: Recently there has been increasing interest in the effects of cell mixture on the measurement of DNA methylation, specifically the extent to which small perturbations in cell mixture proportions can register as changes in DNA methylation. A recently published set of statistical methods exploits this association to infer changes in cell mixture proportions, and these methods are presently being applied to adjust for cell mixture effect in the context of epigenome-wide association studies. However, these adjustments require the existence of reference datasets, which may be laborious or expensive to collect. For some tissues such as placenta, saliva, adipose or tumor tissue, the relevant underlying cell types may not be known. Results: We propose a method for conducting epigenome-wide association studies analysis when a reference dataset is unavailable, including a bootstrap method for estimating standard errors. We demonstrate via simulation study and several real data analyses that our proposed method can perform as well as or better than methods that make explicit use of reference datasets. In particular, it may adjust for detailed cell type differences that may be unavailable even in existing reference datasets. Availability and implementation: Software is available in the R package RefFreeEWAS. Data for three of four examples were obtained from Gene Expression Omnibus (GEO), accession numbers GSE37008, GSE42861 and GSE30601, while reference data were obtained from GEO accession number GSE39981. Contact: andres.houseman@oregonstate.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-05-15 2014-01-21 /pmc/articles/PMC4016702/ /pubmed/24451622 http://dx.doi.org/10.1093/bioinformatics/btu029 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Houseman, Eugene Andres
Molitor, John
Marsit, Carmen J.
Reference-free cell mixture adjustments in analysis of DNA methylation data
title Reference-free cell mixture adjustments in analysis of DNA methylation data
title_full Reference-free cell mixture adjustments in analysis of DNA methylation data
title_fullStr Reference-free cell mixture adjustments in analysis of DNA methylation data
title_full_unstemmed Reference-free cell mixture adjustments in analysis of DNA methylation data
title_short Reference-free cell mixture adjustments in analysis of DNA methylation data
title_sort reference-free cell mixture adjustments in analysis of dna methylation data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4016702/
https://www.ncbi.nlm.nih.gov/pubmed/24451622
http://dx.doi.org/10.1093/bioinformatics/btu029
work_keys_str_mv AT housemaneugeneandres referencefreecellmixtureadjustmentsinanalysisofdnamethylationdata
AT molitorjohn referencefreecellmixtureadjustmentsinanalysisofdnamethylationdata
AT marsitcarmenj referencefreecellmixtureadjustmentsinanalysisofdnamethylationdata