Cargando…

Longitudinal analysis strategies for modelling epigenetic trajectories

BACKGROUND: DNA methylation levels are known to vary over time, and modelling these trajectories is crucial for our understanding of the biological relevance of these changes over time. However, due to the computational cost of fitting multilevel models across the epigenome, most trajectory modellin...

Descripción completa

Detalles Bibliográficos
Autores principales: Staley, James R, Suderman, Matthew, Simpkin, Andrew J, Gaunt, Tom R, Heron, Jon, Relton, Caroline L, Tilling, Kate
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5913606/
https://www.ncbi.nlm.nih.gov/pubmed/29462323
http://dx.doi.org/10.1093/ije/dyy012
_version_ 1783316572061827072
author Staley, James R
Suderman, Matthew
Simpkin, Andrew J
Gaunt, Tom R
Heron, Jon
Relton, Caroline L
Tilling, Kate
author_facet Staley, James R
Suderman, Matthew
Simpkin, Andrew J
Gaunt, Tom R
Heron, Jon
Relton, Caroline L
Tilling, Kate
author_sort Staley, James R
collection PubMed
description BACKGROUND: DNA methylation levels are known to vary over time, and modelling these trajectories is crucial for our understanding of the biological relevance of these changes over time. However, due to the computational cost of fitting multilevel models across the epigenome, most trajectory modelling efforts to date have focused on a subset of CpG sites identified through epigenome-wide association studies (EWAS) at individual time-points. METHODS: We propose using linear regression across the repeated measures, estimating cluster-robust standard errors using a sandwich estimator, as a less computationally intensive strategy than multilevel modelling. We compared these two longitudinal approaches, as well as three approaches based on EWAS (associated at baseline, at any time-point and at all time-points), for identifying epigenetic change over time related to an exposure using simulations and by applying them to blood DNA methylation profiles from the Accessible Resource for Integrated Epigenomics Studies (ARIES). RESULTS: Restricting association testing to EWAS at baseline identified a less complete set of associations than performing EWAS at each time-point or applying the longitudinal modelling approaches to the full dataset. Linear regression models with cluster-robust standard errors identified similar sets of associations with almost identical estimates of effect as the multilevel models, while also being 74 times more efficient. Both longitudinal modelling approaches identified comparable sets of CpG sites in ARIES with an association with prenatal exposure to smoking (>70% agreement). CONCLUSIONS: Linear regression with cluster-robust standard errors is an appropriate and efficient approach for longitudinal analysis of DNA methylation data.
format Online
Article
Text
id pubmed-5913606
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-59136062018-04-30 Longitudinal analysis strategies for modelling epigenetic trajectories Staley, James R Suderman, Matthew Simpkin, Andrew J Gaunt, Tom R Heron, Jon Relton, Caroline L Tilling, Kate Int J Epidemiol Genetic Epidemiology BACKGROUND: DNA methylation levels are known to vary over time, and modelling these trajectories is crucial for our understanding of the biological relevance of these changes over time. However, due to the computational cost of fitting multilevel models across the epigenome, most trajectory modelling efforts to date have focused on a subset of CpG sites identified through epigenome-wide association studies (EWAS) at individual time-points. METHODS: We propose using linear regression across the repeated measures, estimating cluster-robust standard errors using a sandwich estimator, as a less computationally intensive strategy than multilevel modelling. We compared these two longitudinal approaches, as well as three approaches based on EWAS (associated at baseline, at any time-point and at all time-points), for identifying epigenetic change over time related to an exposure using simulations and by applying them to blood DNA methylation profiles from the Accessible Resource for Integrated Epigenomics Studies (ARIES). RESULTS: Restricting association testing to EWAS at baseline identified a less complete set of associations than performing EWAS at each time-point or applying the longitudinal modelling approaches to the full dataset. Linear regression models with cluster-robust standard errors identified similar sets of associations with almost identical estimates of effect as the multilevel models, while also being 74 times more efficient. Both longitudinal modelling approaches identified comparable sets of CpG sites in ARIES with an association with prenatal exposure to smoking (>70% agreement). CONCLUSIONS: Linear regression with cluster-robust standard errors is an appropriate and efficient approach for longitudinal analysis of DNA methylation data. Oxford University Press 2018-04 2018-02-16 /pmc/articles/PMC5913606/ /pubmed/29462323 http://dx.doi.org/10.1093/ije/dyy012 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of the International Epidemiological Association. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genetic Epidemiology
Staley, James R
Suderman, Matthew
Simpkin, Andrew J
Gaunt, Tom R
Heron, Jon
Relton, Caroline L
Tilling, Kate
Longitudinal analysis strategies for modelling epigenetic trajectories
title Longitudinal analysis strategies for modelling epigenetic trajectories
title_full Longitudinal analysis strategies for modelling epigenetic trajectories
title_fullStr Longitudinal analysis strategies for modelling epigenetic trajectories
title_full_unstemmed Longitudinal analysis strategies for modelling epigenetic trajectories
title_short Longitudinal analysis strategies for modelling epigenetic trajectories
title_sort longitudinal analysis strategies for modelling epigenetic trajectories
topic Genetic Epidemiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5913606/
https://www.ncbi.nlm.nih.gov/pubmed/29462323
http://dx.doi.org/10.1093/ije/dyy012
work_keys_str_mv AT staleyjamesr longitudinalanalysisstrategiesformodellingepigenetictrajectories
AT sudermanmatthew longitudinalanalysisstrategiesformodellingepigenetictrajectories
AT simpkinandrewj longitudinalanalysisstrategiesformodellingepigenetictrajectories
AT gaunttomr longitudinalanalysisstrategiesformodellingepigenetictrajectories
AT heronjon longitudinalanalysisstrategiesformodellingepigenetictrajectories
AT reltoncarolinel longitudinalanalysisstrategiesformodellingepigenetictrajectories
AT tillingkate longitudinalanalysisstrategiesformodellingepigenetictrajectories