Cargando…
Longitudinal analysis strategies for modelling epigenetic trajectories
BACKGROUND: DNA methylation levels are known to vary over time, and modelling these trajectories is crucial for our understanding of the biological relevance of these changes over time. However, due to the computational cost of fitting multilevel models across the epigenome, most trajectory modellin...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5913606/ https://www.ncbi.nlm.nih.gov/pubmed/29462323 http://dx.doi.org/10.1093/ije/dyy012 |
_version_ | 1783316572061827072 |
---|---|
author | Staley, James R Suderman, Matthew Simpkin, Andrew J Gaunt, Tom R Heron, Jon Relton, Caroline L Tilling, Kate |
author_facet | Staley, James R Suderman, Matthew Simpkin, Andrew J Gaunt, Tom R Heron, Jon Relton, Caroline L Tilling, Kate |
author_sort | Staley, James R |
collection | PubMed |
description | BACKGROUND: DNA methylation levels are known to vary over time, and modelling these trajectories is crucial for our understanding of the biological relevance of these changes over time. However, due to the computational cost of fitting multilevel models across the epigenome, most trajectory modelling efforts to date have focused on a subset of CpG sites identified through epigenome-wide association studies (EWAS) at individual time-points. METHODS: We propose using linear regression across the repeated measures, estimating cluster-robust standard errors using a sandwich estimator, as a less computationally intensive strategy than multilevel modelling. We compared these two longitudinal approaches, as well as three approaches based on EWAS (associated at baseline, at any time-point and at all time-points), for identifying epigenetic change over time related to an exposure using simulations and by applying them to blood DNA methylation profiles from the Accessible Resource for Integrated Epigenomics Studies (ARIES). RESULTS: Restricting association testing to EWAS at baseline identified a less complete set of associations than performing EWAS at each time-point or applying the longitudinal modelling approaches to the full dataset. Linear regression models with cluster-robust standard errors identified similar sets of associations with almost identical estimates of effect as the multilevel models, while also being 74 times more efficient. Both longitudinal modelling approaches identified comparable sets of CpG sites in ARIES with an association with prenatal exposure to smoking (>70% agreement). CONCLUSIONS: Linear regression with cluster-robust standard errors is an appropriate and efficient approach for longitudinal analysis of DNA methylation data. |
format | Online Article Text |
id | pubmed-5913606 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-59136062018-04-30 Longitudinal analysis strategies for modelling epigenetic trajectories Staley, James R Suderman, Matthew Simpkin, Andrew J Gaunt, Tom R Heron, Jon Relton, Caroline L Tilling, Kate Int J Epidemiol Genetic Epidemiology BACKGROUND: DNA methylation levels are known to vary over time, and modelling these trajectories is crucial for our understanding of the biological relevance of these changes over time. However, due to the computational cost of fitting multilevel models across the epigenome, most trajectory modelling efforts to date have focused on a subset of CpG sites identified through epigenome-wide association studies (EWAS) at individual time-points. METHODS: We propose using linear regression across the repeated measures, estimating cluster-robust standard errors using a sandwich estimator, as a less computationally intensive strategy than multilevel modelling. We compared these two longitudinal approaches, as well as three approaches based on EWAS (associated at baseline, at any time-point and at all time-points), for identifying epigenetic change over time related to an exposure using simulations and by applying them to blood DNA methylation profiles from the Accessible Resource for Integrated Epigenomics Studies (ARIES). RESULTS: Restricting association testing to EWAS at baseline identified a less complete set of associations than performing EWAS at each time-point or applying the longitudinal modelling approaches to the full dataset. Linear regression models with cluster-robust standard errors identified similar sets of associations with almost identical estimates of effect as the multilevel models, while also being 74 times more efficient. Both longitudinal modelling approaches identified comparable sets of CpG sites in ARIES with an association with prenatal exposure to smoking (>70% agreement). CONCLUSIONS: Linear regression with cluster-robust standard errors is an appropriate and efficient approach for longitudinal analysis of DNA methylation data. Oxford University Press 2018-04 2018-02-16 /pmc/articles/PMC5913606/ /pubmed/29462323 http://dx.doi.org/10.1093/ije/dyy012 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of the International Epidemiological Association. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Genetic Epidemiology Staley, James R Suderman, Matthew Simpkin, Andrew J Gaunt, Tom R Heron, Jon Relton, Caroline L Tilling, Kate Longitudinal analysis strategies for modelling epigenetic trajectories |
title | Longitudinal analysis strategies for modelling epigenetic trajectories |
title_full | Longitudinal analysis strategies for modelling epigenetic trajectories |
title_fullStr | Longitudinal analysis strategies for modelling epigenetic trajectories |
title_full_unstemmed | Longitudinal analysis strategies for modelling epigenetic trajectories |
title_short | Longitudinal analysis strategies for modelling epigenetic trajectories |
title_sort | longitudinal analysis strategies for modelling epigenetic trajectories |
topic | Genetic Epidemiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5913606/ https://www.ncbi.nlm.nih.gov/pubmed/29462323 http://dx.doi.org/10.1093/ije/dyy012 |
work_keys_str_mv | AT staleyjamesr longitudinalanalysisstrategiesformodellingepigenetictrajectories AT sudermanmatthew longitudinalanalysisstrategiesformodellingepigenetictrajectories AT simpkinandrewj longitudinalanalysisstrategiesformodellingepigenetictrajectories AT gaunttomr longitudinalanalysisstrategiesformodellingepigenetictrajectories AT heronjon longitudinalanalysisstrategiesformodellingepigenetictrajectories AT reltoncarolinel longitudinalanalysisstrategiesformodellingepigenetictrajectories AT tillingkate longitudinalanalysisstrategiesformodellingepigenetictrajectories |