Cargando…

Integrated analysis of DNA-methylation and gene expression using high-dimensional penalized regression: a cohort study on bone mineral density in postmenopausal women

BACKGROUND: Using high-dimensional penalized regression we studied genome-wide DNA-methylation in bone biopsies of 80 postmenopausal women in relation to their bone mineral density (BMD). The women showed BMD varying from severely osteoporotic to normal. Global gene expression data from the same ind...

Descripción completa

Detalles Bibliográficos
Autores principales: Lien, Tonje G., Borgan, Ørnulf, Reppe, Sjur, Gautvik, Kaare, Glad, Ingrid Kristine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5842543/
https://www.ncbi.nlm.nih.gov/pubmed/29514638
http://dx.doi.org/10.1186/s12920-018-0341-2
_version_ 1783304917542240256
author Lien, Tonje G.
Borgan, Ørnulf
Reppe, Sjur
Gautvik, Kaare
Glad, Ingrid Kristine
author_facet Lien, Tonje G.
Borgan, Ørnulf
Reppe, Sjur
Gautvik, Kaare
Glad, Ingrid Kristine
author_sort Lien, Tonje G.
collection PubMed
description BACKGROUND: Using high-dimensional penalized regression we studied genome-wide DNA-methylation in bone biopsies of 80 postmenopausal women in relation to their bone mineral density (BMD). The women showed BMD varying from severely osteoporotic to normal. Global gene expression data from the same individuals was available, and since DNA-methylation often affects gene expression, the overall aim of this paper was to include both of these omics data sets into an integrated analysis. METHODS: The classical penalized regression uses one penalty, but we incorporated individual penalties for each of the DNA-methylation sites. These individual penalties were guided by the strength of association between DNA-methylations and gene transcript levels. DNA-methylations that were highly associated to one or more transcripts got lower penalties and were therefore favored compared to DNA-methylations showing less association to expression. Because of the complex pathways and interactions among genes, we investigated both the association between DNA-methylations and their corresponding cis gene, as well as the association between DNA-methylations and trans-located genes. Two integrating penalized methods were used: first, an adaptive group-regularized ridge regression, and secondly, variable selection was performed through a modified version of the weighted lasso. RESULTS: When information from gene expressions was integrated, predictive performance was considerably improved, in terms of predictive mean square error, compared to classical penalized regression without data integration. We found a 14.7% improvement in the ridge regression case and a 17% improvement for the lasso case. Our version of the weighted lasso with data integration found a list of 22 interesting methylation sites. Several corresponded to genes that are known to be important in bone formation. Using BMD as response and these 22 methylation sites as covariates, least square regression analyses resulted in R(2)=0.726, comparable to an average R(2)=0.438 for 10000 randomly selected groups of DNA-methylations with group size 22. CONCLUSIONS: Two recent types of penalized regression methods were adapted to integrate DNA-methylation and their association to gene expression in the analysis of bone mineral density. In both cases predictions clearly benefit from including the additional information on gene expressions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12920-018-0341-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5842543
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58425432018-03-14 Integrated analysis of DNA-methylation and gene expression using high-dimensional penalized regression: a cohort study on bone mineral density in postmenopausal women Lien, Tonje G. Borgan, Ørnulf Reppe, Sjur Gautvik, Kaare Glad, Ingrid Kristine BMC Med Genomics Research Article BACKGROUND: Using high-dimensional penalized regression we studied genome-wide DNA-methylation in bone biopsies of 80 postmenopausal women in relation to their bone mineral density (BMD). The women showed BMD varying from severely osteoporotic to normal. Global gene expression data from the same individuals was available, and since DNA-methylation often affects gene expression, the overall aim of this paper was to include both of these omics data sets into an integrated analysis. METHODS: The classical penalized regression uses one penalty, but we incorporated individual penalties for each of the DNA-methylation sites. These individual penalties were guided by the strength of association between DNA-methylations and gene transcript levels. DNA-methylations that were highly associated to one or more transcripts got lower penalties and were therefore favored compared to DNA-methylations showing less association to expression. Because of the complex pathways and interactions among genes, we investigated both the association between DNA-methylations and their corresponding cis gene, as well as the association between DNA-methylations and trans-located genes. Two integrating penalized methods were used: first, an adaptive group-regularized ridge regression, and secondly, variable selection was performed through a modified version of the weighted lasso. RESULTS: When information from gene expressions was integrated, predictive performance was considerably improved, in terms of predictive mean square error, compared to classical penalized regression without data integration. We found a 14.7% improvement in the ridge regression case and a 17% improvement for the lasso case. Our version of the weighted lasso with data integration found a list of 22 interesting methylation sites. Several corresponded to genes that are known to be important in bone formation. Using BMD as response and these 22 methylation sites as covariates, least square regression analyses resulted in R(2)=0.726, comparable to an average R(2)=0.438 for 10000 randomly selected groups of DNA-methylations with group size 22. CONCLUSIONS: Two recent types of penalized regression methods were adapted to integrate DNA-methylation and their association to gene expression in the analysis of bone mineral density. In both cases predictions clearly benefit from including the additional information on gene expressions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12920-018-0341-2) contains supplementary material, which is available to authorized users. BioMed Central 2018-03-07 /pmc/articles/PMC5842543/ /pubmed/29514638 http://dx.doi.org/10.1186/s12920-018-0341-2 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Lien, Tonje G.
Borgan, Ørnulf
Reppe, Sjur
Gautvik, Kaare
Glad, Ingrid Kristine
Integrated analysis of DNA-methylation and gene expression using high-dimensional penalized regression: a cohort study on bone mineral density in postmenopausal women
title Integrated analysis of DNA-methylation and gene expression using high-dimensional penalized regression: a cohort study on bone mineral density in postmenopausal women
title_full Integrated analysis of DNA-methylation and gene expression using high-dimensional penalized regression: a cohort study on bone mineral density in postmenopausal women
title_fullStr Integrated analysis of DNA-methylation and gene expression using high-dimensional penalized regression: a cohort study on bone mineral density in postmenopausal women
title_full_unstemmed Integrated analysis of DNA-methylation and gene expression using high-dimensional penalized regression: a cohort study on bone mineral density in postmenopausal women
title_short Integrated analysis of DNA-methylation and gene expression using high-dimensional penalized regression: a cohort study on bone mineral density in postmenopausal women
title_sort integrated analysis of dna-methylation and gene expression using high-dimensional penalized regression: a cohort study on bone mineral density in postmenopausal women
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5842543/
https://www.ncbi.nlm.nih.gov/pubmed/29514638
http://dx.doi.org/10.1186/s12920-018-0341-2
work_keys_str_mv AT lientonjeg integratedanalysisofdnamethylationandgeneexpressionusinghighdimensionalpenalizedregressionacohortstudyonbonemineraldensityinpostmenopausalwomen
AT borganørnulf integratedanalysisofdnamethylationandgeneexpressionusinghighdimensionalpenalizedregressionacohortstudyonbonemineraldensityinpostmenopausalwomen
AT reppesjur integratedanalysisofdnamethylationandgeneexpressionusinghighdimensionalpenalizedregressionacohortstudyonbonemineraldensityinpostmenopausalwomen
AT gautvikkaare integratedanalysisofdnamethylationandgeneexpressionusinghighdimensionalpenalizedregressionacohortstudyonbonemineraldensityinpostmenopausalwomen
AT gladingridkristine integratedanalysisofdnamethylationandgeneexpressionusinghighdimensionalpenalizedregressionacohortstudyonbonemineraldensityinpostmenopausalwomen