Cargando…
Integrated genetic and epigenetic prediction of coronary heart disease in the Framingham Heart Study
An improved method for detecting coronary heart disease (CHD) could have substantial clinical impact. Building on the idea that systemic effects of CHD risk factors are a conglomeration of genetic and environmental factors, we use machine learning techniques and integrate genetic, epigenetic and phe...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5749823/ https://www.ncbi.nlm.nih.gov/pubmed/29293675 http://dx.doi.org/10.1371/journal.pone.0190549 |
_version_ | 1783289645112492032 |
---|---|
author | Dogan, Meeshanthini V. Grumbach, Isabella M. Michaelson, Jacob J. Philibert, Robert A. |
author_facet | Dogan, Meeshanthini V. Grumbach, Isabella M. Michaelson, Jacob J. Philibert, Robert A. |
author_sort | Dogan, Meeshanthini V. |
collection | PubMed |
description | An improved method for detecting coronary heart disease (CHD) could have substantial clinical impact. Building on the idea that systemic effects of CHD risk factors are a conglomeration of genetic and environmental factors, we use machine learning techniques and integrate genetic, epigenetic and phenotype data from the Framingham Heart Study to build and test a Random Forest classification model for symptomatic CHD. Our classifier was trained on n = 1,545 individuals and consisted of four DNA methylation sites, two SNPs, age and gender. The methylation sites and SNPs were selected during the training phase. The final trained model was then tested on n = 142 individuals. The test data comprised of individuals removed based on relatedness to those in the training dataset. This integrated classifier was capable of classifying symptomatic CHD status of those in the test set with an accuracy, sensitivity and specificity of 78%, 0.75 and 0.80, respectively. In contrast, a model using only conventional CHD risk factors as predictors had an accuracy and sensitivity of only 65% and 0.42, respectively, but with a specificity of 0.89 in the test set. Regression analyses of the methylation signatures illustrate our ability to map these signatures to known risk factors in CHD pathogenesis. These results demonstrate the capability of an integrated approach to effectively model symptomatic CHD status. These results also suggest that future studies of biomaterial collected from longitudinally informative cohorts that are specifically characterized for cardiac disease at follow-up could lead to the introduction of sensitive, readily employable integrated genetic-epigenetic algorithms for predicting onset of future symptomatic CHD. |
format | Online Article Text |
id | pubmed-5749823 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-57498232018-01-26 Integrated genetic and epigenetic prediction of coronary heart disease in the Framingham Heart Study Dogan, Meeshanthini V. Grumbach, Isabella M. Michaelson, Jacob J. Philibert, Robert A. PLoS One Research Article An improved method for detecting coronary heart disease (CHD) could have substantial clinical impact. Building on the idea that systemic effects of CHD risk factors are a conglomeration of genetic and environmental factors, we use machine learning techniques and integrate genetic, epigenetic and phenotype data from the Framingham Heart Study to build and test a Random Forest classification model for symptomatic CHD. Our classifier was trained on n = 1,545 individuals and consisted of four DNA methylation sites, two SNPs, age and gender. The methylation sites and SNPs were selected during the training phase. The final trained model was then tested on n = 142 individuals. The test data comprised of individuals removed based on relatedness to those in the training dataset. This integrated classifier was capable of classifying symptomatic CHD status of those in the test set with an accuracy, sensitivity and specificity of 78%, 0.75 and 0.80, respectively. In contrast, a model using only conventional CHD risk factors as predictors had an accuracy and sensitivity of only 65% and 0.42, respectively, but with a specificity of 0.89 in the test set. Regression analyses of the methylation signatures illustrate our ability to map these signatures to known risk factors in CHD pathogenesis. These results demonstrate the capability of an integrated approach to effectively model symptomatic CHD status. These results also suggest that future studies of biomaterial collected from longitudinally informative cohorts that are specifically characterized for cardiac disease at follow-up could lead to the introduction of sensitive, readily employable integrated genetic-epigenetic algorithms for predicting onset of future symptomatic CHD. Public Library of Science 2018-01-02 /pmc/articles/PMC5749823/ /pubmed/29293675 http://dx.doi.org/10.1371/journal.pone.0190549 Text en © 2018 Dogan et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Dogan, Meeshanthini V. Grumbach, Isabella M. Michaelson, Jacob J. Philibert, Robert A. Integrated genetic and epigenetic prediction of coronary heart disease in the Framingham Heart Study |
title | Integrated genetic and epigenetic prediction of coronary heart disease in the Framingham Heart Study |
title_full | Integrated genetic and epigenetic prediction of coronary heart disease in the Framingham Heart Study |
title_fullStr | Integrated genetic and epigenetic prediction of coronary heart disease in the Framingham Heart Study |
title_full_unstemmed | Integrated genetic and epigenetic prediction of coronary heart disease in the Framingham Heart Study |
title_short | Integrated genetic and epigenetic prediction of coronary heart disease in the Framingham Heart Study |
title_sort | integrated genetic and epigenetic prediction of coronary heart disease in the framingham heart study |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5749823/ https://www.ncbi.nlm.nih.gov/pubmed/29293675 http://dx.doi.org/10.1371/journal.pone.0190549 |
work_keys_str_mv | AT doganmeeshanthiniv integratedgeneticandepigeneticpredictionofcoronaryheartdiseaseintheframinghamheartstudy AT grumbachisabellam integratedgeneticandepigeneticpredictionofcoronaryheartdiseaseintheframinghamheartstudy AT michaelsonjacobj integratedgeneticandepigeneticpredictionofcoronaryheartdiseaseintheframinghamheartstudy AT philibertroberta integratedgeneticandepigeneticpredictionofcoronaryheartdiseaseintheframinghamheartstudy |