Cargando…

Machine-learning to stratify diabetic patients using novel cardiac biomarkers and integrative genomics

BACKGROUND: Diabetes mellitus is a chronic disease that impacts an increasing percentage of people each year. Among its comorbidities, diabetics are two to four times more likely to develop cardiovascular diseases. While HbA1c remains the primary diagnostic for diabetics, its ability to predict long...

Descripción completa

Detalles Bibliográficos
Autores principales: Hathaway, Quincy A., Roth, Skyler M., Pinti, Mark V., Sprando, Daniel C., Kunovac, Amina, Durr, Andrya J., Cook, Chris C., Fink, Garrett K., Cheuvront, Tristen B., Grossman, Jasmine H., Aljahli, Ghadah A., Taylor, Andrew D., Giromini, Andrew P., Allen, Jessica L., Hollander, John M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6560734/
https://www.ncbi.nlm.nih.gov/pubmed/31185988
http://dx.doi.org/10.1186/s12933-019-0879-0
_version_ 1783426007909269504
author Hathaway, Quincy A.
Roth, Skyler M.
Pinti, Mark V.
Sprando, Daniel C.
Kunovac, Amina
Durr, Andrya J.
Cook, Chris C.
Fink, Garrett K.
Cheuvront, Tristen B.
Grossman, Jasmine H.
Aljahli, Ghadah A.
Taylor, Andrew D.
Giromini, Andrew P.
Allen, Jessica L.
Hollander, John M.
author_facet Hathaway, Quincy A.
Roth, Skyler M.
Pinti, Mark V.
Sprando, Daniel C.
Kunovac, Amina
Durr, Andrya J.
Cook, Chris C.
Fink, Garrett K.
Cheuvront, Tristen B.
Grossman, Jasmine H.
Aljahli, Ghadah A.
Taylor, Andrew D.
Giromini, Andrew P.
Allen, Jessica L.
Hollander, John M.
author_sort Hathaway, Quincy A.
collection PubMed
description BACKGROUND: Diabetes mellitus is a chronic disease that impacts an increasing percentage of people each year. Among its comorbidities, diabetics are two to four times more likely to develop cardiovascular diseases. While HbA1c remains the primary diagnostic for diabetics, its ability to predict long-term, health outcomes across diverse demographics, ethnic groups, and at a personalized level are limited. The purpose of this study was to provide a model for precision medicine through the implementation of machine-learning algorithms using multiple cardiac biomarkers as a means for predicting diabetes mellitus development. METHODS: Right atrial appendages from 50 patients, 30 non-diabetic and 20 type 2 diabetic, were procured from the WVU Ruby Memorial Hospital. Machine-learning was applied to physiological, biochemical, and sequencing data for each patient. Supervised learning implementing SHapley Additive exPlanations (SHAP) allowed binary (no diabetes or type 2 diabetes) and multiple classification (no diabetes, prediabetes, and type 2 diabetes) of the patient cohort with and without the inclusion of HbA1c levels. Findings were validated through Logistic Regression (LR), Linear Discriminant Analysis (LDA), Gaussian Naïve Bayes (NB), Support Vector Machine (SVM), and Classification and Regression Tree (CART) models with tenfold cross validation. RESULTS: Total nuclear methylation and hydroxymethylation were highly correlated to diabetic status, with nuclear methylation and mitochondrial electron transport chain (ETC) activities achieving superior testing accuracies in the predictive model (~ 84% testing, binary). Mitochondrial DNA SNPs found in the D-Loop region (SNP-73G, -16126C, and -16362C) were highly associated with diabetes mellitus. The CpG island of transcription factor A, mitochondrial (TFAM) revealed CpG24 (chr10:58385262, P = 0.003) and CpG29 (chr10:58385324, P = 0.001) as markers correlating with diabetic progression. When combining the most predictive factors from each set, total nuclear methylation and CpG24 methylation were the best diagnostic measures in both binary and multiple classification sets. CONCLUSIONS: Using machine-learning, we were able to identify novel as well as the most relevant biomarkers associated with type 2 diabetes mellitus by integrating physiological, biochemical, and sequencing datasets. Ultimately, this approach may be used as a guideline for future investigations into disease pathogenesis and novel biomarker discovery. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12933-019-0879-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6560734
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65607342019-06-14 Machine-learning to stratify diabetic patients using novel cardiac biomarkers and integrative genomics Hathaway, Quincy A. Roth, Skyler M. Pinti, Mark V. Sprando, Daniel C. Kunovac, Amina Durr, Andrya J. Cook, Chris C. Fink, Garrett K. Cheuvront, Tristen B. Grossman, Jasmine H. Aljahli, Ghadah A. Taylor, Andrew D. Giromini, Andrew P. Allen, Jessica L. Hollander, John M. Cardiovasc Diabetol Original Investigation BACKGROUND: Diabetes mellitus is a chronic disease that impacts an increasing percentage of people each year. Among its comorbidities, diabetics are two to four times more likely to develop cardiovascular diseases. While HbA1c remains the primary diagnostic for diabetics, its ability to predict long-term, health outcomes across diverse demographics, ethnic groups, and at a personalized level are limited. The purpose of this study was to provide a model for precision medicine through the implementation of machine-learning algorithms using multiple cardiac biomarkers as a means for predicting diabetes mellitus development. METHODS: Right atrial appendages from 50 patients, 30 non-diabetic and 20 type 2 diabetic, were procured from the WVU Ruby Memorial Hospital. Machine-learning was applied to physiological, biochemical, and sequencing data for each patient. Supervised learning implementing SHapley Additive exPlanations (SHAP) allowed binary (no diabetes or type 2 diabetes) and multiple classification (no diabetes, prediabetes, and type 2 diabetes) of the patient cohort with and without the inclusion of HbA1c levels. Findings were validated through Logistic Regression (LR), Linear Discriminant Analysis (LDA), Gaussian Naïve Bayes (NB), Support Vector Machine (SVM), and Classification and Regression Tree (CART) models with tenfold cross validation. RESULTS: Total nuclear methylation and hydroxymethylation were highly correlated to diabetic status, with nuclear methylation and mitochondrial electron transport chain (ETC) activities achieving superior testing accuracies in the predictive model (~ 84% testing, binary). Mitochondrial DNA SNPs found in the D-Loop region (SNP-73G, -16126C, and -16362C) were highly associated with diabetes mellitus. The CpG island of transcription factor A, mitochondrial (TFAM) revealed CpG24 (chr10:58385262, P = 0.003) and CpG29 (chr10:58385324, P = 0.001) as markers correlating with diabetic progression. When combining the most predictive factors from each set, total nuclear methylation and CpG24 methylation were the best diagnostic measures in both binary and multiple classification sets. CONCLUSIONS: Using machine-learning, we were able to identify novel as well as the most relevant biomarkers associated with type 2 diabetes mellitus by integrating physiological, biochemical, and sequencing datasets. Ultimately, this approach may be used as a guideline for future investigations into disease pathogenesis and novel biomarker discovery. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12933-019-0879-0) contains supplementary material, which is available to authorized users. BioMed Central 2019-06-11 /pmc/articles/PMC6560734/ /pubmed/31185988 http://dx.doi.org/10.1186/s12933-019-0879-0 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Original Investigation
Hathaway, Quincy A.
Roth, Skyler M.
Pinti, Mark V.
Sprando, Daniel C.
Kunovac, Amina
Durr, Andrya J.
Cook, Chris C.
Fink, Garrett K.
Cheuvront, Tristen B.
Grossman, Jasmine H.
Aljahli, Ghadah A.
Taylor, Andrew D.
Giromini, Andrew P.
Allen, Jessica L.
Hollander, John M.
Machine-learning to stratify diabetic patients using novel cardiac biomarkers and integrative genomics
title Machine-learning to stratify diabetic patients using novel cardiac biomarkers and integrative genomics
title_full Machine-learning to stratify diabetic patients using novel cardiac biomarkers and integrative genomics
title_fullStr Machine-learning to stratify diabetic patients using novel cardiac biomarkers and integrative genomics
title_full_unstemmed Machine-learning to stratify diabetic patients using novel cardiac biomarkers and integrative genomics
title_short Machine-learning to stratify diabetic patients using novel cardiac biomarkers and integrative genomics
title_sort machine-learning to stratify diabetic patients using novel cardiac biomarkers and integrative genomics
topic Original Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6560734/
https://www.ncbi.nlm.nih.gov/pubmed/31185988
http://dx.doi.org/10.1186/s12933-019-0879-0
work_keys_str_mv AT hathawayquincya machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT rothskylerm machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT pintimarkv machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT sprandodanielc machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT kunovacamina machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT durrandryaj machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT cookchrisc machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT finkgarrettk machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT cheuvronttristenb machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT grossmanjasmineh machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT aljahlighadaha machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT taylorandrewd machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT girominiandrewp machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT allenjessical machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics
AT hollanderjohnm machinelearningtostratifydiabeticpatientsusingnovelcardiacbiomarkersandintegrativegenomics