Cargando…

Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program

INTRODUCTION: Although various lipid and non-lipid analytes measured by nuclear magnetic resonance (NMR) spectroscopy have been associated with type 2 diabetes, a structured comparison of the ability of NMR-derived biomarkers and standard lipids to predict individual diabetes risk has not been under...

Descripción completa

Detalles Bibliográficos
Autores principales: Varga, Tibor V, Liu, Jinxi, Goldberg, Ronald B, Chen, Guannan, Dagogo-Jack, Samuel, Lorenzo, Carlos, Mather, Kieren J, Pi-Sunyer, Xavier, Brunak, Søren, Temprosa, Marinella
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8016090/
https://www.ncbi.nlm.nih.gov/pubmed/33789908
http://dx.doi.org/10.1136/bmjdrc-2020-001953
_version_ 1783673785307627520
author Varga, Tibor V
Liu, Jinxi
Goldberg, Ronald B
Chen, Guannan
Dagogo-Jack, Samuel
Lorenzo, Carlos
Mather, Kieren J
Pi-Sunyer, Xavier
Brunak, Søren
Temprosa, Marinella
author_facet Varga, Tibor V
Liu, Jinxi
Goldberg, Ronald B
Chen, Guannan
Dagogo-Jack, Samuel
Lorenzo, Carlos
Mather, Kieren J
Pi-Sunyer, Xavier
Brunak, Søren
Temprosa, Marinella
author_sort Varga, Tibor V
collection PubMed
description INTRODUCTION: Although various lipid and non-lipid analytes measured by nuclear magnetic resonance (NMR) spectroscopy have been associated with type 2 diabetes, a structured comparison of the ability of NMR-derived biomarkers and standard lipids to predict individual diabetes risk has not been undertaken in larger studies nor among individuals at high risk of diabetes. RESEARCH DESIGN AND METHODS: Cumulative discriminative utilities of various groups of biomarkers including NMR lipoproteins, related non-lipid biomarkers, standard lipids, and demographic and glycemic traits were compared for short-term (3.2 years) and long-term (15 years) diabetes development in the Diabetes Prevention Program, a multiethnic, placebo-controlled, randomized controlled trial of individuals with pre-diabetes in the USA (N=2590). Logistic regression, Cox proportional hazards model and six different hyperparameter-tuned machine learning algorithms were compared. The Matthews Correlation Coefficient (MCC) was used as the primary measure of discriminative utility. RESULTS: Models with baseline NMR analytes and their changes did not improve the discriminative utility of simpler models including standard lipids or demographic and glycemic traits. Across all algorithms, models with baseline 2-hour glucose performed the best (max MCC=0.36). Sophisticated machine learning algorithms performed similarly to logistic regression in this study. CONCLUSIONS: NMR lipoproteins and related non-lipid biomarkers were associated but did not augment discrimination of diabetes risk beyond traditional diabetes risk factors except for 2-hour glucose. Machine learning algorithms provided no meaningful improvement for discrimination compared with logistic regression, which suggests a lack of influential latent interactions among the analytes assessed in this study. TRIAL REGISTRATION NUMBER: Diabetes Prevention Program: NCT00004992; Diabetes Prevention Program Outcomes Study: NCT00038727.
format Online
Article
Text
id pubmed-8016090
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-80160902021-04-21 Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program Varga, Tibor V Liu, Jinxi Goldberg, Ronald B Chen, Guannan Dagogo-Jack, Samuel Lorenzo, Carlos Mather, Kieren J Pi-Sunyer, Xavier Brunak, Søren Temprosa, Marinella BMJ Open Diabetes Res Care Epidemiology/Health services research INTRODUCTION: Although various lipid and non-lipid analytes measured by nuclear magnetic resonance (NMR) spectroscopy have been associated with type 2 diabetes, a structured comparison of the ability of NMR-derived biomarkers and standard lipids to predict individual diabetes risk has not been undertaken in larger studies nor among individuals at high risk of diabetes. RESEARCH DESIGN AND METHODS: Cumulative discriminative utilities of various groups of biomarkers including NMR lipoproteins, related non-lipid biomarkers, standard lipids, and demographic and glycemic traits were compared for short-term (3.2 years) and long-term (15 years) diabetes development in the Diabetes Prevention Program, a multiethnic, placebo-controlled, randomized controlled trial of individuals with pre-diabetes in the USA (N=2590). Logistic regression, Cox proportional hazards model and six different hyperparameter-tuned machine learning algorithms were compared. The Matthews Correlation Coefficient (MCC) was used as the primary measure of discriminative utility. RESULTS: Models with baseline NMR analytes and their changes did not improve the discriminative utility of simpler models including standard lipids or demographic and glycemic traits. Across all algorithms, models with baseline 2-hour glucose performed the best (max MCC=0.36). Sophisticated machine learning algorithms performed similarly to logistic regression in this study. CONCLUSIONS: NMR lipoproteins and related non-lipid biomarkers were associated but did not augment discrimination of diabetes risk beyond traditional diabetes risk factors except for 2-hour glucose. Machine learning algorithms provided no meaningful improvement for discrimination compared with logistic regression, which suggests a lack of influential latent interactions among the analytes assessed in this study. TRIAL REGISTRATION NUMBER: Diabetes Prevention Program: NCT00004992; Diabetes Prevention Program Outcomes Study: NCT00038727. BMJ Publishing Group 2021-03-31 /pmc/articles/PMC8016090/ /pubmed/33789908 http://dx.doi.org/10.1136/bmjdrc-2020-001953 Text en © Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. http://creativecommons.org/licenses/by-nc/4.0/ http://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Epidemiology/Health services research
Varga, Tibor V
Liu, Jinxi
Goldberg, Ronald B
Chen, Guannan
Dagogo-Jack, Samuel
Lorenzo, Carlos
Mather, Kieren J
Pi-Sunyer, Xavier
Brunak, Søren
Temprosa, Marinella
Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program
title Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program
title_full Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program
title_fullStr Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program
title_full_unstemmed Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program
title_short Predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the Diabetes Prevention Program
title_sort predictive utilities of lipid traits, lipoprotein subfractions and other risk factors for incident diabetes: a machine learning approach in the diabetes prevention program
topic Epidemiology/Health services research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8016090/
https://www.ncbi.nlm.nih.gov/pubmed/33789908
http://dx.doi.org/10.1136/bmjdrc-2020-001953
work_keys_str_mv AT vargatiborv predictiveutilitiesoflipidtraitslipoproteinsubfractionsandotherriskfactorsforincidentdiabetesamachinelearningapproachinthediabetespreventionprogram
AT liujinxi predictiveutilitiesoflipidtraitslipoproteinsubfractionsandotherriskfactorsforincidentdiabetesamachinelearningapproachinthediabetespreventionprogram
AT goldbergronaldb predictiveutilitiesoflipidtraitslipoproteinsubfractionsandotherriskfactorsforincidentdiabetesamachinelearningapproachinthediabetespreventionprogram
AT chenguannan predictiveutilitiesoflipidtraitslipoproteinsubfractionsandotherriskfactorsforincidentdiabetesamachinelearningapproachinthediabetespreventionprogram
AT dagogojacksamuel predictiveutilitiesoflipidtraitslipoproteinsubfractionsandotherriskfactorsforincidentdiabetesamachinelearningapproachinthediabetespreventionprogram
AT lorenzocarlos predictiveutilitiesoflipidtraitslipoproteinsubfractionsandotherriskfactorsforincidentdiabetesamachinelearningapproachinthediabetespreventionprogram
AT matherkierenj predictiveutilitiesoflipidtraitslipoproteinsubfractionsandotherriskfactorsforincidentdiabetesamachinelearningapproachinthediabetespreventionprogram
AT pisunyerxavier predictiveutilitiesoflipidtraitslipoproteinsubfractionsandotherriskfactorsforincidentdiabetesamachinelearningapproachinthediabetespreventionprogram
AT brunaksøren predictiveutilitiesoflipidtraitslipoproteinsubfractionsandotherriskfactorsforincidentdiabetesamachinelearningapproachinthediabetespreventionprogram
AT temprosamarinella predictiveutilitiesoflipidtraitslipoproteinsubfractionsandotherriskfactorsforincidentdiabetesamachinelearningapproachinthediabetespreventionprogram
AT predictiveutilitiesoflipidtraitslipoproteinsubfractionsandotherriskfactorsforincidentdiabetesamachinelearningapproachinthediabetespreventionprogram