Cargando…

Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease

AIM: Predicting progression in diabetic kidney disease (DKD) is critical to improving outcomes. We sought to develop/validate a machine-learned, prognostic risk score (KidneyIntelX™) combining electronic health records (EHR) and biomarkers. METHODS: This is an observational cohort study of patients...

Descripción completa

Detalles Bibliográficos
Autores principales: Chan, Lili, Nadkarni, Girish N., Fleming, Fergus, McCullough, James R., Connolly, Patricia, Mosoyan, Gohar, El Salem, Fadi, Kattan, Michael W., Vassalotti, Joseph A., Murphy, Barbara, Donovan, Michael J., Coca, Steven G., Damrauer, Scott M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8187208/
https://www.ncbi.nlm.nih.gov/pubmed/33797560
http://dx.doi.org/10.1007/s00125-021-05444-0
_version_ 1783705097567469568
author Chan, Lili
Nadkarni, Girish N.
Fleming, Fergus
McCullough, James R.
Connolly, Patricia
Mosoyan, Gohar
El Salem, Fadi
Kattan, Michael W.
Vassalotti, Joseph A.
Murphy, Barbara
Donovan, Michael J.
Coca, Steven G.
Damrauer, Scott M.
author_facet Chan, Lili
Nadkarni, Girish N.
Fleming, Fergus
McCullough, James R.
Connolly, Patricia
Mosoyan, Gohar
El Salem, Fadi
Kattan, Michael W.
Vassalotti, Joseph A.
Murphy, Barbara
Donovan, Michael J.
Coca, Steven G.
Damrauer, Scott M.
author_sort Chan, Lili
collection PubMed
description AIM: Predicting progression in diabetic kidney disease (DKD) is critical to improving outcomes. We sought to develop/validate a machine-learned, prognostic risk score (KidneyIntelX™) combining electronic health records (EHR) and biomarkers. METHODS: This is an observational cohort study of patients with prevalent DKD/banked plasma from two EHR-linked biobanks. A random forest model was trained, and performance (AUC, positive and negative predictive values [PPV/NPV], and net reclassification index [NRI]) was compared with that of a clinical model and Kidney Disease: Improving Global Outcomes (KDIGO) categories for predicting a composite outcome of eGFR decline of ≥5 ml/min per year, ≥40% sustained decline, or kidney failure within 5 years. RESULTS: In 1146 patients, the median age was 63 years, 51% were female, the baseline eGFR was 54 ml min(−1) [1.73 m](−2), the urine albumin to creatinine ratio (uACR) was 6.9 mg/mmol, follow-up was 4.3 years and 21% had the composite endpoint. On cross-validation in derivation (n = 686), KidneyIntelX had an AUC of 0.77 (95% CI 0.74, 0.79). In validation (n = 460), the AUC was 0.77 (95% CI 0.76, 0.79). By comparison, the AUC for the clinical model was 0.62 (95% CI 0.61, 0.63) in derivation and 0.61 (95% CI 0.60, 0.63) in validation. Using derivation cut-offs, KidneyIntelX stratified 46%, 37% and 17% of the validation cohort into low-, intermediate- and high-risk groups for the composite kidney endpoint, respectively. The PPV for progressive decline in kidney function in the high-risk group was 61% for KidneyIntelX vs 40% for the highest risk strata by KDIGO categorisation (p < 0.001). Only 10% of those scored as low risk by KidneyIntelX experienced progression (i.e., NPV of 90%). The NRI(event) for the high-risk group was 41% (p < 0.05). CONCLUSIONS: KidneyIntelX improved prediction of kidney outcomes over KDIGO and clinical models in individuals with early stages of DKD. GRAPHICAL ABSTRACT: [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains peer-reviewed but unedited supplementary material available at 10.1007/s00125-021-05444-0.
format Online
Article
Text
id pubmed-8187208
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-81872082021-06-11 Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease Chan, Lili Nadkarni, Girish N. Fleming, Fergus McCullough, James R. Connolly, Patricia Mosoyan, Gohar El Salem, Fadi Kattan, Michael W. Vassalotti, Joseph A. Murphy, Barbara Donovan, Michael J. Coca, Steven G. Damrauer, Scott M. Diabetologia Article AIM: Predicting progression in diabetic kidney disease (DKD) is critical to improving outcomes. We sought to develop/validate a machine-learned, prognostic risk score (KidneyIntelX™) combining electronic health records (EHR) and biomarkers. METHODS: This is an observational cohort study of patients with prevalent DKD/banked plasma from two EHR-linked biobanks. A random forest model was trained, and performance (AUC, positive and negative predictive values [PPV/NPV], and net reclassification index [NRI]) was compared with that of a clinical model and Kidney Disease: Improving Global Outcomes (KDIGO) categories for predicting a composite outcome of eGFR decline of ≥5 ml/min per year, ≥40% sustained decline, or kidney failure within 5 years. RESULTS: In 1146 patients, the median age was 63 years, 51% were female, the baseline eGFR was 54 ml min(−1) [1.73 m](−2), the urine albumin to creatinine ratio (uACR) was 6.9 mg/mmol, follow-up was 4.3 years and 21% had the composite endpoint. On cross-validation in derivation (n = 686), KidneyIntelX had an AUC of 0.77 (95% CI 0.74, 0.79). In validation (n = 460), the AUC was 0.77 (95% CI 0.76, 0.79). By comparison, the AUC for the clinical model was 0.62 (95% CI 0.61, 0.63) in derivation and 0.61 (95% CI 0.60, 0.63) in validation. Using derivation cut-offs, KidneyIntelX stratified 46%, 37% and 17% of the validation cohort into low-, intermediate- and high-risk groups for the composite kidney endpoint, respectively. The PPV for progressive decline in kidney function in the high-risk group was 61% for KidneyIntelX vs 40% for the highest risk strata by KDIGO categorisation (p < 0.001). Only 10% of those scored as low risk by KidneyIntelX experienced progression (i.e., NPV of 90%). The NRI(event) for the high-risk group was 41% (p < 0.05). CONCLUSIONS: KidneyIntelX improved prediction of kidney outcomes over KDIGO and clinical models in individuals with early stages of DKD. GRAPHICAL ABSTRACT: [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains peer-reviewed but unedited supplementary material available at 10.1007/s00125-021-05444-0. Springer Berlin Heidelberg 2021-04-02 2021 /pmc/articles/PMC8187208/ /pubmed/33797560 http://dx.doi.org/10.1007/s00125-021-05444-0 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Chan, Lili
Nadkarni, Girish N.
Fleming, Fergus
McCullough, James R.
Connolly, Patricia
Mosoyan, Gohar
El Salem, Fadi
Kattan, Michael W.
Vassalotti, Joseph A.
Murphy, Barbara
Donovan, Michael J.
Coca, Steven G.
Damrauer, Scott M.
Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease
title Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease
title_full Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease
title_fullStr Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease
title_full_unstemmed Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease
title_short Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease
title_sort derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8187208/
https://www.ncbi.nlm.nih.gov/pubmed/33797560
http://dx.doi.org/10.1007/s00125-021-05444-0
work_keys_str_mv AT chanlili derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease
AT nadkarnigirishn derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease
AT flemingfergus derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease
AT mcculloughjamesr derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease
AT connollypatricia derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease
AT mosoyangohar derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease
AT elsalemfadi derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease
AT kattanmichaelw derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease
AT vassalottijosepha derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease
AT murphybarbara derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease
AT donovanmichaelj derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease
AT cocasteveng derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease
AT damrauerscottm derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease