Cargando…
Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease
AIM: Predicting progression in diabetic kidney disease (DKD) is critical to improving outcomes. We sought to develop/validate a machine-learned, prognostic risk score (KidneyIntelX™) combining electronic health records (EHR) and biomarkers. METHODS: This is an observational cohort study of patients...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8187208/ https://www.ncbi.nlm.nih.gov/pubmed/33797560 http://dx.doi.org/10.1007/s00125-021-05444-0 |
_version_ | 1783705097567469568 |
---|---|
author | Chan, Lili Nadkarni, Girish N. Fleming, Fergus McCullough, James R. Connolly, Patricia Mosoyan, Gohar El Salem, Fadi Kattan, Michael W. Vassalotti, Joseph A. Murphy, Barbara Donovan, Michael J. Coca, Steven G. Damrauer, Scott M. |
author_facet | Chan, Lili Nadkarni, Girish N. Fleming, Fergus McCullough, James R. Connolly, Patricia Mosoyan, Gohar El Salem, Fadi Kattan, Michael W. Vassalotti, Joseph A. Murphy, Barbara Donovan, Michael J. Coca, Steven G. Damrauer, Scott M. |
author_sort | Chan, Lili |
collection | PubMed |
description | AIM: Predicting progression in diabetic kidney disease (DKD) is critical to improving outcomes. We sought to develop/validate a machine-learned, prognostic risk score (KidneyIntelX™) combining electronic health records (EHR) and biomarkers. METHODS: This is an observational cohort study of patients with prevalent DKD/banked plasma from two EHR-linked biobanks. A random forest model was trained, and performance (AUC, positive and negative predictive values [PPV/NPV], and net reclassification index [NRI]) was compared with that of a clinical model and Kidney Disease: Improving Global Outcomes (KDIGO) categories for predicting a composite outcome of eGFR decline of ≥5 ml/min per year, ≥40% sustained decline, or kidney failure within 5 years. RESULTS: In 1146 patients, the median age was 63 years, 51% were female, the baseline eGFR was 54 ml min(−1) [1.73 m](−2), the urine albumin to creatinine ratio (uACR) was 6.9 mg/mmol, follow-up was 4.3 years and 21% had the composite endpoint. On cross-validation in derivation (n = 686), KidneyIntelX had an AUC of 0.77 (95% CI 0.74, 0.79). In validation (n = 460), the AUC was 0.77 (95% CI 0.76, 0.79). By comparison, the AUC for the clinical model was 0.62 (95% CI 0.61, 0.63) in derivation and 0.61 (95% CI 0.60, 0.63) in validation. Using derivation cut-offs, KidneyIntelX stratified 46%, 37% and 17% of the validation cohort into low-, intermediate- and high-risk groups for the composite kidney endpoint, respectively. The PPV for progressive decline in kidney function in the high-risk group was 61% for KidneyIntelX vs 40% for the highest risk strata by KDIGO categorisation (p < 0.001). Only 10% of those scored as low risk by KidneyIntelX experienced progression (i.e., NPV of 90%). The NRI(event) for the high-risk group was 41% (p < 0.05). CONCLUSIONS: KidneyIntelX improved prediction of kidney outcomes over KDIGO and clinical models in individuals with early stages of DKD. GRAPHICAL ABSTRACT: [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains peer-reviewed but unedited supplementary material available at 10.1007/s00125-021-05444-0. |
format | Online Article Text |
id | pubmed-8187208 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-81872082021-06-11 Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease Chan, Lili Nadkarni, Girish N. Fleming, Fergus McCullough, James R. Connolly, Patricia Mosoyan, Gohar El Salem, Fadi Kattan, Michael W. Vassalotti, Joseph A. Murphy, Barbara Donovan, Michael J. Coca, Steven G. Damrauer, Scott M. Diabetologia Article AIM: Predicting progression in diabetic kidney disease (DKD) is critical to improving outcomes. We sought to develop/validate a machine-learned, prognostic risk score (KidneyIntelX™) combining electronic health records (EHR) and biomarkers. METHODS: This is an observational cohort study of patients with prevalent DKD/banked plasma from two EHR-linked biobanks. A random forest model was trained, and performance (AUC, positive and negative predictive values [PPV/NPV], and net reclassification index [NRI]) was compared with that of a clinical model and Kidney Disease: Improving Global Outcomes (KDIGO) categories for predicting a composite outcome of eGFR decline of ≥5 ml/min per year, ≥40% sustained decline, or kidney failure within 5 years. RESULTS: In 1146 patients, the median age was 63 years, 51% were female, the baseline eGFR was 54 ml min(−1) [1.73 m](−2), the urine albumin to creatinine ratio (uACR) was 6.9 mg/mmol, follow-up was 4.3 years and 21% had the composite endpoint. On cross-validation in derivation (n = 686), KidneyIntelX had an AUC of 0.77 (95% CI 0.74, 0.79). In validation (n = 460), the AUC was 0.77 (95% CI 0.76, 0.79). By comparison, the AUC for the clinical model was 0.62 (95% CI 0.61, 0.63) in derivation and 0.61 (95% CI 0.60, 0.63) in validation. Using derivation cut-offs, KidneyIntelX stratified 46%, 37% and 17% of the validation cohort into low-, intermediate- and high-risk groups for the composite kidney endpoint, respectively. The PPV for progressive decline in kidney function in the high-risk group was 61% for KidneyIntelX vs 40% for the highest risk strata by KDIGO categorisation (p < 0.001). Only 10% of those scored as low risk by KidneyIntelX experienced progression (i.e., NPV of 90%). The NRI(event) for the high-risk group was 41% (p < 0.05). CONCLUSIONS: KidneyIntelX improved prediction of kidney outcomes over KDIGO and clinical models in individuals with early stages of DKD. GRAPHICAL ABSTRACT: [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains peer-reviewed but unedited supplementary material available at 10.1007/s00125-021-05444-0. Springer Berlin Heidelberg 2021-04-02 2021 /pmc/articles/PMC8187208/ /pubmed/33797560 http://dx.doi.org/10.1007/s00125-021-05444-0 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Chan, Lili Nadkarni, Girish N. Fleming, Fergus McCullough, James R. Connolly, Patricia Mosoyan, Gohar El Salem, Fadi Kattan, Michael W. Vassalotti, Joseph A. Murphy, Barbara Donovan, Michael J. Coca, Steven G. Damrauer, Scott M. Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease |
title | Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease |
title_full | Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease |
title_fullStr | Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease |
title_full_unstemmed | Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease |
title_short | Derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease |
title_sort | derivation and validation of a machine learning risk score using biomarker and electronic patient data to predict progression of diabetic kidney disease |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8187208/ https://www.ncbi.nlm.nih.gov/pubmed/33797560 http://dx.doi.org/10.1007/s00125-021-05444-0 |
work_keys_str_mv | AT chanlili derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease AT nadkarnigirishn derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease AT flemingfergus derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease AT mcculloughjamesr derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease AT connollypatricia derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease AT mosoyangohar derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease AT elsalemfadi derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease AT kattanmichaelw derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease AT vassalottijosepha derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease AT murphybarbara derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease AT donovanmichaelj derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease AT cocasteveng derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease AT damrauerscottm derivationandvalidationofamachinelearningriskscoreusingbiomarkerandelectronicpatientdatatopredictprogressionofdiabetickidneydisease |