Cargando…

Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features

OBJECTIVE: To assess the accuracy of machine learning models in predicting kidney stone recurrence using variables extracted from the electronic health record (EHR). METHODS: We trained three separate machine learning (ML) models (least absolute shrinkage and selection operator regression [LASSO], r...

Descripción completa

Detalles Bibliográficos
Autores principales: Doyle, Patrick, Gong, Wu, Hsi, Ryan, Kavoussi, Nicholas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Journal Experts 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10350114/
https://www.ncbi.nlm.nih.gov/pubmed/37461654
http://dx.doi.org/10.21203/rs.3.rs-3107998/v1
_version_ 1785074062023720960
author Doyle, Patrick
Gong, Wu
Hsi, Ryan
Kavoussi, Nicholas
author_facet Doyle, Patrick
Gong, Wu
Hsi, Ryan
Kavoussi, Nicholas
author_sort Doyle, Patrick
collection PubMed
description OBJECTIVE: To assess the accuracy of machine learning models in predicting kidney stone recurrence using variables extracted from the electronic health record (EHR). METHODS: We trained three separate machine learning (ML) models (least absolute shrinkage and selection operator regression [LASSO], random forest [RF], and gradient boosted decision tree [XGBoost] to predict 2-year and 5-year symptomatic kidney stone recurrence from electronic health-record (EHR) derived features and 24H urine data (n = 1231). ML models were compared to logistic regression [LR]. A manual, retrospective review was performed to evaluate for a symptomatic stone event, defined as pain, acute kidney injury or recurrent infections attributed to a kidney stone identified in the clinic or the emergency department, or for any stone requiring surgical treatment. We evaluated performance using area under the receiver operating curve (AUC-ROC) and identified important features for each model. RESULTS: The 2- and 5- year symptomatic stone recurrence rates were 25% and 31%, respectively. The LASSO model performed best for symptomatic stone recurrence prediction (2-yr AUC: 0.62, 5-yr AUC: 0.63). Other models demonstrated modest overall performance at 2- and 5-years: LR (0.585, 0.618), RF (0.570, 0.608), and XGBoost (0.580, 0.621). Patient age was the only feature in the top 5 features of every model. Additionally, the LASSO model prioritized BMI and history of gout for prediction. CONCLUSIONS: Throughout our cohorts, ML models demonstrated comparable results to that of LR, with the LASSO model outperforming all other models. Further model testing should evaluate the utility of 24H urine features in model structure.
format Online
Article
Text
id pubmed-10350114
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Journal Experts
record_format MEDLINE/PubMed
spelling pubmed-103501142023-07-17 Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features Doyle, Patrick Gong, Wu Hsi, Ryan Kavoussi, Nicholas Res Sq Article OBJECTIVE: To assess the accuracy of machine learning models in predicting kidney stone recurrence using variables extracted from the electronic health record (EHR). METHODS: We trained three separate machine learning (ML) models (least absolute shrinkage and selection operator regression [LASSO], random forest [RF], and gradient boosted decision tree [XGBoost] to predict 2-year and 5-year symptomatic kidney stone recurrence from electronic health-record (EHR) derived features and 24H urine data (n = 1231). ML models were compared to logistic regression [LR]. A manual, retrospective review was performed to evaluate for a symptomatic stone event, defined as pain, acute kidney injury or recurrent infections attributed to a kidney stone identified in the clinic or the emergency department, or for any stone requiring surgical treatment. We evaluated performance using area under the receiver operating curve (AUC-ROC) and identified important features for each model. RESULTS: The 2- and 5- year symptomatic stone recurrence rates were 25% and 31%, respectively. The LASSO model performed best for symptomatic stone recurrence prediction (2-yr AUC: 0.62, 5-yr AUC: 0.63). Other models demonstrated modest overall performance at 2- and 5-years: LR (0.585, 0.618), RF (0.570, 0.608), and XGBoost (0.580, 0.621). Patient age was the only feature in the top 5 features of every model. Additionally, the LASSO model prioritized BMI and history of gout for prediction. CONCLUSIONS: Throughout our cohorts, ML models demonstrated comparable results to that of LR, with the LASSO model outperforming all other models. Further model testing should evaluate the utility of 24H urine features in model structure. American Journal Experts 2023-06-29 /pmc/articles/PMC10350114/ /pubmed/37461654 http://dx.doi.org/10.21203/rs.3.rs-3107998/v1 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Doyle, Patrick
Gong, Wu
Hsi, Ryan
Kavoussi, Nicholas
Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features
title Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features
title_full Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features
title_fullStr Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features
title_full_unstemmed Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features
title_short Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features
title_sort machine learning models to predict kidney stone recurrence using 24 hour urine testing and electronic health record-derived features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10350114/
https://www.ncbi.nlm.nih.gov/pubmed/37461654
http://dx.doi.org/10.21203/rs.3.rs-3107998/v1
work_keys_str_mv AT doylepatrick machinelearningmodelstopredictkidneystonerecurrenceusing24hoururinetestingandelectronichealthrecordderivedfeatures
AT gongwu machinelearningmodelstopredictkidneystonerecurrenceusing24hoururinetestingandelectronichealthrecordderivedfeatures
AT hsiryan machinelearningmodelstopredictkidneystonerecurrenceusing24hoururinetestingandelectronichealthrecordderivedfeatures
AT kavoussinicholas machinelearningmodelstopredictkidneystonerecurrenceusing24hoururinetestingandelectronichealthrecordderivedfeatures