Cargando…
Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features
OBJECTIVE: To assess the accuracy of machine learning models in predicting kidney stone recurrence using variables extracted from the electronic health record (EHR). METHODS: We trained three separate machine learning (ML) models (least absolute shrinkage and selection operator regression [LASSO], r...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Journal Experts
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10350114/ https://www.ncbi.nlm.nih.gov/pubmed/37461654 http://dx.doi.org/10.21203/rs.3.rs-3107998/v1 |
_version_ | 1785074062023720960 |
---|---|
author | Doyle, Patrick Gong, Wu Hsi, Ryan Kavoussi, Nicholas |
author_facet | Doyle, Patrick Gong, Wu Hsi, Ryan Kavoussi, Nicholas |
author_sort | Doyle, Patrick |
collection | PubMed |
description | OBJECTIVE: To assess the accuracy of machine learning models in predicting kidney stone recurrence using variables extracted from the electronic health record (EHR). METHODS: We trained three separate machine learning (ML) models (least absolute shrinkage and selection operator regression [LASSO], random forest [RF], and gradient boosted decision tree [XGBoost] to predict 2-year and 5-year symptomatic kidney stone recurrence from electronic health-record (EHR) derived features and 24H urine data (n = 1231). ML models were compared to logistic regression [LR]. A manual, retrospective review was performed to evaluate for a symptomatic stone event, defined as pain, acute kidney injury or recurrent infections attributed to a kidney stone identified in the clinic or the emergency department, or for any stone requiring surgical treatment. We evaluated performance using area under the receiver operating curve (AUC-ROC) and identified important features for each model. RESULTS: The 2- and 5- year symptomatic stone recurrence rates were 25% and 31%, respectively. The LASSO model performed best for symptomatic stone recurrence prediction (2-yr AUC: 0.62, 5-yr AUC: 0.63). Other models demonstrated modest overall performance at 2- and 5-years: LR (0.585, 0.618), RF (0.570, 0.608), and XGBoost (0.580, 0.621). Patient age was the only feature in the top 5 features of every model. Additionally, the LASSO model prioritized BMI and history of gout for prediction. CONCLUSIONS: Throughout our cohorts, ML models demonstrated comparable results to that of LR, with the LASSO model outperforming all other models. Further model testing should evaluate the utility of 24H urine features in model structure. |
format | Online Article Text |
id | pubmed-10350114 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | American Journal Experts |
record_format | MEDLINE/PubMed |
spelling | pubmed-103501142023-07-17 Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features Doyle, Patrick Gong, Wu Hsi, Ryan Kavoussi, Nicholas Res Sq Article OBJECTIVE: To assess the accuracy of machine learning models in predicting kidney stone recurrence using variables extracted from the electronic health record (EHR). METHODS: We trained three separate machine learning (ML) models (least absolute shrinkage and selection operator regression [LASSO], random forest [RF], and gradient boosted decision tree [XGBoost] to predict 2-year and 5-year symptomatic kidney stone recurrence from electronic health-record (EHR) derived features and 24H urine data (n = 1231). ML models were compared to logistic regression [LR]. A manual, retrospective review was performed to evaluate for a symptomatic stone event, defined as pain, acute kidney injury or recurrent infections attributed to a kidney stone identified in the clinic or the emergency department, or for any stone requiring surgical treatment. We evaluated performance using area under the receiver operating curve (AUC-ROC) and identified important features for each model. RESULTS: The 2- and 5- year symptomatic stone recurrence rates were 25% and 31%, respectively. The LASSO model performed best for symptomatic stone recurrence prediction (2-yr AUC: 0.62, 5-yr AUC: 0.63). Other models demonstrated modest overall performance at 2- and 5-years: LR (0.585, 0.618), RF (0.570, 0.608), and XGBoost (0.580, 0.621). Patient age was the only feature in the top 5 features of every model. Additionally, the LASSO model prioritized BMI and history of gout for prediction. CONCLUSIONS: Throughout our cohorts, ML models demonstrated comparable results to that of LR, with the LASSO model outperforming all other models. Further model testing should evaluate the utility of 24H urine features in model structure. American Journal Experts 2023-06-29 /pmc/articles/PMC10350114/ /pubmed/37461654 http://dx.doi.org/10.21203/rs.3.rs-3107998/v1 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Doyle, Patrick Gong, Wu Hsi, Ryan Kavoussi, Nicholas Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features |
title | Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features |
title_full | Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features |
title_fullStr | Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features |
title_full_unstemmed | Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features |
title_short | Machine Learning Models to Predict Kidney Stone Recurrence Using 24 Hour Urine Testing and Electronic Health Record-Derived Features |
title_sort | machine learning models to predict kidney stone recurrence using 24 hour urine testing and electronic health record-derived features |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10350114/ https://www.ncbi.nlm.nih.gov/pubmed/37461654 http://dx.doi.org/10.21203/rs.3.rs-3107998/v1 |
work_keys_str_mv | AT doylepatrick machinelearningmodelstopredictkidneystonerecurrenceusing24hoururinetestingandelectronichealthrecordderivedfeatures AT gongwu machinelearningmodelstopredictkidneystonerecurrenceusing24hoururinetestingandelectronichealthrecordderivedfeatures AT hsiryan machinelearningmodelstopredictkidneystonerecurrenceusing24hoururinetestingandelectronichealthrecordderivedfeatures AT kavoussinicholas machinelearningmodelstopredictkidneystonerecurrenceusing24hoururinetestingandelectronichealthrecordderivedfeatures |