Cargando…

Machine learning clinical prediction models for acute kidney injury: the impact of baseline creatinine on prediction efficacy

BACKGROUND: There are many Machine Learning (ML) models which predict acute kidney injury (AKI) for hospitalised patients. While a primary goal of these models is to support clinical decision-making, the adoption of inconsistent methods of estimating baseline serum creatinine (sCr) may result in a p...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kamel Rahimi, Amir, Ghadimi, Moji, van der Vegt, Anton H., Canfell, Oliver J., Pole, Jason D., Sullivan, Clair, Shrapnel, Sally
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2023
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10563357/ https://www.ncbi.nlm.nih.gov/pubmed/37814311 http://dx.doi.org/10.1186/s12911-023-02306-0

_version_	1785118323744178176
author	Kamel Rahimi, Amir Ghadimi, Moji van der Vegt, Anton H. Canfell, Oliver J. Pole, Jason D. Sullivan, Clair Shrapnel, Sally
author_facet	Kamel Rahimi, Amir Ghadimi, Moji van der Vegt, Anton H. Canfell, Oliver J. Pole, Jason D. Sullivan, Clair Shrapnel, Sally
author_sort	Kamel Rahimi, Amir
collection	PubMed
description	BACKGROUND: There are many Machine Learning (ML) models which predict acute kidney injury (AKI) for hospitalised patients. While a primary goal of these models is to support clinical decision-making, the adoption of inconsistent methods of estimating baseline serum creatinine (sCr) may result in a poor understanding of these models’ effectiveness in clinical practice. Until now, the performance of such models with different baselines has not been compared on a single dataset. Additionally, AKI prediction models are known to have a high rate of false positive (FP) events regardless of baseline methods. This warrants further exploration of FP events to provide insight into potential underlying reasons. OBJECTIVE: The first aim of this study was to assess the variance in performance of ML models using three methods of baseline sCr on a retrospective dataset. The second aim was to conduct an error analysis to gain insight into the underlying factors contributing to FP events. MATERIALS AND METHODS: The Intensive Care Unit (ICU) patients of the Medical Information Mart for Intensive Care (MIMIC)-IV dataset was used with the KDIGO (Kidney Disease Improving Global Outcome) definition to identify AKI episodes. Three different methods of estimating baseline sCr were defined as (1) the minimum sCr, (2) the Modification of Diet in Renal Disease (MDRD) equation and the minimum sCr and (3) the MDRD equation and the mean of preadmission sCr. For the first aim of this study, a suite of ML models was developed for each baseline and the performance of the models was assessed. An analysis of variance was performed to assess the significant difference between eXtreme Gradient Boosting (XGB) models across all baselines. To address the second aim, Explainable AI (XAI) methods were used to analyse the XGB errors with Baseline 3. RESULTS: Regarding the first aim, we observed variances in discriminative metrics and calibration errors of ML models when different baseline methods were adopted. Using Baseline 1 resulted in a 14% reduction in the f1 score for both Baseline 2 and Baseline 3. There was no significant difference observed in the results between Baseline 2 and Baseline 3. For the second aim, the FP cohort was analysed using the XAI methods which led to relabelling data with the mean of sCr in 180 to 0 days pre-ICU as the preferred sCr baseline method. The XGB model using this relabelled data achieved an AUC of 0.85, recall of 0.63, precision of 0.54 and f1 score of 0.58. The cohort size was 31,586 admissions, of which 5,473 (17.32%) had AKI. CONCLUSION: In the absence of a widely accepted method of baseline sCr, AKI prediction studies need to consider the impact of different baseline methods on the effectiveness of ML models and their potential implications in real-world implementations. The utilisation of XAI methods can be effective in providing insight into the occurrence of prediction errors. This can potentially augment the success rate of ML implementation in routine care. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02306-0.
format	Online Article Text
id	pubmed-10563357
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-105633572023-10-11 Machine learning clinical prediction models for acute kidney injury: the impact of baseline creatinine on prediction efficacy Kamel Rahimi, Amir Ghadimi, Moji van der Vegt, Anton H. Canfell, Oliver J. Pole, Jason D. Sullivan, Clair Shrapnel, Sally BMC Med Inform Decis Mak Research BACKGROUND: There are many Machine Learning (ML) models which predict acute kidney injury (AKI) for hospitalised patients. While a primary goal of these models is to support clinical decision-making, the adoption of inconsistent methods of estimating baseline serum creatinine (sCr) may result in a poor understanding of these models’ effectiveness in clinical practice. Until now, the performance of such models with different baselines has not been compared on a single dataset. Additionally, AKI prediction models are known to have a high rate of false positive (FP) events regardless of baseline methods. This warrants further exploration of FP events to provide insight into potential underlying reasons. OBJECTIVE: The first aim of this study was to assess the variance in performance of ML models using three methods of baseline sCr on a retrospective dataset. The second aim was to conduct an error analysis to gain insight into the underlying factors contributing to FP events. MATERIALS AND METHODS: The Intensive Care Unit (ICU) patients of the Medical Information Mart for Intensive Care (MIMIC)-IV dataset was used with the KDIGO (Kidney Disease Improving Global Outcome) definition to identify AKI episodes. Three different methods of estimating baseline sCr were defined as (1) the minimum sCr, (2) the Modification of Diet in Renal Disease (MDRD) equation and the minimum sCr and (3) the MDRD equation and the mean of preadmission sCr. For the first aim of this study, a suite of ML models was developed for each baseline and the performance of the models was assessed. An analysis of variance was performed to assess the significant difference between eXtreme Gradient Boosting (XGB) models across all baselines. To address the second aim, Explainable AI (XAI) methods were used to analyse the XGB errors with Baseline 3. RESULTS: Regarding the first aim, we observed variances in discriminative metrics and calibration errors of ML models when different baseline methods were adopted. Using Baseline 1 resulted in a 14% reduction in the f1 score for both Baseline 2 and Baseline 3. There was no significant difference observed in the results between Baseline 2 and Baseline 3. For the second aim, the FP cohort was analysed using the XAI methods which led to relabelling data with the mean of sCr in 180 to 0 days pre-ICU as the preferred sCr baseline method. The XGB model using this relabelled data achieved an AUC of 0.85, recall of 0.63, precision of 0.54 and f1 score of 0.58. The cohort size was 31,586 admissions, of which 5,473 (17.32%) had AKI. CONCLUSION: In the absence of a widely accepted method of baseline sCr, AKI prediction studies need to consider the impact of different baseline methods on the effectiveness of ML models and their potential implications in real-world implementations. The utilisation of XAI methods can be effective in providing insight into the occurrence of prediction errors. This can potentially augment the success rate of ML implementation in routine care. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02306-0. BioMed Central 2023-10-09 /pmc/articles/PMC10563357/ /pubmed/37814311 http://dx.doi.org/10.1186/s12911-023-02306-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Kamel Rahimi, Amir Ghadimi, Moji van der Vegt, Anton H. Canfell, Oliver J. Pole, Jason D. Sullivan, Clair Shrapnel, Sally Machine learning clinical prediction models for acute kidney injury: the impact of baseline creatinine on prediction efficacy
title	Machine learning clinical prediction models for acute kidney injury: the impact of baseline creatinine on prediction efficacy
title_full	Machine learning clinical prediction models for acute kidney injury: the impact of baseline creatinine on prediction efficacy
title_fullStr	Machine learning clinical prediction models for acute kidney injury: the impact of baseline creatinine on prediction efficacy
title_full_unstemmed	Machine learning clinical prediction models for acute kidney injury: the impact of baseline creatinine on prediction efficacy
title_short	Machine learning clinical prediction models for acute kidney injury: the impact of baseline creatinine on prediction efficacy
title_sort	machine learning clinical prediction models for acute kidney injury: the impact of baseline creatinine on prediction efficacy
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10563357/ https://www.ncbi.nlm.nih.gov/pubmed/37814311 http://dx.doi.org/10.1186/s12911-023-02306-0
work_keys_str_mv	AT kamelrahimiamir machinelearningclinicalpredictionmodelsforacutekidneyinjurytheimpactofbaselinecreatinineonpredictionefficacy AT ghadimimoji machinelearningclinicalpredictionmodelsforacutekidneyinjurytheimpactofbaselinecreatinineonpredictionefficacy AT vandervegtantonh machinelearningclinicalpredictionmodelsforacutekidneyinjurytheimpactofbaselinecreatinineonpredictionefficacy AT canfelloliverj machinelearningclinicalpredictionmodelsforacutekidneyinjurytheimpactofbaselinecreatinineonpredictionefficacy AT polejasond machinelearningclinicalpredictionmodelsforacutekidneyinjurytheimpactofbaselinecreatinineonpredictionefficacy AT sullivanclair machinelearningclinicalpredictionmodelsforacutekidneyinjurytheimpactofbaselinecreatinineonpredictionefficacy AT shrapnelsally machinelearningclinicalpredictionmodelsforacutekidneyinjurytheimpactofbaselinecreatinineonpredictionefficacy

Machine learning clinical prediction models for acute kidney injury: the impact of baseline creatinine on prediction efficacy

Ejemplares similares