Cargando…

Implications of resampling data to address the class imbalance problem (IRCIP): an evaluation of impact on performance between classification algorithms in medical data

OBJECTIVE: When correcting for the “class imbalance” problem in medical data, the effects of resampling applied on classifier algorithms remain unclear. We examined the effect on performance over several combinations of classifiers and resampling ratios. MATERIALS AND METHODS: Multiple classificatio...

Descripción completa

Detalles Bibliográficos
Autores principales:	Welvaars, Koen, Oosterhoff, Jacobien H F, van den Bekerom, Michel P J, Doornberg, Job N, van Haarst, Ernst P
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2023
Materias:	Research and Applications
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10232287/ https://www.ncbi.nlm.nih.gov/pubmed/37266187 http://dx.doi.org/10.1093/jamiaopen/ooad033

_version_	1785051940554539008
author	Welvaars, Koen Oosterhoff, Jacobien H F van den Bekerom, Michel P J Doornberg, Job N van Haarst, Ernst P
author_facet	Welvaars, Koen Oosterhoff, Jacobien H F van den Bekerom, Michel P J Doornberg, Job N van Haarst, Ernst P
author_sort	Welvaars, Koen
collection	PubMed
description	OBJECTIVE: When correcting for the “class imbalance” problem in medical data, the effects of resampling applied on classifier algorithms remain unclear. We examined the effect on performance over several combinations of classifiers and resampling ratios. MATERIALS AND METHODS: Multiple classification algorithms were trained on 7 resampled datasets: no correction, random undersampling, 4 ratios of Synthetic Minority Oversampling Technique (SMOTE), and random oversampling with the Adaptive Synthetic algorithm (ADASYN). Performance was evaluated in Area Under the Curve (AUC), precision, recall, Brier score, and calibration metrics. A case study on prediction modeling for 30-day unplanned readmissions in previously admitted Urology patients was presented. RESULTS: For most algorithms, using resampled data showed a significant increase in AUC and precision, ranging from 0.74 (CI: 0.69–0.79) to 0.93 (CI: 0.92–0.94), and 0.35 (CI: 0.12–0.58) to 0.86 (CI: 0.81–0.92) respectively. All classification algorithms showed significant increases in recall, and significant decreases in Brier score with distorted calibration overestimating positives. DISCUSSION: Imbalance correction resulted in an overall improved performance, yet poorly calibrated models. There can still be clinical utility due to a strong discriminating performance, specifically when predicting only low and high risk cases is clinically more relevant. CONCLUSION: Resampling data resulted in increased performances in classification algorithms, yet produced an overestimation of positive predictions. Based on the findings from our case study, a thoughtful predefinition of the clinical prediction task may guide the use of resampling techniques in future studies aiming to improve clinical decision support tools.
format	Online Article Text
id	pubmed-10232287
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-102322872023-06-01 Implications of resampling data to address the class imbalance problem (IRCIP): an evaluation of impact on performance between classification algorithms in medical data Welvaars, Koen Oosterhoff, Jacobien H F van den Bekerom, Michel P J Doornberg, Job N van Haarst, Ernst P JAMIA Open Research and Applications OBJECTIVE: When correcting for the “class imbalance” problem in medical data, the effects of resampling applied on classifier algorithms remain unclear. We examined the effect on performance over several combinations of classifiers and resampling ratios. MATERIALS AND METHODS: Multiple classification algorithms were trained on 7 resampled datasets: no correction, random undersampling, 4 ratios of Synthetic Minority Oversampling Technique (SMOTE), and random oversampling with the Adaptive Synthetic algorithm (ADASYN). Performance was evaluated in Area Under the Curve (AUC), precision, recall, Brier score, and calibration metrics. A case study on prediction modeling for 30-day unplanned readmissions in previously admitted Urology patients was presented. RESULTS: For most algorithms, using resampled data showed a significant increase in AUC and precision, ranging from 0.74 (CI: 0.69–0.79) to 0.93 (CI: 0.92–0.94), and 0.35 (CI: 0.12–0.58) to 0.86 (CI: 0.81–0.92) respectively. All classification algorithms showed significant increases in recall, and significant decreases in Brier score with distorted calibration overestimating positives. DISCUSSION: Imbalance correction resulted in an overall improved performance, yet poorly calibrated models. There can still be clinical utility due to a strong discriminating performance, specifically when predicting only low and high risk cases is clinically more relevant. CONCLUSION: Resampling data resulted in increased performances in classification algorithms, yet produced an overestimation of positive predictions. Based on the findings from our case study, a thoughtful predefinition of the clinical prediction task may guide the use of resampling techniques in future studies aiming to improve clinical decision support tools. Oxford University Press 2023-05-31 /pmc/articles/PMC10232287/ /pubmed/37266187 http://dx.doi.org/10.1093/jamiaopen/ooad033 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle	Research and Applications Welvaars, Koen Oosterhoff, Jacobien H F van den Bekerom, Michel P J Doornberg, Job N van Haarst, Ernst P Implications of resampling data to address the class imbalance problem (IRCIP): an evaluation of impact on performance between classification algorithms in medical data
title	Implications of resampling data to address the class imbalance problem (IRCIP): an evaluation of impact on performance between classification algorithms in medical data
title_full	Implications of resampling data to address the class imbalance problem (IRCIP): an evaluation of impact on performance between classification algorithms in medical data
title_fullStr	Implications of resampling data to address the class imbalance problem (IRCIP): an evaluation of impact on performance between classification algorithms in medical data
title_full_unstemmed	Implications of resampling data to address the class imbalance problem (IRCIP): an evaluation of impact on performance between classification algorithms in medical data
title_short	Implications of resampling data to address the class imbalance problem (IRCIP): an evaluation of impact on performance between classification algorithms in medical data
title_sort	implications of resampling data to address the class imbalance problem (ircip): an evaluation of impact on performance between classification algorithms in medical data
topic	Research and Applications
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10232287/ https://www.ncbi.nlm.nih.gov/pubmed/37266187 http://dx.doi.org/10.1093/jamiaopen/ooad033
work_keys_str_mv	AT welvaarskoen implicationsofresamplingdatatoaddresstheclassimbalanceproblemircipanevaluationofimpactonperformancebetweenclassificationalgorithmsinmedicaldata AT oosterhoffjacobienhf implicationsofresamplingdatatoaddresstheclassimbalanceproblemircipanevaluationofimpactonperformancebetweenclassificationalgorithmsinmedicaldata AT vandenbekerommichelpj implicationsofresamplingdatatoaddresstheclassimbalanceproblemircipanevaluationofimpactonperformancebetweenclassificationalgorithmsinmedicaldata AT doornbergjobn implicationsofresamplingdatatoaddresstheclassimbalanceproblemircipanevaluationofimpactonperformancebetweenclassificationalgorithmsinmedicaldata AT vanhaarsternstp implicationsofresamplingdatatoaddresstheclassimbalanceproblemircipanevaluationofimpactonperformancebetweenclassificationalgorithmsinmedicaldata AT implicationsofresamplingdatatoaddresstheclassimbalanceproblemircipanevaluationofimpactonperformancebetweenclassificationalgorithmsinmedicaldata

Implications of resampling data to address the class imbalance problem (IRCIP): an evaluation of impact on performance between classification algorithms in medical data

Ejemplares similares