Cargando…

Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU

BACKGROUND: – Subgroup discovery (SGD) is the automated splitting of the data into complex subgroups. Various SGD methods have been applied to the medical domain, but none have been extensively evaluated. We assess the numerical and clinical quality of SGD methods. METHOD: – We applied the improved...

Descripción completa

Detalles Bibliográficos
Autores principales: Vagliano, I., Kingma, M.Y., Dongelmans, D.A., de Lange, D.W., de Keizer, N.F., Schut, M.C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Author(s). Published by Elsevier Ltd. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10266884/
https://www.ncbi.nlm.nih.gov/pubmed/37356293
http://dx.doi.org/10.1016/j.compbiomed.2023.107146
_version_ 1785058827312300032
author Vagliano, I.
Kingma, M.Y.
Dongelmans, D.A.
de Lange, D.W.
de Keizer, N.F.
Schut, M.C.
author_facet Vagliano, I.
Kingma, M.Y.
Dongelmans, D.A.
de Lange, D.W.
de Keizer, N.F.
Schut, M.C.
author_sort Vagliano, I.
collection PubMed
description BACKGROUND: – Subgroup discovery (SGD) is the automated splitting of the data into complex subgroups. Various SGD methods have been applied to the medical domain, but none have been extensively evaluated. We assess the numerical and clinical quality of SGD methods. METHOD: – We applied the improved Subgroup Set Discovery (SSD++), Patient Rule Induction Method (PRIM) and APRIORI – Subgroup Discovery (APRIORI-SD) algorithms to obtain patient subgroups on observational data of 14,548 COVID-19 patients admitted to 73 Dutch intensive care units. Hospital mortality was the clinical outcome. Numerical significance of the subgroups was assessed with information-theoretic measures. Clinical significance of the subgroups was assessed by comparing variable importance on population and subgroup levels and by expert evaluation. RESULTS: – The tested algorithms varied widely in the total number of discovered subgroups (5-62), the number of selected variables, and the predictive value of the subgroups. Qualitative assessment showed that the found subgroups make clinical sense. SSD++ found most subgroups (n = 62), which added predictive value and generally showed high potential for clinical use. APRIORI-SD and PRIM found fewer subgroups (n = 5 and 6), which did not add predictive value and were clinically less relevant. CONCLUSION: – Automated SGD methods find clinical subgroups that are relevant when assessed quantitatively (yield added predictive value) and qualitatively (intensivists consider the subgroups significant). Different methods yield different subgroups with varying degrees of predictive performance and clinical quality. External validation is needed to generalize the results to other populations and future research should explore which algorithm performs best in other settings.
format Online
Article
Text
id pubmed-10266884
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Author(s). Published by Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-102668842023-06-15 Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU Vagliano, I. Kingma, M.Y. Dongelmans, D.A. de Lange, D.W. de Keizer, N.F. Schut, M.C. Comput Biol Med Article BACKGROUND: – Subgroup discovery (SGD) is the automated splitting of the data into complex subgroups. Various SGD methods have been applied to the medical domain, but none have been extensively evaluated. We assess the numerical and clinical quality of SGD methods. METHOD: – We applied the improved Subgroup Set Discovery (SSD++), Patient Rule Induction Method (PRIM) and APRIORI – Subgroup Discovery (APRIORI-SD) algorithms to obtain patient subgroups on observational data of 14,548 COVID-19 patients admitted to 73 Dutch intensive care units. Hospital mortality was the clinical outcome. Numerical significance of the subgroups was assessed with information-theoretic measures. Clinical significance of the subgroups was assessed by comparing variable importance on population and subgroup levels and by expert evaluation. RESULTS: – The tested algorithms varied widely in the total number of discovered subgroups (5-62), the number of selected variables, and the predictive value of the subgroups. Qualitative assessment showed that the found subgroups make clinical sense. SSD++ found most subgroups (n = 62), which added predictive value and generally showed high potential for clinical use. APRIORI-SD and PRIM found fewer subgroups (n = 5 and 6), which did not add predictive value and were clinically less relevant. CONCLUSION: – Automated SGD methods find clinical subgroups that are relevant when assessed quantitatively (yield added predictive value) and qualitatively (intensivists consider the subgroups significant). Different methods yield different subgroups with varying degrees of predictive performance and clinical quality. External validation is needed to generalize the results to other populations and future research should explore which algorithm performs best in other settings. The Author(s). Published by Elsevier Ltd. 2023-09 2023-06-15 /pmc/articles/PMC10266884/ /pubmed/37356293 http://dx.doi.org/10.1016/j.compbiomed.2023.107146 Text en © 2023 The Author(s) Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Vagliano, I.
Kingma, M.Y.
Dongelmans, D.A.
de Lange, D.W.
de Keizer, N.F.
Schut, M.C.
Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU
title Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU
title_full Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU
title_fullStr Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU
title_full_unstemmed Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU
title_short Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU
title_sort automated identification of patient subgroups: a case-study on mortality of covid-19 patients admitted to the icu
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10266884/
https://www.ncbi.nlm.nih.gov/pubmed/37356293
http://dx.doi.org/10.1016/j.compbiomed.2023.107146
work_keys_str_mv AT vaglianoi automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu
AT kingmamy automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu
AT dongelmansda automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu
AT delangedw automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu
AT dekeizernf automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu
AT schutmc automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu
AT automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu