Cargando…
Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU
BACKGROUND: – Subgroup discovery (SGD) is the automated splitting of the data into complex subgroups. Various SGD methods have been applied to the medical domain, but none have been extensively evaluated. We assess the numerical and clinical quality of SGD methods. METHOD: – We applied the improved...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Author(s). Published by Elsevier Ltd.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10266884/ https://www.ncbi.nlm.nih.gov/pubmed/37356293 http://dx.doi.org/10.1016/j.compbiomed.2023.107146 |
_version_ | 1785058827312300032 |
---|---|
author | Vagliano, I. Kingma, M.Y. Dongelmans, D.A. de Lange, D.W. de Keizer, N.F. Schut, M.C. |
author_facet | Vagliano, I. Kingma, M.Y. Dongelmans, D.A. de Lange, D.W. de Keizer, N.F. Schut, M.C. |
author_sort | Vagliano, I. |
collection | PubMed |
description | BACKGROUND: – Subgroup discovery (SGD) is the automated splitting of the data into complex subgroups. Various SGD methods have been applied to the medical domain, but none have been extensively evaluated. We assess the numerical and clinical quality of SGD methods. METHOD: – We applied the improved Subgroup Set Discovery (SSD++), Patient Rule Induction Method (PRIM) and APRIORI – Subgroup Discovery (APRIORI-SD) algorithms to obtain patient subgroups on observational data of 14,548 COVID-19 patients admitted to 73 Dutch intensive care units. Hospital mortality was the clinical outcome. Numerical significance of the subgroups was assessed with information-theoretic measures. Clinical significance of the subgroups was assessed by comparing variable importance on population and subgroup levels and by expert evaluation. RESULTS: – The tested algorithms varied widely in the total number of discovered subgroups (5-62), the number of selected variables, and the predictive value of the subgroups. Qualitative assessment showed that the found subgroups make clinical sense. SSD++ found most subgroups (n = 62), which added predictive value and generally showed high potential for clinical use. APRIORI-SD and PRIM found fewer subgroups (n = 5 and 6), which did not add predictive value and were clinically less relevant. CONCLUSION: – Automated SGD methods find clinical subgroups that are relevant when assessed quantitatively (yield added predictive value) and qualitatively (intensivists consider the subgroups significant). Different methods yield different subgroups with varying degrees of predictive performance and clinical quality. External validation is needed to generalize the results to other populations and future research should explore which algorithm performs best in other settings. |
format | Online Article Text |
id | pubmed-10266884 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | The Author(s). Published by Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-102668842023-06-15 Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU Vagliano, I. Kingma, M.Y. Dongelmans, D.A. de Lange, D.W. de Keizer, N.F. Schut, M.C. Comput Biol Med Article BACKGROUND: – Subgroup discovery (SGD) is the automated splitting of the data into complex subgroups. Various SGD methods have been applied to the medical domain, but none have been extensively evaluated. We assess the numerical and clinical quality of SGD methods. METHOD: – We applied the improved Subgroup Set Discovery (SSD++), Patient Rule Induction Method (PRIM) and APRIORI – Subgroup Discovery (APRIORI-SD) algorithms to obtain patient subgroups on observational data of 14,548 COVID-19 patients admitted to 73 Dutch intensive care units. Hospital mortality was the clinical outcome. Numerical significance of the subgroups was assessed with information-theoretic measures. Clinical significance of the subgroups was assessed by comparing variable importance on population and subgroup levels and by expert evaluation. RESULTS: – The tested algorithms varied widely in the total number of discovered subgroups (5-62), the number of selected variables, and the predictive value of the subgroups. Qualitative assessment showed that the found subgroups make clinical sense. SSD++ found most subgroups (n = 62), which added predictive value and generally showed high potential for clinical use. APRIORI-SD and PRIM found fewer subgroups (n = 5 and 6), which did not add predictive value and were clinically less relevant. CONCLUSION: – Automated SGD methods find clinical subgroups that are relevant when assessed quantitatively (yield added predictive value) and qualitatively (intensivists consider the subgroups significant). Different methods yield different subgroups with varying degrees of predictive performance and clinical quality. External validation is needed to generalize the results to other populations and future research should explore which algorithm performs best in other settings. The Author(s). Published by Elsevier Ltd. 2023-09 2023-06-15 /pmc/articles/PMC10266884/ /pubmed/37356293 http://dx.doi.org/10.1016/j.compbiomed.2023.107146 Text en © 2023 The Author(s) Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Vagliano, I. Kingma, M.Y. Dongelmans, D.A. de Lange, D.W. de Keizer, N.F. Schut, M.C. Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU |
title | Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU |
title_full | Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU |
title_fullStr | Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU |
title_full_unstemmed | Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU |
title_short | Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU |
title_sort | automated identification of patient subgroups: a case-study on mortality of covid-19 patients admitted to the icu |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10266884/ https://www.ncbi.nlm.nih.gov/pubmed/37356293 http://dx.doi.org/10.1016/j.compbiomed.2023.107146 |
work_keys_str_mv | AT vaglianoi automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu AT kingmamy automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu AT dongelmansda automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu AT delangedw automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu AT dekeizernf automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu AT schutmc automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu AT automatedidentificationofpatientsubgroupsacasestudyonmortalityofcovid19patientsadmittedtotheicu |