Cargando…

Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study

BACKGROUND: Early detection and intervention are the key factors for improving outcomes in patients with COVID-19. OBJECTIVE: The objective of this observational longitudinal study was to identify nonoverlapping severity subgroups (ie, clusters) among patients with COVID-19, based exclusively on cli...

Descripción completa

Detalles Bibliográficos
Autores principales: Benito-León, Julián, del Castillo, Mª Dolores, Estirado, Alberto, Ghosh, Ritwik, Dubey, Souvik, Serrano, J Ignacio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8163491/
https://www.ncbi.nlm.nih.gov/pubmed/33872186
http://dx.doi.org/10.2196/25988
_version_ 1783700919253204992
author Benito-León, Julián
del Castillo, Mª Dolores
Estirado, Alberto
Ghosh, Ritwik
Dubey, Souvik
Serrano, J Ignacio
author_facet Benito-León, Julián
del Castillo, Mª Dolores
Estirado, Alberto
Ghosh, Ritwik
Dubey, Souvik
Serrano, J Ignacio
author_sort Benito-León, Julián
collection PubMed
description BACKGROUND: Early detection and intervention are the key factors for improving outcomes in patients with COVID-19. OBJECTIVE: The objective of this observational longitudinal study was to identify nonoverlapping severity subgroups (ie, clusters) among patients with COVID-19, based exclusively on clinical data and standard laboratory tests obtained during patient assessment in the emergency department. METHODS: We applied unsupervised machine learning to a data set of 853 patients with COVID-19 from the HM group of hospitals (HM Hospitales) in Madrid, Spain. Age and sex were not considered while building the clusters, as these variables could introduce biases in machine learning algorithms and raise ethical implications or enable discrimination in triage protocols. RESULTS: From 850 clinical and laboratory variables, four tests—the serum levels of aspartate transaminase (AST), lactate dehydrogenase (LDH), C-reactive protein (CRP), and the number of neutrophils—were enough to segregate the entire patient pool into three separate clusters. Further, the percentage of monocytes and lymphocytes and the levels of alanine transaminase (ALT) distinguished cluster 3 patients from the other two clusters. The highest proportion of deceased patients; the highest levels of AST, ALT, LDH, and CRP; the highest number of neutrophils; and the lowest percentages of monocytes and lymphocytes characterized cluster 1. Cluster 2 included a lower proportion of deceased patients and intermediate levels of the previous laboratory tests. The lowest proportion of deceased patients; the lowest levels of AST, ALT, LDH, and CRP; the lowest number of neutrophils; and the highest percentages of monocytes and lymphocytes characterized cluster 3. CONCLUSIONS: A few standard laboratory tests, deemed available in all emergency departments, have shown good discriminative power for the characterization of severity subgroups among patients with COVID-19.
format Online
Article
Text
id pubmed-8163491
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-81634912021-06-03 Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study Benito-León, Julián del Castillo, Mª Dolores Estirado, Alberto Ghosh, Ritwik Dubey, Souvik Serrano, J Ignacio J Med Internet Res Original Paper BACKGROUND: Early detection and intervention are the key factors for improving outcomes in patients with COVID-19. OBJECTIVE: The objective of this observational longitudinal study was to identify nonoverlapping severity subgroups (ie, clusters) among patients with COVID-19, based exclusively on clinical data and standard laboratory tests obtained during patient assessment in the emergency department. METHODS: We applied unsupervised machine learning to a data set of 853 patients with COVID-19 from the HM group of hospitals (HM Hospitales) in Madrid, Spain. Age and sex were not considered while building the clusters, as these variables could introduce biases in machine learning algorithms and raise ethical implications or enable discrimination in triage protocols. RESULTS: From 850 clinical and laboratory variables, four tests—the serum levels of aspartate transaminase (AST), lactate dehydrogenase (LDH), C-reactive protein (CRP), and the number of neutrophils—were enough to segregate the entire patient pool into three separate clusters. Further, the percentage of monocytes and lymphocytes and the levels of alanine transaminase (ALT) distinguished cluster 3 patients from the other two clusters. The highest proportion of deceased patients; the highest levels of AST, ALT, LDH, and CRP; the highest number of neutrophils; and the lowest percentages of monocytes and lymphocytes characterized cluster 1. Cluster 2 included a lower proportion of deceased patients and intermediate levels of the previous laboratory tests. The lowest proportion of deceased patients; the lowest levels of AST, ALT, LDH, and CRP; the lowest number of neutrophils; and the highest percentages of monocytes and lymphocytes characterized cluster 3. CONCLUSIONS: A few standard laboratory tests, deemed available in all emergency departments, have shown good discriminative power for the characterization of severity subgroups among patients with COVID-19. JMIR Publications 2021-05-27 /pmc/articles/PMC8163491/ /pubmed/33872186 http://dx.doi.org/10.2196/25988 Text en ©Julián Benito-León, Mª Dolores del Castillo, Alberto Estirado, Ritwik Ghosh, Souvik Dubey, J Ignacio Serrano. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 27.05.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Benito-León, Julián
del Castillo, Mª Dolores
Estirado, Alberto
Ghosh, Ritwik
Dubey, Souvik
Serrano, J Ignacio
Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study
title Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study
title_full Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study
title_fullStr Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study
title_full_unstemmed Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study
title_short Using Unsupervised Machine Learning to Identify Age- and Sex-Independent Severity Subgroups Among Patients with COVID-19: Observational Longitudinal Study
title_sort using unsupervised machine learning to identify age- and sex-independent severity subgroups among patients with covid-19: observational longitudinal study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8163491/
https://www.ncbi.nlm.nih.gov/pubmed/33872186
http://dx.doi.org/10.2196/25988
work_keys_str_mv AT benitoleonjulian usingunsupervisedmachinelearningtoidentifyageandsexindependentseveritysubgroupsamongpatientswithcovid19observationallongitudinalstudy
AT delcastillomadolores usingunsupervisedmachinelearningtoidentifyageandsexindependentseveritysubgroupsamongpatientswithcovid19observationallongitudinalstudy
AT estiradoalberto usingunsupervisedmachinelearningtoidentifyageandsexindependentseveritysubgroupsamongpatientswithcovid19observationallongitudinalstudy
AT ghoshritwik usingunsupervisedmachinelearningtoidentifyageandsexindependentseveritysubgroupsamongpatientswithcovid19observationallongitudinalstudy
AT dubeysouvik usingunsupervisedmachinelearningtoidentifyageandsexindependentseveritysubgroupsamongpatientswithcovid19observationallongitudinalstudy
AT serranojignacio usingunsupervisedmachinelearningtoidentifyageandsexindependentseveritysubgroupsamongpatientswithcovid19observationallongitudinalstudy