Cargando…

A machine learning model to assess potential misdiagnosed dengue hospitalization

Dengue, like other arboviruses with broad clinical spectra, can easily be misdiagnosed as other infectious diseases due to the overlap of signs and symptoms. During large outbreaks, severe dengue cases have the potential to overwhelm the health care system and understanding the burden of dengue hosp...

Descripción completa

Detalles Bibliográficos
Autores principales: Santos, Claudia Yang, Tuboi, Suely, de Jesus Lopes de Abreu, Ariane, Abud, Denise Alves, Lobao Neto, Abner Augusto, Pereira, Ramon, Siqueira, Joao Bosco
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10258378/
https://www.ncbi.nlm.nih.gov/pubmed/37313173
http://dx.doi.org/10.1016/j.heliyon.2023.e16634
_version_ 1785057453242580992
author Santos, Claudia Yang
Tuboi, Suely
de Jesus Lopes de Abreu, Ariane
Abud, Denise Alves
Lobao Neto, Abner Augusto
Pereira, Ramon
Siqueira, Joao Bosco
author_facet Santos, Claudia Yang
Tuboi, Suely
de Jesus Lopes de Abreu, Ariane
Abud, Denise Alves
Lobao Neto, Abner Augusto
Pereira, Ramon
Siqueira, Joao Bosco
author_sort Santos, Claudia Yang
collection PubMed
description Dengue, like other arboviruses with broad clinical spectra, can easily be misdiagnosed as other infectious diseases due to the overlap of signs and symptoms. During large outbreaks, severe dengue cases have the potential to overwhelm the health care system and understanding the burden of dengue hospitalizations is therefore important to better allocate medical care and public health resources. A machine learning model that used data from the Brazilian public healthcare system database and the National Institute of Meteorology (INMET) was developed to estimate potential misdiagnosed dengue hospitalizations in Brazil. The data was modeled into a hospitalization level linked dataset. Then, Random Forest, Logistic Regression and Support Vector Machine algorithms were assessed. The algorithms were trained by dividing the dataset in training/test set and performing a cross validation to select the best hyperparameters in each algorithm tested. The evaluation was done based on accuracy, precision, recall, F1 score, sensitivity, and specificity. The best model developed was Random Forest with an accuracy of 85% on the final reviewed test. This model shows that 3.4% (13,608) of all hospitalizations in the public healthcare system from 2014 to 2020 could have been dengue misdiagnosed as other diseases. The model was helpful in finding potentially misdiagnosed dengue and might be a useful tool to help public health decision makers in planning resource allocation.
format Online
Article
Text
id pubmed-10258378
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-102583782023-06-13 A machine learning model to assess potential misdiagnosed dengue hospitalization Santos, Claudia Yang Tuboi, Suely de Jesus Lopes de Abreu, Ariane Abud, Denise Alves Lobao Neto, Abner Augusto Pereira, Ramon Siqueira, Joao Bosco Heliyon Research Article Dengue, like other arboviruses with broad clinical spectra, can easily be misdiagnosed as other infectious diseases due to the overlap of signs and symptoms. During large outbreaks, severe dengue cases have the potential to overwhelm the health care system and understanding the burden of dengue hospitalizations is therefore important to better allocate medical care and public health resources. A machine learning model that used data from the Brazilian public healthcare system database and the National Institute of Meteorology (INMET) was developed to estimate potential misdiagnosed dengue hospitalizations in Brazil. The data was modeled into a hospitalization level linked dataset. Then, Random Forest, Logistic Regression and Support Vector Machine algorithms were assessed. The algorithms were trained by dividing the dataset in training/test set and performing a cross validation to select the best hyperparameters in each algorithm tested. The evaluation was done based on accuracy, precision, recall, F1 score, sensitivity, and specificity. The best model developed was Random Forest with an accuracy of 85% on the final reviewed test. This model shows that 3.4% (13,608) of all hospitalizations in the public healthcare system from 2014 to 2020 could have been dengue misdiagnosed as other diseases. The model was helpful in finding potentially misdiagnosed dengue and might be a useful tool to help public health decision makers in planning resource allocation. Elsevier 2023-05-30 /pmc/articles/PMC10258378/ /pubmed/37313173 http://dx.doi.org/10.1016/j.heliyon.2023.e16634 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Santos, Claudia Yang
Tuboi, Suely
de Jesus Lopes de Abreu, Ariane
Abud, Denise Alves
Lobao Neto, Abner Augusto
Pereira, Ramon
Siqueira, Joao Bosco
A machine learning model to assess potential misdiagnosed dengue hospitalization
title A machine learning model to assess potential misdiagnosed dengue hospitalization
title_full A machine learning model to assess potential misdiagnosed dengue hospitalization
title_fullStr A machine learning model to assess potential misdiagnosed dengue hospitalization
title_full_unstemmed A machine learning model to assess potential misdiagnosed dengue hospitalization
title_short A machine learning model to assess potential misdiagnosed dengue hospitalization
title_sort machine learning model to assess potential misdiagnosed dengue hospitalization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10258378/
https://www.ncbi.nlm.nih.gov/pubmed/37313173
http://dx.doi.org/10.1016/j.heliyon.2023.e16634
work_keys_str_mv AT santosclaudiayang amachinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization
AT tuboisuely amachinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization
AT dejesuslopesdeabreuariane amachinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization
AT abuddenisealves amachinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization
AT lobaonetoabneraugusto amachinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization
AT pereiraramon amachinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization
AT siqueirajoaobosco amachinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization
AT santosclaudiayang machinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization
AT tuboisuely machinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization
AT dejesuslopesdeabreuariane machinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization
AT abuddenisealves machinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization
AT lobaonetoabneraugusto machinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization
AT pereiraramon machinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization
AT siqueirajoaobosco machinelearningmodeltoassesspotentialmisdiagnoseddenguehospitalization