Cargando…

Evaluating Time Influence over Performance of Machine-Learning-Based Diagnosis: A Case Study of COVID-19 Pandemic in Brazil

Efficiently recognising severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) symptoms enables a quick and accurate diagnosis to be made, and helps in mitigating the spread of the coronavirus disease 2019. However, the emergence of new variants has caused constant changes in the symptoms asso...

Descripción completa

Detalles Bibliográficos
Autores principales: Marques, Julliana Gonçalves, Guedes, Luiz Affonso, da Costa Abreu, Márjory Cristiany
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9819042/
https://www.ncbi.nlm.nih.gov/pubmed/36612458
http://dx.doi.org/10.3390/ijerph20010136
_version_ 1784865133760086016
author Marques, Julliana Gonçalves
Guedes, Luiz Affonso
da Costa Abreu, Márjory Cristiany
author_facet Marques, Julliana Gonçalves
Guedes, Luiz Affonso
da Costa Abreu, Márjory Cristiany
author_sort Marques, Julliana Gonçalves
collection PubMed
description Efficiently recognising severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) symptoms enables a quick and accurate diagnosis to be made, and helps in mitigating the spread of the coronavirus disease 2019. However, the emergence of new variants has caused constant changes in the symptoms associate with COVID-19. These constant changes directly impact the performance of machine-learning-based diagnose. In this context, considering the impact of these changes in symptoms over time is necessary for accurate diagnoses. Thus, in this study, we propose a machine-learning-based approach for diagnosing COVID-19 that considers the importance of time in model predictions. Our approach analyses the performance of XGBoost using two different time-based strategies for model training: month-to-month and accumulated strategies. The model was evaluated using known metrics: accuracy, precision, and recall. Furthermore, to explain the impact of feature changes on model prediction, feature importance was measured using the SHAP technique, an XAI technique. We obtained very interesting results: considering time when creating a COVID-19 diagnostic prediction model is advantageous.
format Online
Article
Text
id pubmed-9819042
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-98190422023-01-07 Evaluating Time Influence over Performance of Machine-Learning-Based Diagnosis: A Case Study of COVID-19 Pandemic in Brazil Marques, Julliana Gonçalves Guedes, Luiz Affonso da Costa Abreu, Márjory Cristiany Int J Environ Res Public Health Article Efficiently recognising severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) symptoms enables a quick and accurate diagnosis to be made, and helps in mitigating the spread of the coronavirus disease 2019. However, the emergence of new variants has caused constant changes in the symptoms associate with COVID-19. These constant changes directly impact the performance of machine-learning-based diagnose. In this context, considering the impact of these changes in symptoms over time is necessary for accurate diagnoses. Thus, in this study, we propose a machine-learning-based approach for diagnosing COVID-19 that considers the importance of time in model predictions. Our approach analyses the performance of XGBoost using two different time-based strategies for model training: month-to-month and accumulated strategies. The model was evaluated using known metrics: accuracy, precision, and recall. Furthermore, to explain the impact of feature changes on model prediction, feature importance was measured using the SHAP technique, an XAI technique. We obtained very interesting results: considering time when creating a COVID-19 diagnostic prediction model is advantageous. MDPI 2022-12-22 /pmc/articles/PMC9819042/ /pubmed/36612458 http://dx.doi.org/10.3390/ijerph20010136 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Marques, Julliana Gonçalves
Guedes, Luiz Affonso
da Costa Abreu, Márjory Cristiany
Evaluating Time Influence over Performance of Machine-Learning-Based Diagnosis: A Case Study of COVID-19 Pandemic in Brazil
title Evaluating Time Influence over Performance of Machine-Learning-Based Diagnosis: A Case Study of COVID-19 Pandemic in Brazil
title_full Evaluating Time Influence over Performance of Machine-Learning-Based Diagnosis: A Case Study of COVID-19 Pandemic in Brazil
title_fullStr Evaluating Time Influence over Performance of Machine-Learning-Based Diagnosis: A Case Study of COVID-19 Pandemic in Brazil
title_full_unstemmed Evaluating Time Influence over Performance of Machine-Learning-Based Diagnosis: A Case Study of COVID-19 Pandemic in Brazil
title_short Evaluating Time Influence over Performance of Machine-Learning-Based Diagnosis: A Case Study of COVID-19 Pandemic in Brazil
title_sort evaluating time influence over performance of machine-learning-based diagnosis: a case study of covid-19 pandemic in brazil
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9819042/
https://www.ncbi.nlm.nih.gov/pubmed/36612458
http://dx.doi.org/10.3390/ijerph20010136
work_keys_str_mv AT marquesjullianagoncalves evaluatingtimeinfluenceoverperformanceofmachinelearningbaseddiagnosisacasestudyofcovid19pandemicinbrazil
AT guedesluizaffonso evaluatingtimeinfluenceoverperformanceofmachinelearningbaseddiagnosisacasestudyofcovid19pandemicinbrazil
AT dacostaabreumarjorycristiany evaluatingtimeinfluenceoverperformanceofmachinelearningbaseddiagnosisacasestudyofcovid19pandemicinbrazil