Cargando…

Feature Importance Analysis by Nowcasting Perspective to Predict COVID-19

The present work raises an investigation about prediction and the feature importance to estimate the COVID-19 infection, using Machine Learning approach. Our work analyzed the inclusion of climatic features, mobility, government actions and the number of cases per health sub-territory from an existi...

Descripción completa

Detalles Bibliográficos
Autores principales: Gonçalves, André Vinícius, de Araujo, Gustavo Medeiros, Garcia, Leandro Pereira, Amaral, Fernanda Vargas, Schneider, Ione Jayce Ceola
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9033308/
http://dx.doi.org/10.1007/s11036-022-01966-y
_version_ 1784692856187781120
author Gonçalves, André Vinícius
de Araujo, Gustavo Medeiros
Garcia, Leandro Pereira
Amaral, Fernanda Vargas
Schneider, Ione Jayce Ceola
author_facet Gonçalves, André Vinícius
de Araujo, Gustavo Medeiros
Garcia, Leandro Pereira
Amaral, Fernanda Vargas
Schneider, Ione Jayce Ceola
author_sort Gonçalves, André Vinícius
collection PubMed
description The present work raises an investigation about prediction and the feature importance to estimate the COVID-19 infection, using Machine Learning approach. Our work analyzed the inclusion of climatic features, mobility, government actions and the number of cases per health sub-territory from an existing model. The Random Forest with Permutation Importance method was used to assess the importance and list the thirty most relevant that represent the probability of infection of the disease. Among all features, the most important were: i) the variables per region health stand out, ii) period comprised between the date of notification and symptom onset, iii) symptoms features as fever, cough and sore throat, iv) variables of the traffic flow and mobility, and also v) wheathers features. The model was validated and reached an accuracy average of 81.82%, whereas the sensitivity and specificity achieved 87.52% and the 78.67% respectively in the infection estimate. Therefore, the proposed investigation represents an alternative to guide authorities in understanding aspects related to the disease.
format Online
Article
Text
id pubmed-9033308
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-90333082022-04-25 Feature Importance Analysis by Nowcasting Perspective to Predict COVID-19 Gonçalves, André Vinícius de Araujo, Gustavo Medeiros Garcia, Leandro Pereira Amaral, Fernanda Vargas Schneider, Ione Jayce Ceola Mobile Netw Appl Article The present work raises an investigation about prediction and the feature importance to estimate the COVID-19 infection, using Machine Learning approach. Our work analyzed the inclusion of climatic features, mobility, government actions and the number of cases per health sub-territory from an existing model. The Random Forest with Permutation Importance method was used to assess the importance and list the thirty most relevant that represent the probability of infection of the disease. Among all features, the most important were: i) the variables per region health stand out, ii) period comprised between the date of notification and symptom onset, iii) symptoms features as fever, cough and sore throat, iv) variables of the traffic flow and mobility, and also v) wheathers features. The model was validated and reached an accuracy average of 81.82%, whereas the sensitivity and specificity achieved 87.52% and the 78.67% respectively in the infection estimate. Therefore, the proposed investigation represents an alternative to guide authorities in understanding aspects related to the disease. Springer US 2022-04-23 2022 /pmc/articles/PMC9033308/ http://dx.doi.org/10.1007/s11036-022-01966-y Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Gonçalves, André Vinícius
de Araujo, Gustavo Medeiros
Garcia, Leandro Pereira
Amaral, Fernanda Vargas
Schneider, Ione Jayce Ceola
Feature Importance Analysis by Nowcasting Perspective to Predict COVID-19
title Feature Importance Analysis by Nowcasting Perspective to Predict COVID-19
title_full Feature Importance Analysis by Nowcasting Perspective to Predict COVID-19
title_fullStr Feature Importance Analysis by Nowcasting Perspective to Predict COVID-19
title_full_unstemmed Feature Importance Analysis by Nowcasting Perspective to Predict COVID-19
title_short Feature Importance Analysis by Nowcasting Perspective to Predict COVID-19
title_sort feature importance analysis by nowcasting perspective to predict covid-19
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9033308/
http://dx.doi.org/10.1007/s11036-022-01966-y
work_keys_str_mv AT goncalvesandrevinicius featureimportanceanalysisbynowcastingperspectivetopredictcovid19
AT dearaujogustavomedeiros featureimportanceanalysisbynowcastingperspectivetopredictcovid19
AT garcialeandropereira featureimportanceanalysisbynowcastingperspectivetopredictcovid19
AT amaralfernandavargas featureimportanceanalysisbynowcastingperspectivetopredictcovid19
AT schneiderionejayceceola featureimportanceanalysisbynowcastingperspectivetopredictcovid19