Cargando…

Forecasting Covid-19 Dynamics in Brazil: A Data Driven Approach

The contribution of this paper is twofold. First, a new data driven approach for predicting the Covid-19 pandemic dynamics is introduced. The second contribution consists in reporting and discussing the results that were obtained with this approach for the Brazilian states, with predictions starting...

Descripción completa

Detalles Bibliográficos
Autores principales: Pereira, Igor Gadelha, Guerin, Joris Michel, Silva Júnior, Andouglas Gonçalves, Garcia, Gabriel Santos, Piscitelli, Prisco, Miani, Alessandro, Distante, Cosimo, Gonçalves, Luiz Marcos Garcia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7400194/
https://www.ncbi.nlm.nih.gov/pubmed/32679861
http://dx.doi.org/10.3390/ijerph17145115
_version_ 1783566307983097856
author Pereira, Igor Gadelha
Guerin, Joris Michel
Silva Júnior, Andouglas Gonçalves
Garcia, Gabriel Santos
Piscitelli, Prisco
Miani, Alessandro
Distante, Cosimo
Gonçalves, Luiz Marcos Garcia
author_facet Pereira, Igor Gadelha
Guerin, Joris Michel
Silva Júnior, Andouglas Gonçalves
Garcia, Gabriel Santos
Piscitelli, Prisco
Miani, Alessandro
Distante, Cosimo
Gonçalves, Luiz Marcos Garcia
author_sort Pereira, Igor Gadelha
collection PubMed
description The contribution of this paper is twofold. First, a new data driven approach for predicting the Covid-19 pandemic dynamics is introduced. The second contribution consists in reporting and discussing the results that were obtained with this approach for the Brazilian states, with predictions starting as of 4 May 2020. As a preliminary study, we first used an Long Short Term Memory for Data Training-SAE (LSTM-SAE) network model. Although this first approach led to somewhat disappointing results, it served as a good baseline for testing other ANN types. Subsequently, in order to identify relevant countries and regions to be used for training ANN models, we conduct a clustering of the world’s regions where the pandemic is at an advanced stage. This clustering is based on manually engineered features representing a country’s response to the early spread of the pandemic, and the different clusters obtained are used to select the relevant countries for training the models. The final models retained are Modified Auto-Encoder networks, that are trained on these clusters and learn to predict future data for Brazilian states. These predictions are used to estimate important statistics about the disease, such as peaks and number of confirmed cases. Finally, curve fitting is carried out to find the distribution that best fits the outputs of the MAE, and to refine the estimates of the peaks of the pandemic. Predicted numbers reach a total of more than one million infected Brazilians, distributed among the different states, with São Paulo leading with about 150 thousand confirmed cases predicted. The results indicate that the pandemic is still growing in Brazil, with most states peaks of infection estimated in the second half of May 2020. The estimated end of the pandemics (97% of cases reaching an outcome) spread between June and the end of August 2020, depending on the states.
format Online
Article
Text
id pubmed-7400194
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-74001942020-08-23 Forecasting Covid-19 Dynamics in Brazil: A Data Driven Approach Pereira, Igor Gadelha Guerin, Joris Michel Silva Júnior, Andouglas Gonçalves Garcia, Gabriel Santos Piscitelli, Prisco Miani, Alessandro Distante, Cosimo Gonçalves, Luiz Marcos Garcia Int J Environ Res Public Health Article The contribution of this paper is twofold. First, a new data driven approach for predicting the Covid-19 pandemic dynamics is introduced. The second contribution consists in reporting and discussing the results that were obtained with this approach for the Brazilian states, with predictions starting as of 4 May 2020. As a preliminary study, we first used an Long Short Term Memory for Data Training-SAE (LSTM-SAE) network model. Although this first approach led to somewhat disappointing results, it served as a good baseline for testing other ANN types. Subsequently, in order to identify relevant countries and regions to be used for training ANN models, we conduct a clustering of the world’s regions where the pandemic is at an advanced stage. This clustering is based on manually engineered features representing a country’s response to the early spread of the pandemic, and the different clusters obtained are used to select the relevant countries for training the models. The final models retained are Modified Auto-Encoder networks, that are trained on these clusters and learn to predict future data for Brazilian states. These predictions are used to estimate important statistics about the disease, such as peaks and number of confirmed cases. Finally, curve fitting is carried out to find the distribution that best fits the outputs of the MAE, and to refine the estimates of the peaks of the pandemic. Predicted numbers reach a total of more than one million infected Brazilians, distributed among the different states, with São Paulo leading with about 150 thousand confirmed cases predicted. The results indicate that the pandemic is still growing in Brazil, with most states peaks of infection estimated in the second half of May 2020. The estimated end of the pandemics (97% of cases reaching an outcome) spread between June and the end of August 2020, depending on the states. MDPI 2020-07-15 2020-07 /pmc/articles/PMC7400194/ /pubmed/32679861 http://dx.doi.org/10.3390/ijerph17145115 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Pereira, Igor Gadelha
Guerin, Joris Michel
Silva Júnior, Andouglas Gonçalves
Garcia, Gabriel Santos
Piscitelli, Prisco
Miani, Alessandro
Distante, Cosimo
Gonçalves, Luiz Marcos Garcia
Forecasting Covid-19 Dynamics in Brazil: A Data Driven Approach
title Forecasting Covid-19 Dynamics in Brazil: A Data Driven Approach
title_full Forecasting Covid-19 Dynamics in Brazil: A Data Driven Approach
title_fullStr Forecasting Covid-19 Dynamics in Brazil: A Data Driven Approach
title_full_unstemmed Forecasting Covid-19 Dynamics in Brazil: A Data Driven Approach
title_short Forecasting Covid-19 Dynamics in Brazil: A Data Driven Approach
title_sort forecasting covid-19 dynamics in brazil: a data driven approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7400194/
https://www.ncbi.nlm.nih.gov/pubmed/32679861
http://dx.doi.org/10.3390/ijerph17145115
work_keys_str_mv AT pereiraigorgadelha forecastingcovid19dynamicsinbraziladatadrivenapproach
AT guerinjorismichel forecastingcovid19dynamicsinbraziladatadrivenapproach
AT silvajuniorandouglasgoncalves forecastingcovid19dynamicsinbraziladatadrivenapproach
AT garciagabrielsantos forecastingcovid19dynamicsinbraziladatadrivenapproach
AT piscitelliprisco forecastingcovid19dynamicsinbraziladatadrivenapproach
AT mianialessandro forecastingcovid19dynamicsinbraziladatadrivenapproach
AT distantecosimo forecastingcovid19dynamicsinbraziladatadrivenapproach
AT goncalvesluizmarcosgarcia forecastingcovid19dynamicsinbraziladatadrivenapproach