Cargando…

Multivariate data driven prediction of COVID-19 dynamics: Towards new results with temperature, humidity and air quality data

Since the start of the COVID-19 pandemic many studies investigated the correlation between climate variables such as air quality, humidity and temperature and the lethality of COVID-19 around the world. In this work we investigate the use of climate variables, as additional features to train a data-...

Descripción completa

Detalles Bibliográficos
Autores principales: Aragão, Dunfrey P., Oliveira, Emerson V., Bezerra, Arthur A., dos Santos, Davi H., da Silva Junior, Andouglas G., Pereira, Igor G., Piscitelli, Prisco, Miani, Alessandro, Distante, Cosimo, Cuno, Jordan S., Conci, Aura, Gonçalves, Luiz M.G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8577104/
https://www.ncbi.nlm.nih.gov/pubmed/34767822
http://dx.doi.org/10.1016/j.envres.2021.112348
_version_ 1784596010513727488
author Aragão, Dunfrey P.
Oliveira, Emerson V.
Bezerra, Arthur A.
dos Santos, Davi H.
da Silva Junior, Andouglas G.
Pereira, Igor G.
Piscitelli, Prisco
Miani, Alessandro
Distante, Cosimo
Cuno, Jordan S.
Conci, Aura
Gonçalves, Luiz M.G.
author_facet Aragão, Dunfrey P.
Oliveira, Emerson V.
Bezerra, Arthur A.
dos Santos, Davi H.
da Silva Junior, Andouglas G.
Pereira, Igor G.
Piscitelli, Prisco
Miani, Alessandro
Distante, Cosimo
Cuno, Jordan S.
Conci, Aura
Gonçalves, Luiz M.G.
author_sort Aragão, Dunfrey P.
collection PubMed
description Since the start of the COVID-19 pandemic many studies investigated the correlation between climate variables such as air quality, humidity and temperature and the lethality of COVID-19 around the world. In this work we investigate the use of climate variables, as additional features to train a data-driven multivariate forecast model to predict the short-term expected number of COVID-19 deaths in Brazilian states and major cities. The main idea is that by adding these climate features as inputs to the training of data-driven models, the predictive performance improves when compared to equivalent single input models. We use a Stacked LSTM as the network architecture for both the multivariate and univariate model. We compare both approaches by training forecast models for the COVID-19 deaths time series of the city of São Paulo. In addition, we present a previous analysis based on grouping K-means on AQI curves. The results produced will allow achieving the application of transfer learning, once a locality is eventually added to the task, regressing out using a model based on the cluster of similarities in the AQI curve. The experiments show that the best multivariate model is more skilled than the best standard data-driven univariate model that we could find, using as evaluation metrics the average fitting error, average forecast error, and the profile of the accumulated deaths for the forecast. These results show that by adding more useful features as input to a multivariate approach could further improve the quality of the prediction models.
format Online
Article
Text
id pubmed-8577104
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier Inc.
record_format MEDLINE/PubMed
spelling pubmed-85771042021-11-09 Multivariate data driven prediction of COVID-19 dynamics: Towards new results with temperature, humidity and air quality data Aragão, Dunfrey P. Oliveira, Emerson V. Bezerra, Arthur A. dos Santos, Davi H. da Silva Junior, Andouglas G. Pereira, Igor G. Piscitelli, Prisco Miani, Alessandro Distante, Cosimo Cuno, Jordan S. Conci, Aura Gonçalves, Luiz M.G. Environ Res Article Since the start of the COVID-19 pandemic many studies investigated the correlation between climate variables such as air quality, humidity and temperature and the lethality of COVID-19 around the world. In this work we investigate the use of climate variables, as additional features to train a data-driven multivariate forecast model to predict the short-term expected number of COVID-19 deaths in Brazilian states and major cities. The main idea is that by adding these climate features as inputs to the training of data-driven models, the predictive performance improves when compared to equivalent single input models. We use a Stacked LSTM as the network architecture for both the multivariate and univariate model. We compare both approaches by training forecast models for the COVID-19 deaths time series of the city of São Paulo. In addition, we present a previous analysis based on grouping K-means on AQI curves. The results produced will allow achieving the application of transfer learning, once a locality is eventually added to the task, regressing out using a model based on the cluster of similarities in the AQI curve. The experiments show that the best multivariate model is more skilled than the best standard data-driven univariate model that we could find, using as evaluation metrics the average fitting error, average forecast error, and the profile of the accumulated deaths for the forecast. These results show that by adding more useful features as input to a multivariate approach could further improve the quality of the prediction models. Elsevier Inc. 2022-03 2021-11-09 /pmc/articles/PMC8577104/ /pubmed/34767822 http://dx.doi.org/10.1016/j.envres.2021.112348 Text en © 2021 Elsevier Inc. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Aragão, Dunfrey P.
Oliveira, Emerson V.
Bezerra, Arthur A.
dos Santos, Davi H.
da Silva Junior, Andouglas G.
Pereira, Igor G.
Piscitelli, Prisco
Miani, Alessandro
Distante, Cosimo
Cuno, Jordan S.
Conci, Aura
Gonçalves, Luiz M.G.
Multivariate data driven prediction of COVID-19 dynamics: Towards new results with temperature, humidity and air quality data
title Multivariate data driven prediction of COVID-19 dynamics: Towards new results with temperature, humidity and air quality data
title_full Multivariate data driven prediction of COVID-19 dynamics: Towards new results with temperature, humidity and air quality data
title_fullStr Multivariate data driven prediction of COVID-19 dynamics: Towards new results with temperature, humidity and air quality data
title_full_unstemmed Multivariate data driven prediction of COVID-19 dynamics: Towards new results with temperature, humidity and air quality data
title_short Multivariate data driven prediction of COVID-19 dynamics: Towards new results with temperature, humidity and air quality data
title_sort multivariate data driven prediction of covid-19 dynamics: towards new results with temperature, humidity and air quality data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8577104/
https://www.ncbi.nlm.nih.gov/pubmed/34767822
http://dx.doi.org/10.1016/j.envres.2021.112348
work_keys_str_mv AT aragaodunfreyp multivariatedatadrivenpredictionofcovid19dynamicstowardsnewresultswithtemperaturehumidityandairqualitydata
AT oliveiraemersonv multivariatedatadrivenpredictionofcovid19dynamicstowardsnewresultswithtemperaturehumidityandairqualitydata
AT bezerraarthura multivariatedatadrivenpredictionofcovid19dynamicstowardsnewresultswithtemperaturehumidityandairqualitydata
AT dossantosdavih multivariatedatadrivenpredictionofcovid19dynamicstowardsnewresultswithtemperaturehumidityandairqualitydata
AT dasilvajuniorandouglasg multivariatedatadrivenpredictionofcovid19dynamicstowardsnewresultswithtemperaturehumidityandairqualitydata
AT pereiraigorg multivariatedatadrivenpredictionofcovid19dynamicstowardsnewresultswithtemperaturehumidityandairqualitydata
AT piscitelliprisco multivariatedatadrivenpredictionofcovid19dynamicstowardsnewresultswithtemperaturehumidityandairqualitydata
AT mianialessandro multivariatedatadrivenpredictionofcovid19dynamicstowardsnewresultswithtemperaturehumidityandairqualitydata
AT distantecosimo multivariatedatadrivenpredictionofcovid19dynamicstowardsnewresultswithtemperaturehumidityandairqualitydata
AT cunojordans multivariatedatadrivenpredictionofcovid19dynamicstowardsnewresultswithtemperaturehumidityandairqualitydata
AT conciaura multivariatedatadrivenpredictionofcovid19dynamicstowardsnewresultswithtemperaturehumidityandairqualitydata
AT goncalvesluizmg multivariatedatadrivenpredictionofcovid19dynamicstowardsnewresultswithtemperaturehumidityandairqualitydata