Cargando…

A spatiotemporal machine learning approach to forecasting COVID-19 incidence at the county level in the USA

With COVID-19 affecting every country globally and changing everyday life, the ability to forecast the spread of the disease is more important than any previous epidemic. The conventional methods of disease-spread modeling, compartmental models, are based on the assumption of spatiotemporal homogene...

Descripción completa

Detalles Bibliográficos
Autores principales: Lucas, Benjamin, Vahedi, Behzad, Karimzadeh, Morteza
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8760128/
https://www.ncbi.nlm.nih.gov/pubmed/35071733
http://dx.doi.org/10.1007/s41060-021-00295-9
_version_ 1784633254724239360
author Lucas, Benjamin
Vahedi, Behzad
Karimzadeh, Morteza
author_facet Lucas, Benjamin
Vahedi, Behzad
Karimzadeh, Morteza
author_sort Lucas, Benjamin
collection PubMed
description With COVID-19 affecting every country globally and changing everyday life, the ability to forecast the spread of the disease is more important than any previous epidemic. The conventional methods of disease-spread modeling, compartmental models, are based on the assumption of spatiotemporal homogeneity of the spread of the virus, which may cause forecasting to underperform, especially at high spatial resolutions. In this paper, we approach the forecasting task with an alternative technique—spatiotemporal machine learning. We present COVID-LSTM, a data-driven model based on a long short-term memory deep learning architecture for forecasting COVID-19 incidence at the county level in the USA. We use the weekly number of new positive cases as temporal input, and hand-engineered spatial features from Facebook movement and connectedness datasets to capture the spread of the disease in time and space. COVID-LSTM outperforms the COVID-19 Forecast Hub’s Ensemble model (COVIDhub-ensemble) on our 17-week evaluation period, making it the first model to be more accurate than the COVIDhub-ensemble over one or more forecast periods. Over the 4-week forecast horizon, our model is on average 50 cases per county more accurate than the COVIDhub-ensemble. We highlight that the underutilization of data-driven forecasting of disease spread prior to COVID-19 is likely due to the lack of sufficient data available for previous diseases, in addition to the recent advances in machine learning methods for spatiotemporal forecasting. We discuss the impediments to the wider uptake of data-driven forecasting, and whether it is likely that more deep learning-based models will be used in the future.
format Online
Article
Text
id pubmed-8760128
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-87601282022-01-18 A spatiotemporal machine learning approach to forecasting COVID-19 incidence at the county level in the USA Lucas, Benjamin Vahedi, Behzad Karimzadeh, Morteza Int J Data Sci Anal Regular Paper With COVID-19 affecting every country globally and changing everyday life, the ability to forecast the spread of the disease is more important than any previous epidemic. The conventional methods of disease-spread modeling, compartmental models, are based on the assumption of spatiotemporal homogeneity of the spread of the virus, which may cause forecasting to underperform, especially at high spatial resolutions. In this paper, we approach the forecasting task with an alternative technique—spatiotemporal machine learning. We present COVID-LSTM, a data-driven model based on a long short-term memory deep learning architecture for forecasting COVID-19 incidence at the county level in the USA. We use the weekly number of new positive cases as temporal input, and hand-engineered spatial features from Facebook movement and connectedness datasets to capture the spread of the disease in time and space. COVID-LSTM outperforms the COVID-19 Forecast Hub’s Ensemble model (COVIDhub-ensemble) on our 17-week evaluation period, making it the first model to be more accurate than the COVIDhub-ensemble over one or more forecast periods. Over the 4-week forecast horizon, our model is on average 50 cases per county more accurate than the COVIDhub-ensemble. We highlight that the underutilization of data-driven forecasting of disease spread prior to COVID-19 is likely due to the lack of sufficient data available for previous diseases, in addition to the recent advances in machine learning methods for spatiotemporal forecasting. We discuss the impediments to the wider uptake of data-driven forecasting, and whether it is likely that more deep learning-based models will be used in the future. Springer International Publishing 2022-01-15 2023 /pmc/articles/PMC8760128/ /pubmed/35071733 http://dx.doi.org/10.1007/s41060-021-00295-9 Text en © The Author(s), under exclusive licence to Springer Nature Switzerland AG 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Regular Paper
Lucas, Benjamin
Vahedi, Behzad
Karimzadeh, Morteza
A spatiotemporal machine learning approach to forecasting COVID-19 incidence at the county level in the USA
title A spatiotemporal machine learning approach to forecasting COVID-19 incidence at the county level in the USA
title_full A spatiotemporal machine learning approach to forecasting COVID-19 incidence at the county level in the USA
title_fullStr A spatiotemporal machine learning approach to forecasting COVID-19 incidence at the county level in the USA
title_full_unstemmed A spatiotemporal machine learning approach to forecasting COVID-19 incidence at the county level in the USA
title_short A spatiotemporal machine learning approach to forecasting COVID-19 incidence at the county level in the USA
title_sort spatiotemporal machine learning approach to forecasting covid-19 incidence at the county level in the usa
topic Regular Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8760128/
https://www.ncbi.nlm.nih.gov/pubmed/35071733
http://dx.doi.org/10.1007/s41060-021-00295-9
work_keys_str_mv AT lucasbenjamin aspatiotemporalmachinelearningapproachtoforecastingcovid19incidenceatthecountylevelintheusa
AT vahedibehzad aspatiotemporalmachinelearningapproachtoforecastingcovid19incidenceatthecountylevelintheusa
AT karimzadehmorteza aspatiotemporalmachinelearningapproachtoforecastingcovid19incidenceatthecountylevelintheusa
AT lucasbenjamin spatiotemporalmachinelearningapproachtoforecastingcovid19incidenceatthecountylevelintheusa
AT vahedibehzad spatiotemporalmachinelearningapproachtoforecastingcovid19incidenceatthecountylevelintheusa
AT karimzadehmorteza spatiotemporalmachinelearningapproachtoforecastingcovid19incidenceatthecountylevelintheusa