Cargando…

A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties

The global pandemic of coronavirus disease 2019 (COVID-19) has killed almost two million people worldwide and over 400 thousand in the United States (US). As the pandemic evolves, informed policy-making and strategic resource allocation relies on accurate forecasts. To predict the spread of the viru...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang-James, Yanli, Hess, Jonathan, Salekin, Asif, Wang, Dongliang, Chen, Samuel, Winkelstein, Peter, Morley, Christopher P, Faraone, Stephen V
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8077584/
https://www.ncbi.nlm.nih.gov/pubmed/33907761
http://dx.doi.org/10.1101/2021.04.14.21255507
_version_ 1783684906176479232
author Zhang-James, Yanli
Hess, Jonathan
Salekin, Asif
Wang, Dongliang
Chen, Samuel
Winkelstein, Peter
Morley, Christopher P
Faraone, Stephen V
author_facet Zhang-James, Yanli
Hess, Jonathan
Salekin, Asif
Wang, Dongliang
Chen, Samuel
Winkelstein, Peter
Morley, Christopher P
Faraone, Stephen V
author_sort Zhang-James, Yanli
collection PubMed
description The global pandemic of coronavirus disease 2019 (COVID-19) has killed almost two million people worldwide and over 400 thousand in the United States (US). As the pandemic evolves, informed policy-making and strategic resource allocation relies on accurate forecasts. To predict the spread of the virus within US counties, we curated an array of county-level demographic and COVID-19-relevant health risk factors. In combination with the county-level case and death numbers curated by John Hopkins university, we developed a forecasting model using deep learning (DL). We implemented an autoencoder-based Seq2Seq model with gated recurrent units (GRUs) in the deep recurrent layers. We trained the model to predict future incident cases, deaths and the reproductive number, R. For most counties, it makes accurate predictions of new incident cases, deaths and R values, up to 30 days in the future. Our framework can also be used to predict other targets that are useful indices for policymaking, for example hospitalization or the occupancy of intensive care units. Our DL framework is publicly available on GitHub and can be adapted for other indices of the COVID-19 spread. We hope that our forecasts and model can help local governments in the continued fight against COVID-19.
format Online
Article
Text
id pubmed-8077584
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-80775842021-04-28 A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties Zhang-James, Yanli Hess, Jonathan Salekin, Asif Wang, Dongliang Chen, Samuel Winkelstein, Peter Morley, Christopher P Faraone, Stephen V medRxiv Article The global pandemic of coronavirus disease 2019 (COVID-19) has killed almost two million people worldwide and over 400 thousand in the United States (US). As the pandemic evolves, informed policy-making and strategic resource allocation relies on accurate forecasts. To predict the spread of the virus within US counties, we curated an array of county-level demographic and COVID-19-relevant health risk factors. In combination with the county-level case and death numbers curated by John Hopkins university, we developed a forecasting model using deep learning (DL). We implemented an autoencoder-based Seq2Seq model with gated recurrent units (GRUs) in the deep recurrent layers. We trained the model to predict future incident cases, deaths and the reproductive number, R. For most counties, it makes accurate predictions of new incident cases, deaths and R values, up to 30 days in the future. Our framework can also be used to predict other targets that are useful indices for policymaking, for example hospitalization or the occupancy of intensive care units. Our DL framework is publicly available on GitHub and can be adapted for other indices of the COVID-19 spread. We hope that our forecasts and model can help local governments in the continued fight against COVID-19. Cold Spring Harbor Laboratory 2021-04-20 /pmc/articles/PMC8077584/ /pubmed/33907761 http://dx.doi.org/10.1101/2021.04.14.21255507 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Zhang-James, Yanli
Hess, Jonathan
Salekin, Asif
Wang, Dongliang
Chen, Samuel
Winkelstein, Peter
Morley, Christopher P
Faraone, Stephen V
A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties
title A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties
title_full A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties
title_fullStr A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties
title_full_unstemmed A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties
title_short A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties
title_sort seq2seq model to forecast the covid-19 cases, deaths and reproductive r numbers in us counties
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8077584/
https://www.ncbi.nlm.nih.gov/pubmed/33907761
http://dx.doi.org/10.1101/2021.04.14.21255507
work_keys_str_mv AT zhangjamesyanli aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT hessjonathan aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT salekinasif aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT wangdongliang aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT chensamuel aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT winkelsteinpeter aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT morleychristopherp aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT faraonestephenv aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT zhangjamesyanli seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT hessjonathan seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT salekinasif seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT wangdongliang seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT chensamuel seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT winkelsteinpeter seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT morleychristopherp seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties
AT faraonestephenv seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties