Cargando…
A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties
The global pandemic of coronavirus disease 2019 (COVID-19) has killed almost two million people worldwide and over 400 thousand in the United States (US). As the pandemic evolves, informed policy-making and strategic resource allocation relies on accurate forecasts. To predict the spread of the viru...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8077584/ https://www.ncbi.nlm.nih.gov/pubmed/33907761 http://dx.doi.org/10.1101/2021.04.14.21255507 |
_version_ | 1783684906176479232 |
---|---|
author | Zhang-James, Yanli Hess, Jonathan Salekin, Asif Wang, Dongliang Chen, Samuel Winkelstein, Peter Morley, Christopher P Faraone, Stephen V |
author_facet | Zhang-James, Yanli Hess, Jonathan Salekin, Asif Wang, Dongliang Chen, Samuel Winkelstein, Peter Morley, Christopher P Faraone, Stephen V |
author_sort | Zhang-James, Yanli |
collection | PubMed |
description | The global pandemic of coronavirus disease 2019 (COVID-19) has killed almost two million people worldwide and over 400 thousand in the United States (US). As the pandemic evolves, informed policy-making and strategic resource allocation relies on accurate forecasts. To predict the spread of the virus within US counties, we curated an array of county-level demographic and COVID-19-relevant health risk factors. In combination with the county-level case and death numbers curated by John Hopkins university, we developed a forecasting model using deep learning (DL). We implemented an autoencoder-based Seq2Seq model with gated recurrent units (GRUs) in the deep recurrent layers. We trained the model to predict future incident cases, deaths and the reproductive number, R. For most counties, it makes accurate predictions of new incident cases, deaths and R values, up to 30 days in the future. Our framework can also be used to predict other targets that are useful indices for policymaking, for example hospitalization or the occupancy of intensive care units. Our DL framework is publicly available on GitHub and can be adapted for other indices of the COVID-19 spread. We hope that our forecasts and model can help local governments in the continued fight against COVID-19. |
format | Online Article Text |
id | pubmed-8077584 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-80775842021-04-28 A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties Zhang-James, Yanli Hess, Jonathan Salekin, Asif Wang, Dongliang Chen, Samuel Winkelstein, Peter Morley, Christopher P Faraone, Stephen V medRxiv Article The global pandemic of coronavirus disease 2019 (COVID-19) has killed almost two million people worldwide and over 400 thousand in the United States (US). As the pandemic evolves, informed policy-making and strategic resource allocation relies on accurate forecasts. To predict the spread of the virus within US counties, we curated an array of county-level demographic and COVID-19-relevant health risk factors. In combination with the county-level case and death numbers curated by John Hopkins university, we developed a forecasting model using deep learning (DL). We implemented an autoencoder-based Seq2Seq model with gated recurrent units (GRUs) in the deep recurrent layers. We trained the model to predict future incident cases, deaths and the reproductive number, R. For most counties, it makes accurate predictions of new incident cases, deaths and R values, up to 30 days in the future. Our framework can also be used to predict other targets that are useful indices for policymaking, for example hospitalization or the occupancy of intensive care units. Our DL framework is publicly available on GitHub and can be adapted for other indices of the COVID-19 spread. We hope that our forecasts and model can help local governments in the continued fight against COVID-19. Cold Spring Harbor Laboratory 2021-04-20 /pmc/articles/PMC8077584/ /pubmed/33907761 http://dx.doi.org/10.1101/2021.04.14.21255507 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Zhang-James, Yanli Hess, Jonathan Salekin, Asif Wang, Dongliang Chen, Samuel Winkelstein, Peter Morley, Christopher P Faraone, Stephen V A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties |
title | A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties |
title_full | A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties |
title_fullStr | A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties |
title_full_unstemmed | A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties |
title_short | A seq2seq model to forecast the COVID-19 cases, deaths and reproductive R numbers in US counties |
title_sort | seq2seq model to forecast the covid-19 cases, deaths and reproductive r numbers in us counties |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8077584/ https://www.ncbi.nlm.nih.gov/pubmed/33907761 http://dx.doi.org/10.1101/2021.04.14.21255507 |
work_keys_str_mv | AT zhangjamesyanli aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT hessjonathan aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT salekinasif aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT wangdongliang aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT chensamuel aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT winkelsteinpeter aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT morleychristopherp aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT faraonestephenv aseq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT zhangjamesyanli seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT hessjonathan seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT salekinasif seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT wangdongliang seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT chensamuel seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT winkelsteinpeter seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT morleychristopherp seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties AT faraonestephenv seq2seqmodeltoforecastthecovid19casesdeathsandreproductivernumbersinuscounties |