Cargando…

A machine learning model for nowcasting epidemic incidence

Due to delay in reporting, the daily national and statewide COVID-19 incidence counts are often unreliable and need to be estimated from recent data. This process is known in economics as nowcasting. We describe in this paper a simple random forest statistical model for nowcasting the COVID-19 daily...

Descripción completa

Detalles Bibliográficos
Autores principales: Sahai, Saumya Yashmohini, Gurukar, Saket, KhudaBukhsh, Wasiur R., Parthasarathy, Srinivasan, Rempała, Grzegorz A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8635898/
https://www.ncbi.nlm.nih.gov/pubmed/34848217
http://dx.doi.org/10.1016/j.mbs.2021.108677
_version_ 1784608418187706368
author Sahai, Saumya Yashmohini
Gurukar, Saket
KhudaBukhsh, Wasiur R.
Parthasarathy, Srinivasan
Rempała, Grzegorz A.
author_facet Sahai, Saumya Yashmohini
Gurukar, Saket
KhudaBukhsh, Wasiur R.
Parthasarathy, Srinivasan
Rempała, Grzegorz A.
author_sort Sahai, Saumya Yashmohini
collection PubMed
description Due to delay in reporting, the daily national and statewide COVID-19 incidence counts are often unreliable and need to be estimated from recent data. This process is known in economics as nowcasting. We describe in this paper a simple random forest statistical model for nowcasting the COVID-19 daily new infection counts based on historic data along with a set of simple covariates, such as the currently reported infection counts, day of the week, and time since first reporting. We apply the model to adjust the daily infection counts in Ohio, and show that the predictions from this simple data-driven method compare favorably both in quality and computational burden to those obtained from the state-of-the-art hierarchical Bayesian model employing a complex statistical algorithm. The interactive notebook for performing nowcasting is available online at https://tinyurl.com/simpleMLnowcasting.
format Online
Article
Text
id pubmed-8635898
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier Inc.
record_format MEDLINE/PubMed
spelling pubmed-86358982021-12-02 A machine learning model for nowcasting epidemic incidence Sahai, Saumya Yashmohini Gurukar, Saket KhudaBukhsh, Wasiur R. Parthasarathy, Srinivasan Rempała, Grzegorz A. Math Biosci Original Research Article Due to delay in reporting, the daily national and statewide COVID-19 incidence counts are often unreliable and need to be estimated from recent data. This process is known in economics as nowcasting. We describe in this paper a simple random forest statistical model for nowcasting the COVID-19 daily new infection counts based on historic data along with a set of simple covariates, such as the currently reported infection counts, day of the week, and time since first reporting. We apply the model to adjust the daily infection counts in Ohio, and show that the predictions from this simple data-driven method compare favorably both in quality and computational burden to those obtained from the state-of-the-art hierarchical Bayesian model employing a complex statistical algorithm. The interactive notebook for performing nowcasting is available online at https://tinyurl.com/simpleMLnowcasting. Elsevier Inc. 2022-01 2021-11-27 /pmc/articles/PMC8635898/ /pubmed/34848217 http://dx.doi.org/10.1016/j.mbs.2021.108677 Text en © 2021 Elsevier Inc. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Original Research Article
Sahai, Saumya Yashmohini
Gurukar, Saket
KhudaBukhsh, Wasiur R.
Parthasarathy, Srinivasan
Rempała, Grzegorz A.
A machine learning model for nowcasting epidemic incidence
title A machine learning model for nowcasting epidemic incidence
title_full A machine learning model for nowcasting epidemic incidence
title_fullStr A machine learning model for nowcasting epidemic incidence
title_full_unstemmed A machine learning model for nowcasting epidemic incidence
title_short A machine learning model for nowcasting epidemic incidence
title_sort machine learning model for nowcasting epidemic incidence
topic Original Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8635898/
https://www.ncbi.nlm.nih.gov/pubmed/34848217
http://dx.doi.org/10.1016/j.mbs.2021.108677
work_keys_str_mv AT sahaisaumyayashmohini amachinelearningmodelfornowcastingepidemicincidence
AT gurukarsaket amachinelearningmodelfornowcastingepidemicincidence
AT khudabukhshwasiurr amachinelearningmodelfornowcastingepidemicincidence
AT parthasarathysrinivasan amachinelearningmodelfornowcastingepidemicincidence
AT rempałagrzegorza amachinelearningmodelfornowcastingepidemicincidence
AT sahaisaumyayashmohini machinelearningmodelfornowcastingepidemicincidence
AT gurukarsaket machinelearningmodelfornowcastingepidemicincidence
AT khudabukhshwasiurr machinelearningmodelfornowcastingepidemicincidence
AT parthasarathysrinivasan machinelearningmodelfornowcastingepidemicincidence
AT rempałagrzegorza machinelearningmodelfornowcastingepidemicincidence