Cargando…

An interpretable time series machine learning method for varying forecast and nowcast lengths in wastewater-based epidemiology

Wastewater-based epidemiology has emerged as a viable tool for monitoring disease prevalence in a population. This paper details a time series machine learning (TSML) method for predicting COVID-19 cases from wastewater and environmental variables. The TSML method utilizes a number of techniques to...

Descripción completa

Detalles Bibliográficos
Autores principales: Lai, Mallory, Wulff, Shaun S., Cao, Yongtao, Robinson, Timothy J., Rajapaksha, Rasika
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562867/
https://www.ncbi.nlm.nih.gov/pubmed/37822674
http://dx.doi.org/10.1016/j.mex.2023.102382
Descripción
Sumario:Wastewater-based epidemiology has emerged as a viable tool for monitoring disease prevalence in a population. This paper details a time series machine learning (TSML) method for predicting COVID-19 cases from wastewater and environmental variables. The TSML method utilizes a number of techniques to create an interpretable, hypothesis-driven framework for machine learning that can handle different nowcast and forecast lengths. Some of the techniques employed include: • Feature engineering to construct interpretable features, like site-specific lead times, hypothesized to be potential predictors of COVID-19 cases. • Feature selection to identify features with the best predictive performance for the tasks of nowcasting and forecasting. • Prequential evaluation to prevent data leakage while evaluating the performance of the machine learning algorithm.