Cargando…

Assessment of autoregressive integrated moving average (ARIMA), generalized linear autoregressive moving average (GLARMA), and random forest (RF) time series regression models for predicting influenza A virus frequency in swine in Ontario, Canada

Influenza A virus commonly circulating in swine (IAV-S) is characterized by large genetic and antigenic diversity and, thus, improvements in different aspects of IAV-S surveillance are needed to achieve desirable goals of surveillance such as to establish the capacity to forecast with the greatest a...

Descripción completa

Detalles Bibliográficos
Autores principales: Petukhova, Tatiana, Ojkic, Davor, McEwen, Beverly, Deardon, Rob, Poljak, Zvonimir
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5983852/
https://www.ncbi.nlm.nih.gov/pubmed/29856881
http://dx.doi.org/10.1371/journal.pone.0198313
_version_ 1783328514762604544
author Petukhova, Tatiana
Ojkic, Davor
McEwen, Beverly
Deardon, Rob
Poljak, Zvonimir
author_facet Petukhova, Tatiana
Ojkic, Davor
McEwen, Beverly
Deardon, Rob
Poljak, Zvonimir
author_sort Petukhova, Tatiana
collection PubMed
description Influenza A virus commonly circulating in swine (IAV-S) is characterized by large genetic and antigenic diversity and, thus, improvements in different aspects of IAV-S surveillance are needed to achieve desirable goals of surveillance such as to establish the capacity to forecast with the greatest accuracy the number of influenza cases likely to arise. Advancements in modeling approaches provide the opportunity to use different models for surveillance. However, in order to make improvements in surveillance, it is necessary to assess the predictive ability of such models. This study compares the sensitivity and predictive accuracy of the autoregressive integrated moving average (ARIMA) model, the generalized linear autoregressive moving average (GLARMA) model, and the random forest (RF) model with respect to the frequency of influenza A virus (IAV) in Ontario swine. Diagnostic data on IAV submissions in Ontario swine between 2007 and 2015 were obtained from the Animal Health Laboratory (University of Guelph, Guelph, ON, Canada). Each modeling approach was examined for predictive accuracy, evaluated by the root mean square error, the normalized root mean square error, and the model’s ability to anticipate increases and decreases in disease frequency. Likewise, we verified the magnitude of improvement offered by the ARIMA, GLARMA and RF models over a seasonal-naïve method. Using the diagnostic submissions, the occurrence of seasonality and the long-term trend in IAV infections were also investigated. The RF model had the smallest root mean square error in the prospective analysis and tended to predict increases in the number of diagnostic submissions and positive virological submissions at weekly and monthly intervals with a higher degree of sensitivity than the ARIMA and GLARMA models. The number of weekly positive virological submissions is significantly higher in the fall calendar season compared to the summer calendar season. Positive counts at weekly and monthly intervals demonstrated a significant increasing trend. Overall, this study shows that the RF model offers enhanced prediction ability over the ARIMA and GLARMA time series models for predicting the frequency of IAV infections in diagnostic submissions.
format Online
Article
Text
id pubmed-5983852
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-59838522018-06-16 Assessment of autoregressive integrated moving average (ARIMA), generalized linear autoregressive moving average (GLARMA), and random forest (RF) time series regression models for predicting influenza A virus frequency in swine in Ontario, Canada Petukhova, Tatiana Ojkic, Davor McEwen, Beverly Deardon, Rob Poljak, Zvonimir PLoS One Research Article Influenza A virus commonly circulating in swine (IAV-S) is characterized by large genetic and antigenic diversity and, thus, improvements in different aspects of IAV-S surveillance are needed to achieve desirable goals of surveillance such as to establish the capacity to forecast with the greatest accuracy the number of influenza cases likely to arise. Advancements in modeling approaches provide the opportunity to use different models for surveillance. However, in order to make improvements in surveillance, it is necessary to assess the predictive ability of such models. This study compares the sensitivity and predictive accuracy of the autoregressive integrated moving average (ARIMA) model, the generalized linear autoregressive moving average (GLARMA) model, and the random forest (RF) model with respect to the frequency of influenza A virus (IAV) in Ontario swine. Diagnostic data on IAV submissions in Ontario swine between 2007 and 2015 were obtained from the Animal Health Laboratory (University of Guelph, Guelph, ON, Canada). Each modeling approach was examined for predictive accuracy, evaluated by the root mean square error, the normalized root mean square error, and the model’s ability to anticipate increases and decreases in disease frequency. Likewise, we verified the magnitude of improvement offered by the ARIMA, GLARMA and RF models over a seasonal-naïve method. Using the diagnostic submissions, the occurrence of seasonality and the long-term trend in IAV infections were also investigated. The RF model had the smallest root mean square error in the prospective analysis and tended to predict increases in the number of diagnostic submissions and positive virological submissions at weekly and monthly intervals with a higher degree of sensitivity than the ARIMA and GLARMA models. The number of weekly positive virological submissions is significantly higher in the fall calendar season compared to the summer calendar season. Positive counts at weekly and monthly intervals demonstrated a significant increasing trend. Overall, this study shows that the RF model offers enhanced prediction ability over the ARIMA and GLARMA time series models for predicting the frequency of IAV infections in diagnostic submissions. Public Library of Science 2018-06-01 /pmc/articles/PMC5983852/ /pubmed/29856881 http://dx.doi.org/10.1371/journal.pone.0198313 Text en © 2018 Petukhova et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Petukhova, Tatiana
Ojkic, Davor
McEwen, Beverly
Deardon, Rob
Poljak, Zvonimir
Assessment of autoregressive integrated moving average (ARIMA), generalized linear autoregressive moving average (GLARMA), and random forest (RF) time series regression models for predicting influenza A virus frequency in swine in Ontario, Canada
title Assessment of autoregressive integrated moving average (ARIMA), generalized linear autoregressive moving average (GLARMA), and random forest (RF) time series regression models for predicting influenza A virus frequency in swine in Ontario, Canada
title_full Assessment of autoregressive integrated moving average (ARIMA), generalized linear autoregressive moving average (GLARMA), and random forest (RF) time series regression models for predicting influenza A virus frequency in swine in Ontario, Canada
title_fullStr Assessment of autoregressive integrated moving average (ARIMA), generalized linear autoregressive moving average (GLARMA), and random forest (RF) time series regression models for predicting influenza A virus frequency in swine in Ontario, Canada
title_full_unstemmed Assessment of autoregressive integrated moving average (ARIMA), generalized linear autoregressive moving average (GLARMA), and random forest (RF) time series regression models for predicting influenza A virus frequency in swine in Ontario, Canada
title_short Assessment of autoregressive integrated moving average (ARIMA), generalized linear autoregressive moving average (GLARMA), and random forest (RF) time series regression models for predicting influenza A virus frequency in swine in Ontario, Canada
title_sort assessment of autoregressive integrated moving average (arima), generalized linear autoregressive moving average (glarma), and random forest (rf) time series regression models for predicting influenza a virus frequency in swine in ontario, canada
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5983852/
https://www.ncbi.nlm.nih.gov/pubmed/29856881
http://dx.doi.org/10.1371/journal.pone.0198313
work_keys_str_mv AT petukhovatatiana assessmentofautoregressiveintegratedmovingaveragearimageneralizedlinearautoregressivemovingaverageglarmaandrandomforestrftimeseriesregressionmodelsforpredictinginfluenzaavirusfrequencyinswineinontariocanada
AT ojkicdavor assessmentofautoregressiveintegratedmovingaveragearimageneralizedlinearautoregressivemovingaverageglarmaandrandomforestrftimeseriesregressionmodelsforpredictinginfluenzaavirusfrequencyinswineinontariocanada
AT mcewenbeverly assessmentofautoregressiveintegratedmovingaveragearimageneralizedlinearautoregressivemovingaverageglarmaandrandomforestrftimeseriesregressionmodelsforpredictinginfluenzaavirusfrequencyinswineinontariocanada
AT deardonrob assessmentofautoregressiveintegratedmovingaveragearimageneralizedlinearautoregressivemovingaverageglarmaandrandomforestrftimeseriesregressionmodelsforpredictinginfluenzaavirusfrequencyinswineinontariocanada
AT poljakzvonimir assessmentofautoregressiveintegratedmovingaveragearimageneralizedlinearautoregressivemovingaverageglarmaandrandomforestrftimeseriesregressionmodelsforpredictinginfluenzaavirusfrequencyinswineinontariocanada