Cargando…

Cluster analysis and forecasting of viruses incidence growth curves: Application to SARS-CoV-2

The sanitary emergency caused by COVID-19 has compromised countries and generated a worldwide health and economic crisis. To provide support to the countries’ responses, numerous lines of research have been developed. The spotlight was put on effectively and rapidly diagnosing and predicting the evo...

Descripción completa

Detalles Bibliográficos
Autores principales: Díaz-Lozano, Miguel, Guijo-Rubio, David, Gutiérrez, Pedro Antonio, Hervás-Martínez, César
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Author(s). Published by Elsevier Ltd. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10108563/
https://www.ncbi.nlm.nih.gov/pubmed/37090447
http://dx.doi.org/10.1016/j.eswa.2023.120103
_version_ 1785026874580140032
author Díaz-Lozano, Miguel
Guijo-Rubio, David
Gutiérrez, Pedro Antonio
Hervás-Martínez, César
author_facet Díaz-Lozano, Miguel
Guijo-Rubio, David
Gutiérrez, Pedro Antonio
Hervás-Martínez, César
author_sort Díaz-Lozano, Miguel
collection PubMed
description The sanitary emergency caused by COVID-19 has compromised countries and generated a worldwide health and economic crisis. To provide support to the countries’ responses, numerous lines of research have been developed. The spotlight was put on effectively and rapidly diagnosing and predicting the evolution of the pandemic, one of the most challenging problems of the past months. This work contributes to the existing literature by developing a two-step methodology to analyze the transmission rate, designing models applied to territories with similar pandemic behavior characteristics. Virus transmission is considered as bacterial growth curves to understand the spread of the virus and to make predictions about its future evolution. Hence, an analytical clustering procedure is first applied to create groups of locations where the virus transmission rate behaved similarly in the different outbreaks. A curve decomposition process based on an iterative polynomial process is then applied, obtaining meaningful forecasting features. Information of the territories belonging to the same cluster is merged to build models capable of simultaneously predicting the 14-day incidence in several locations using Evolutionary Artificial Neural Networks. The methodology is applied to Andalusia (Spain), although it is applicable to any region across the world. Individual models trained for a specific territory are carried out for comparison purposes. The results demonstrate that this methodology achieves statistically similar, or even better, performance for most of the locations. In addition to being extremely competitive, the main advantage of the proposal lies in its complexity cost reduction. The total number of parameters to be estimated is reduced up to 93.51% for the short term and 93.31% for the mid-term forecasting, respectively. Moreover, the number of required models is reduced by 73.53% and 58.82% for the short- and mid-term forecasting horizons.
format Online
Article
Text
id pubmed-10108563
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Author(s). Published by Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-101085632023-04-18 Cluster analysis and forecasting of viruses incidence growth curves: Application to SARS-CoV-2 Díaz-Lozano, Miguel Guijo-Rubio, David Gutiérrez, Pedro Antonio Hervás-Martínez, César Expert Syst Appl Article The sanitary emergency caused by COVID-19 has compromised countries and generated a worldwide health and economic crisis. To provide support to the countries’ responses, numerous lines of research have been developed. The spotlight was put on effectively and rapidly diagnosing and predicting the evolution of the pandemic, one of the most challenging problems of the past months. This work contributes to the existing literature by developing a two-step methodology to analyze the transmission rate, designing models applied to territories with similar pandemic behavior characteristics. Virus transmission is considered as bacterial growth curves to understand the spread of the virus and to make predictions about its future evolution. Hence, an analytical clustering procedure is first applied to create groups of locations where the virus transmission rate behaved similarly in the different outbreaks. A curve decomposition process based on an iterative polynomial process is then applied, obtaining meaningful forecasting features. Information of the territories belonging to the same cluster is merged to build models capable of simultaneously predicting the 14-day incidence in several locations using Evolutionary Artificial Neural Networks. The methodology is applied to Andalusia (Spain), although it is applicable to any region across the world. Individual models trained for a specific territory are carried out for comparison purposes. The results demonstrate that this methodology achieves statistically similar, or even better, performance for most of the locations. In addition to being extremely competitive, the main advantage of the proposal lies in its complexity cost reduction. The total number of parameters to be estimated is reduced up to 93.51% for the short term and 93.31% for the mid-term forecasting, respectively. Moreover, the number of required models is reduced by 73.53% and 58.82% for the short- and mid-term forecasting horizons. The Author(s). Published by Elsevier Ltd. 2023-09-01 2023-04-17 /pmc/articles/PMC10108563/ /pubmed/37090447 http://dx.doi.org/10.1016/j.eswa.2023.120103 Text en © 2023 The Author(s) Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Díaz-Lozano, Miguel
Guijo-Rubio, David
Gutiérrez, Pedro Antonio
Hervás-Martínez, César
Cluster analysis and forecasting of viruses incidence growth curves: Application to SARS-CoV-2
title Cluster analysis and forecasting of viruses incidence growth curves: Application to SARS-CoV-2
title_full Cluster analysis and forecasting of viruses incidence growth curves: Application to SARS-CoV-2
title_fullStr Cluster analysis and forecasting of viruses incidence growth curves: Application to SARS-CoV-2
title_full_unstemmed Cluster analysis and forecasting of viruses incidence growth curves: Application to SARS-CoV-2
title_short Cluster analysis and forecasting of viruses incidence growth curves: Application to SARS-CoV-2
title_sort cluster analysis and forecasting of viruses incidence growth curves: application to sars-cov-2
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10108563/
https://www.ncbi.nlm.nih.gov/pubmed/37090447
http://dx.doi.org/10.1016/j.eswa.2023.120103
work_keys_str_mv AT diazlozanomiguel clusteranalysisandforecastingofvirusesincidencegrowthcurvesapplicationtosarscov2
AT guijorubiodavid clusteranalysisandforecastingofvirusesincidencegrowthcurvesapplicationtosarscov2
AT gutierrezpedroantonio clusteranalysisandforecastingofvirusesincidencegrowthcurvesapplicationtosarscov2
AT hervasmartinezcesar clusteranalysisandforecastingofvirusesincidencegrowthcurvesapplicationtosarscov2