Cargando…
Machine learning models for the prediction of the SEIRD variables for the COVID-19 pandemic based on a deep dependence analysis of variables
The SEIRD (Susceptible, Exposed, Infected, Recovered, and Dead) model is a mathematical model based on dynamic equations; widely used for characterization of the COVID-19 pandemic. In this paper, a different approach has been discussed, which is the development of predictive models for the SEIRD var...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Ltd.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8142792/ https://www.ncbi.nlm.nih.gov/pubmed/34052570 http://dx.doi.org/10.1016/j.compbiomed.2021.104500 |
_version_ | 1783696624311074816 |
---|---|
author | Quintero, Yullis Ardila, Douglas Camargo, Edgar Rivas, Francklin Aguilar, Jose |
author_facet | Quintero, Yullis Ardila, Douglas Camargo, Edgar Rivas, Francklin Aguilar, Jose |
author_sort | Quintero, Yullis |
collection | PubMed |
description | The SEIRD (Susceptible, Exposed, Infected, Recovered, and Dead) model is a mathematical model based on dynamic equations; widely used for characterization of the COVID-19 pandemic. In this paper, a different approach has been discussed, which is the development of predictive models for the SEIRD variables that have been based on the historical data collected, and the context variables to where this model has been applied to. Particularly, the context variables examined in this paper include total population, number of people over 65 years old, poverty index, morbidity rates, average age, and population density. For the construction of the SEIRD predictive models, this study encompasses a deep analysis of the dependence of these variables and also, their relationship with the context variables. Hence, before the development of predictive models using machine learning techniques, a methodology to analyze the interdependence of the SEIRD variables has been proposed. The dependence with the context variables is also discussed; to avoid the curse of dimensionality and multicollinearity problems, leading to better results and the reduction of the computational cost. Finally, several prediction models based on varied machine learning techniques and inputs are considered, these include temporal interdependence, temporal intra-dependence, and dependence with context variables. Each of the predictive models has been studied, as well as their quality of prediction. This paper focuses on the analysis of the quality of this approach, applied in Colombia, obtaining the results about the performance of the predictive models for the SEIRD variables. The results are very encouraging since the values obtained with the quality metrics are quite good for different prediction horizons. |
format | Online Article Text |
id | pubmed-8142792 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-81427922021-05-25 Machine learning models for the prediction of the SEIRD variables for the COVID-19 pandemic based on a deep dependence analysis of variables Quintero, Yullis Ardila, Douglas Camargo, Edgar Rivas, Francklin Aguilar, Jose Comput Biol Med Article The SEIRD (Susceptible, Exposed, Infected, Recovered, and Dead) model is a mathematical model based on dynamic equations; widely used for characterization of the COVID-19 pandemic. In this paper, a different approach has been discussed, which is the development of predictive models for the SEIRD variables that have been based on the historical data collected, and the context variables to where this model has been applied to. Particularly, the context variables examined in this paper include total population, number of people over 65 years old, poverty index, morbidity rates, average age, and population density. For the construction of the SEIRD predictive models, this study encompasses a deep analysis of the dependence of these variables and also, their relationship with the context variables. Hence, before the development of predictive models using machine learning techniques, a methodology to analyze the interdependence of the SEIRD variables has been proposed. The dependence with the context variables is also discussed; to avoid the curse of dimensionality and multicollinearity problems, leading to better results and the reduction of the computational cost. Finally, several prediction models based on varied machine learning techniques and inputs are considered, these include temporal interdependence, temporal intra-dependence, and dependence with context variables. Each of the predictive models has been studied, as well as their quality of prediction. This paper focuses on the analysis of the quality of this approach, applied in Colombia, obtaining the results about the performance of the predictive models for the SEIRD variables. The results are very encouraging since the values obtained with the quality metrics are quite good for different prediction horizons. Elsevier Ltd. 2021-07 2021-05-24 /pmc/articles/PMC8142792/ /pubmed/34052570 http://dx.doi.org/10.1016/j.compbiomed.2021.104500 Text en © 2021 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Quintero, Yullis Ardila, Douglas Camargo, Edgar Rivas, Francklin Aguilar, Jose Machine learning models for the prediction of the SEIRD variables for the COVID-19 pandemic based on a deep dependence analysis of variables |
title | Machine learning models for the prediction of the SEIRD variables for the COVID-19 pandemic based on a deep dependence analysis of variables |
title_full | Machine learning models for the prediction of the SEIRD variables for the COVID-19 pandemic based on a deep dependence analysis of variables |
title_fullStr | Machine learning models for the prediction of the SEIRD variables for the COVID-19 pandemic based on a deep dependence analysis of variables |
title_full_unstemmed | Machine learning models for the prediction of the SEIRD variables for the COVID-19 pandemic based on a deep dependence analysis of variables |
title_short | Machine learning models for the prediction of the SEIRD variables for the COVID-19 pandemic based on a deep dependence analysis of variables |
title_sort | machine learning models for the prediction of the seird variables for the covid-19 pandemic based on a deep dependence analysis of variables |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8142792/ https://www.ncbi.nlm.nih.gov/pubmed/34052570 http://dx.doi.org/10.1016/j.compbiomed.2021.104500 |
work_keys_str_mv | AT quinteroyullis machinelearningmodelsforthepredictionoftheseirdvariablesforthecovid19pandemicbasedonadeepdependenceanalysisofvariables AT ardiladouglas machinelearningmodelsforthepredictionoftheseirdvariablesforthecovid19pandemicbasedonadeepdependenceanalysisofvariables AT camargoedgar machinelearningmodelsforthepredictionoftheseirdvariablesforthecovid19pandemicbasedonadeepdependenceanalysisofvariables AT rivasfrancklin machinelearningmodelsforthepredictionoftheseirdvariablesforthecovid19pandemicbasedonadeepdependenceanalysisofvariables AT aguilarjose machinelearningmodelsforthepredictionoftheseirdvariablesforthecovid19pandemicbasedonadeepdependenceanalysisofvariables |