Cargando…

Uncertain imputation for time-series forecasting: Application to COVID-19 daily mortality prediction

The object of this study is to put forward uncertainty modeling associated with missing time series data imputation in a predictive context. We propose three imputation methods associated with uncertainty modeling. These methods are evaluated on a COVID-19 dataset out of which some values have been...

Descripción completa

Detalles Bibliográficos
Autores principales: Elimam, Rayane, Sutton-Charani, Nicolas, Perrey, Stéphane, Montmain, Jacky
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931261/
https://www.ncbi.nlm.nih.gov/pubmed/36812528
http://dx.doi.org/10.1371/journal.pdig.0000115
_version_ 1784889210233159680
author Elimam, Rayane
Sutton-Charani, Nicolas
Perrey, Stéphane
Montmain, Jacky
author_facet Elimam, Rayane
Sutton-Charani, Nicolas
Perrey, Stéphane
Montmain, Jacky
author_sort Elimam, Rayane
collection PubMed
description The object of this study is to put forward uncertainty modeling associated with missing time series data imputation in a predictive context. We propose three imputation methods associated with uncertainty modeling. These methods are evaluated on a COVID-19 dataset out of which some values have been randomly removed. The dataset contains the numbers of daily COVID-19 confirmed diagnoses (“new cases”) and daily deaths (“new deaths”) recorded since the start of the pandemic up to July 2021. The considered task is to predict the number of new deaths 7 days in advance. The more values are missing, the higher the imputation impact is on the predictive performances. The Evidential K-Nearest Neighbors (EKNN) algorithm is used for its ability to take into account labels uncertainty. Experiments are provided to measure the benefits of the label uncertainty models. Results show the positive impact of uncertainty models on imputation performances, especially in a noisy context where the number of missing values is high.
format Online
Article
Text
id pubmed-9931261
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-99312612023-02-16 Uncertain imputation for time-series forecasting: Application to COVID-19 daily mortality prediction Elimam, Rayane Sutton-Charani, Nicolas Perrey, Stéphane Montmain, Jacky PLOS Digit Health Research Article The object of this study is to put forward uncertainty modeling associated with missing time series data imputation in a predictive context. We propose three imputation methods associated with uncertainty modeling. These methods are evaluated on a COVID-19 dataset out of which some values have been randomly removed. The dataset contains the numbers of daily COVID-19 confirmed diagnoses (“new cases”) and daily deaths (“new deaths”) recorded since the start of the pandemic up to July 2021. The considered task is to predict the number of new deaths 7 days in advance. The more values are missing, the higher the imputation impact is on the predictive performances. The Evidential K-Nearest Neighbors (EKNN) algorithm is used for its ability to take into account labels uncertainty. Experiments are provided to measure the benefits of the label uncertainty models. Results show the positive impact of uncertainty models on imputation performances, especially in a noisy context where the number of missing values is high. Public Library of Science 2022-10-25 /pmc/articles/PMC9931261/ /pubmed/36812528 http://dx.doi.org/10.1371/journal.pdig.0000115 Text en © 2022 Elimam et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Elimam, Rayane
Sutton-Charani, Nicolas
Perrey, Stéphane
Montmain, Jacky
Uncertain imputation for time-series forecasting: Application to COVID-19 daily mortality prediction
title Uncertain imputation for time-series forecasting: Application to COVID-19 daily mortality prediction
title_full Uncertain imputation for time-series forecasting: Application to COVID-19 daily mortality prediction
title_fullStr Uncertain imputation for time-series forecasting: Application to COVID-19 daily mortality prediction
title_full_unstemmed Uncertain imputation for time-series forecasting: Application to COVID-19 daily mortality prediction
title_short Uncertain imputation for time-series forecasting: Application to COVID-19 daily mortality prediction
title_sort uncertain imputation for time-series forecasting: application to covid-19 daily mortality prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931261/
https://www.ncbi.nlm.nih.gov/pubmed/36812528
http://dx.doi.org/10.1371/journal.pdig.0000115
work_keys_str_mv AT elimamrayane uncertainimputationfortimeseriesforecastingapplicationtocovid19dailymortalityprediction
AT suttoncharaninicolas uncertainimputationfortimeseriesforecastingapplicationtocovid19dailymortalityprediction
AT perreystephane uncertainimputationfortimeseriesforecastingapplicationtocovid19dailymortalityprediction
AT montmainjacky uncertainimputationfortimeseriesforecastingapplicationtocovid19dailymortalityprediction