Cargando…

On the Relation between Prediction and Imputation Accuracy under Missing Covariates

Missing covariates in regression or classification problems can prohibit the direct use of advanced tools for further analysis. Recent research has realized an increasing trend towards the use of modern Machine-Learning algorithms for imputation. This originates from their capability of showing favo...

Descripción completa

Detalles Bibliográficos
Autores principales: Ramosaj, Burim, Tulowietzki, Justus, Pauly, Markus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8947649/
https://www.ncbi.nlm.nih.gov/pubmed/35327897
http://dx.doi.org/10.3390/e24030386
_version_ 1784674489864290304
author Ramosaj, Burim
Tulowietzki, Justus
Pauly, Markus
author_facet Ramosaj, Burim
Tulowietzki, Justus
Pauly, Markus
author_sort Ramosaj, Burim
collection PubMed
description Missing covariates in regression or classification problems can prohibit the direct use of advanced tools for further analysis. Recent research has realized an increasing trend towards the use of modern Machine-Learning algorithms for imputation. This originates from their capability of showing favorable prediction accuracy in different learning problems. In this work, we analyze through simulation the interaction between imputation accuracy and prediction accuracy in regression learning problems with missing covariates when Machine-Learning-based methods for both imputation and prediction are used. We see that even a slight decrease in imputation accuracy can seriously affect the prediction accuracy. In addition, we explore imputation performance when using statistical inference procedures in prediction settings, such as the coverage rates of (valid) prediction intervals. Our analysis is based on empirical datasets provided by the UCI Machine Learning repository and an extensive simulation study.
format Online
Article
Text
id pubmed-8947649
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-89476492022-03-25 On the Relation between Prediction and Imputation Accuracy under Missing Covariates Ramosaj, Burim Tulowietzki, Justus Pauly, Markus Entropy (Basel) Article Missing covariates in regression or classification problems can prohibit the direct use of advanced tools for further analysis. Recent research has realized an increasing trend towards the use of modern Machine-Learning algorithms for imputation. This originates from their capability of showing favorable prediction accuracy in different learning problems. In this work, we analyze through simulation the interaction between imputation accuracy and prediction accuracy in regression learning problems with missing covariates when Machine-Learning-based methods for both imputation and prediction are used. We see that even a slight decrease in imputation accuracy can seriously affect the prediction accuracy. In addition, we explore imputation performance when using statistical inference procedures in prediction settings, such as the coverage rates of (valid) prediction intervals. Our analysis is based on empirical datasets provided by the UCI Machine Learning repository and an extensive simulation study. MDPI 2022-03-09 /pmc/articles/PMC8947649/ /pubmed/35327897 http://dx.doi.org/10.3390/e24030386 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ramosaj, Burim
Tulowietzki, Justus
Pauly, Markus
On the Relation between Prediction and Imputation Accuracy under Missing Covariates
title On the Relation between Prediction and Imputation Accuracy under Missing Covariates
title_full On the Relation between Prediction and Imputation Accuracy under Missing Covariates
title_fullStr On the Relation between Prediction and Imputation Accuracy under Missing Covariates
title_full_unstemmed On the Relation between Prediction and Imputation Accuracy under Missing Covariates
title_short On the Relation between Prediction and Imputation Accuracy under Missing Covariates
title_sort on the relation between prediction and imputation accuracy under missing covariates
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8947649/
https://www.ncbi.nlm.nih.gov/pubmed/35327897
http://dx.doi.org/10.3390/e24030386
work_keys_str_mv AT ramosajburim ontherelationbetweenpredictionandimputationaccuracyundermissingcovariates
AT tulowietzkijustus ontherelationbetweenpredictionandimputationaccuracyundermissingcovariates
AT paulymarkus ontherelationbetweenpredictionandimputationaccuracyundermissingcovariates