Cargando…
On the Relation between Prediction and Imputation Accuracy under Missing Covariates
Missing covariates in regression or classification problems can prohibit the direct use of advanced tools for further analysis. Recent research has realized an increasing trend towards the use of modern Machine-Learning algorithms for imputation. This originates from their capability of showing favo...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8947649/ https://www.ncbi.nlm.nih.gov/pubmed/35327897 http://dx.doi.org/10.3390/e24030386 |
_version_ | 1784674489864290304 |
---|---|
author | Ramosaj, Burim Tulowietzki, Justus Pauly, Markus |
author_facet | Ramosaj, Burim Tulowietzki, Justus Pauly, Markus |
author_sort | Ramosaj, Burim |
collection | PubMed |
description | Missing covariates in regression or classification problems can prohibit the direct use of advanced tools for further analysis. Recent research has realized an increasing trend towards the use of modern Machine-Learning algorithms for imputation. This originates from their capability of showing favorable prediction accuracy in different learning problems. In this work, we analyze through simulation the interaction between imputation accuracy and prediction accuracy in regression learning problems with missing covariates when Machine-Learning-based methods for both imputation and prediction are used. We see that even a slight decrease in imputation accuracy can seriously affect the prediction accuracy. In addition, we explore imputation performance when using statistical inference procedures in prediction settings, such as the coverage rates of (valid) prediction intervals. Our analysis is based on empirical datasets provided by the UCI Machine Learning repository and an extensive simulation study. |
format | Online Article Text |
id | pubmed-8947649 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-89476492022-03-25 On the Relation between Prediction and Imputation Accuracy under Missing Covariates Ramosaj, Burim Tulowietzki, Justus Pauly, Markus Entropy (Basel) Article Missing covariates in regression or classification problems can prohibit the direct use of advanced tools for further analysis. Recent research has realized an increasing trend towards the use of modern Machine-Learning algorithms for imputation. This originates from their capability of showing favorable prediction accuracy in different learning problems. In this work, we analyze through simulation the interaction between imputation accuracy and prediction accuracy in regression learning problems with missing covariates when Machine-Learning-based methods for both imputation and prediction are used. We see that even a slight decrease in imputation accuracy can seriously affect the prediction accuracy. In addition, we explore imputation performance when using statistical inference procedures in prediction settings, such as the coverage rates of (valid) prediction intervals. Our analysis is based on empirical datasets provided by the UCI Machine Learning repository and an extensive simulation study. MDPI 2022-03-09 /pmc/articles/PMC8947649/ /pubmed/35327897 http://dx.doi.org/10.3390/e24030386 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ramosaj, Burim Tulowietzki, Justus Pauly, Markus On the Relation between Prediction and Imputation Accuracy under Missing Covariates |
title | On the Relation between Prediction and Imputation Accuracy under Missing Covariates |
title_full | On the Relation between Prediction and Imputation Accuracy under Missing Covariates |
title_fullStr | On the Relation between Prediction and Imputation Accuracy under Missing Covariates |
title_full_unstemmed | On the Relation between Prediction and Imputation Accuracy under Missing Covariates |
title_short | On the Relation between Prediction and Imputation Accuracy under Missing Covariates |
title_sort | on the relation between prediction and imputation accuracy under missing covariates |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8947649/ https://www.ncbi.nlm.nih.gov/pubmed/35327897 http://dx.doi.org/10.3390/e24030386 |
work_keys_str_mv | AT ramosajburim ontherelationbetweenpredictionandimputationaccuracyundermissingcovariates AT tulowietzkijustus ontherelationbetweenpredictionandimputationaccuracyundermissingcovariates AT paulymarkus ontherelationbetweenpredictionandimputationaccuracyundermissingcovariates |