Cargando…

Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent †

Data are a strategic resource for industrial production, and an efficient data-mining process will increase productivity. However, there exist many missing values in data collected in real life due to various problems. Because the missing data may reduce productivity, missing value imputation is an...

Descripción completa

Detalles Bibliográficos
Autores principales: Pan, Hu, Ye, Zhiwei, He, Qiyi, Yan, Chunyan, Yuan, Jianyu, Lai, Xudong, Su, Jun, Li, Ruihan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9371018/
https://www.ncbi.nlm.nih.gov/pubmed/35957197
http://dx.doi.org/10.3390/s22155645
_version_ 1784767002206797824
author Pan, Hu
Ye, Zhiwei
He, Qiyi
Yan, Chunyan
Yuan, Jianyu
Lai, Xudong
Su, Jun
Li, Ruihan
author_facet Pan, Hu
Ye, Zhiwei
He, Qiyi
Yan, Chunyan
Yuan, Jianyu
Lai, Xudong
Su, Jun
Li, Ruihan
author_sort Pan, Hu
collection PubMed
description Data are a strategic resource for industrial production, and an efficient data-mining process will increase productivity. However, there exist many missing values in data collected in real life due to various problems. Because the missing data may reduce productivity, missing value imputation is an important research topic in data mining. At present, most studies mainly focus on imputation methods for continuous missing data, while a few concentrate on discrete missing data. In this paper, a discrete missing value imputation method based on a multilayer perceptron (MLP) is proposed, which employs a momentum gradient descent algorithm, and some prefilling strategies are utilized to improve the convergence speed of the MLP. To verify the effectiveness of the method, experiments are conducted to compare the classification accuracy with eight common imputation methods, such as the mode, random, hot-deck, KNN, autoencoder, and MLP, under different missing mechanisms and missing proportions. Experimental results verify that the improved MLP model (IMLP) can effectively impute discrete missing values in most situations under three missing patterns.
format Online
Article
Text
id pubmed-9371018
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93710182022-08-12 Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent † Pan, Hu Ye, Zhiwei He, Qiyi Yan, Chunyan Yuan, Jianyu Lai, Xudong Su, Jun Li, Ruihan Sensors (Basel) Article Data are a strategic resource for industrial production, and an efficient data-mining process will increase productivity. However, there exist many missing values in data collected in real life due to various problems. Because the missing data may reduce productivity, missing value imputation is an important research topic in data mining. At present, most studies mainly focus on imputation methods for continuous missing data, while a few concentrate on discrete missing data. In this paper, a discrete missing value imputation method based on a multilayer perceptron (MLP) is proposed, which employs a momentum gradient descent algorithm, and some prefilling strategies are utilized to improve the convergence speed of the MLP. To verify the effectiveness of the method, experiments are conducted to compare the classification accuracy with eight common imputation methods, such as the mode, random, hot-deck, KNN, autoencoder, and MLP, under different missing mechanisms and missing proportions. Experimental results verify that the improved MLP model (IMLP) can effectively impute discrete missing values in most situations under three missing patterns. MDPI 2022-07-28 /pmc/articles/PMC9371018/ /pubmed/35957197 http://dx.doi.org/10.3390/s22155645 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Pan, Hu
Ye, Zhiwei
He, Qiyi
Yan, Chunyan
Yuan, Jianyu
Lai, Xudong
Su, Jun
Li, Ruihan
Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent †
title Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent †
title_full Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent †
title_fullStr Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent †
title_full_unstemmed Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent †
title_short Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent †
title_sort discrete missing data imputation using multilayer perceptron and momentum gradient descent †
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9371018/
https://www.ncbi.nlm.nih.gov/pubmed/35957197
http://dx.doi.org/10.3390/s22155645
work_keys_str_mv AT panhu discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent
AT yezhiwei discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent
AT heqiyi discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent
AT yanchunyan discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent
AT yuanjianyu discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent
AT laixudong discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent
AT sujun discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent
AT liruihan discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent