Cargando…
Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent †
Data are a strategic resource for industrial production, and an efficient data-mining process will increase productivity. However, there exist many missing values in data collected in real life due to various problems. Because the missing data may reduce productivity, missing value imputation is an...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9371018/ https://www.ncbi.nlm.nih.gov/pubmed/35957197 http://dx.doi.org/10.3390/s22155645 |
_version_ | 1784767002206797824 |
---|---|
author | Pan, Hu Ye, Zhiwei He, Qiyi Yan, Chunyan Yuan, Jianyu Lai, Xudong Su, Jun Li, Ruihan |
author_facet | Pan, Hu Ye, Zhiwei He, Qiyi Yan, Chunyan Yuan, Jianyu Lai, Xudong Su, Jun Li, Ruihan |
author_sort | Pan, Hu |
collection | PubMed |
description | Data are a strategic resource for industrial production, and an efficient data-mining process will increase productivity. However, there exist many missing values in data collected in real life due to various problems. Because the missing data may reduce productivity, missing value imputation is an important research topic in data mining. At present, most studies mainly focus on imputation methods for continuous missing data, while a few concentrate on discrete missing data. In this paper, a discrete missing value imputation method based on a multilayer perceptron (MLP) is proposed, which employs a momentum gradient descent algorithm, and some prefilling strategies are utilized to improve the convergence speed of the MLP. To verify the effectiveness of the method, experiments are conducted to compare the classification accuracy with eight common imputation methods, such as the mode, random, hot-deck, KNN, autoencoder, and MLP, under different missing mechanisms and missing proportions. Experimental results verify that the improved MLP model (IMLP) can effectively impute discrete missing values in most situations under three missing patterns. |
format | Online Article Text |
id | pubmed-9371018 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-93710182022-08-12 Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent † Pan, Hu Ye, Zhiwei He, Qiyi Yan, Chunyan Yuan, Jianyu Lai, Xudong Su, Jun Li, Ruihan Sensors (Basel) Article Data are a strategic resource for industrial production, and an efficient data-mining process will increase productivity. However, there exist many missing values in data collected in real life due to various problems. Because the missing data may reduce productivity, missing value imputation is an important research topic in data mining. At present, most studies mainly focus on imputation methods for continuous missing data, while a few concentrate on discrete missing data. In this paper, a discrete missing value imputation method based on a multilayer perceptron (MLP) is proposed, which employs a momentum gradient descent algorithm, and some prefilling strategies are utilized to improve the convergence speed of the MLP. To verify the effectiveness of the method, experiments are conducted to compare the classification accuracy with eight common imputation methods, such as the mode, random, hot-deck, KNN, autoencoder, and MLP, under different missing mechanisms and missing proportions. Experimental results verify that the improved MLP model (IMLP) can effectively impute discrete missing values in most situations under three missing patterns. MDPI 2022-07-28 /pmc/articles/PMC9371018/ /pubmed/35957197 http://dx.doi.org/10.3390/s22155645 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Pan, Hu Ye, Zhiwei He, Qiyi Yan, Chunyan Yuan, Jianyu Lai, Xudong Su, Jun Li, Ruihan Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent † |
title | Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent † |
title_full | Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent † |
title_fullStr | Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent † |
title_full_unstemmed | Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent † |
title_short | Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent † |
title_sort | discrete missing data imputation using multilayer perceptron and momentum gradient descent † |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9371018/ https://www.ncbi.nlm.nih.gov/pubmed/35957197 http://dx.doi.org/10.3390/s22155645 |
work_keys_str_mv | AT panhu discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent AT yezhiwei discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent AT heqiyi discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent AT yanchunyan discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent AT yuanjianyu discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent AT laixudong discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent AT sujun discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent AT liruihan discretemissingdataimputationusingmultilayerperceptronandmomentumgradientdescent |