Cargando…

Missing Value Imputation Method for Multiclass Matrix Data Based on Closed Itemset

Handling missing values in matrix data is an important step in data analysis. To date, many methods to estimate missing values based on data pattern similarity have been proposed. Most previously proposed methods perform missing value imputation based on data trends over the entire feature space. Ho...

Descripción completa

Detalles Bibliográficos
Autores principales: Tada, Mayu, Suzuki, Natsumi, Okada, Yoshifumi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870971/
https://www.ncbi.nlm.nih.gov/pubmed/35205580
http://dx.doi.org/10.3390/e24020286
_version_ 1784656884494499840
author Tada, Mayu
Suzuki, Natsumi
Okada, Yoshifumi
author_facet Tada, Mayu
Suzuki, Natsumi
Okada, Yoshifumi
author_sort Tada, Mayu
collection PubMed
description Handling missing values in matrix data is an important step in data analysis. To date, many methods to estimate missing values based on data pattern similarity have been proposed. Most previously proposed methods perform missing value imputation based on data trends over the entire feature space. However, individual missing values are likely to show similarity to data patterns in local feature space. In addition, most existing methods focus on single class data, while multiclass analysis is frequently required in various fields. Missing value imputation for multiclass data must consider the characteristics of each class. In this paper, we propose two methods based on closed itemsets, CIimpute and ICIimpute, to achieve missing value imputation using local feature space for multiclass matrix data. CIimpute estimates missing values using closed itemsets extracted from each class. ICIimpute is an improved method of CIimpute in which an attribute reduction process is introduced. Experimental results demonstrate that attribute reduction considerably reduces computational time and improves imputation accuracy. Furthermore, it is shown that, compared to existing methods, ICIimpute provides superior imputation accuracy but requires more computational time.
format Online
Article
Text
id pubmed-8870971
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88709712022-02-25 Missing Value Imputation Method for Multiclass Matrix Data Based on Closed Itemset Tada, Mayu Suzuki, Natsumi Okada, Yoshifumi Entropy (Basel) Article Handling missing values in matrix data is an important step in data analysis. To date, many methods to estimate missing values based on data pattern similarity have been proposed. Most previously proposed methods perform missing value imputation based on data trends over the entire feature space. However, individual missing values are likely to show similarity to data patterns in local feature space. In addition, most existing methods focus on single class data, while multiclass analysis is frequently required in various fields. Missing value imputation for multiclass data must consider the characteristics of each class. In this paper, we propose two methods based on closed itemsets, CIimpute and ICIimpute, to achieve missing value imputation using local feature space for multiclass matrix data. CIimpute estimates missing values using closed itemsets extracted from each class. ICIimpute is an improved method of CIimpute in which an attribute reduction process is introduced. Experimental results demonstrate that attribute reduction considerably reduces computational time and improves imputation accuracy. Furthermore, it is shown that, compared to existing methods, ICIimpute provides superior imputation accuracy but requires more computational time. MDPI 2022-02-16 /pmc/articles/PMC8870971/ /pubmed/35205580 http://dx.doi.org/10.3390/e24020286 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tada, Mayu
Suzuki, Natsumi
Okada, Yoshifumi
Missing Value Imputation Method for Multiclass Matrix Data Based on Closed Itemset
title Missing Value Imputation Method for Multiclass Matrix Data Based on Closed Itemset
title_full Missing Value Imputation Method for Multiclass Matrix Data Based on Closed Itemset
title_fullStr Missing Value Imputation Method for Multiclass Matrix Data Based on Closed Itemset
title_full_unstemmed Missing Value Imputation Method for Multiclass Matrix Data Based on Closed Itemset
title_short Missing Value Imputation Method for Multiclass Matrix Data Based on Closed Itemset
title_sort missing value imputation method for multiclass matrix data based on closed itemset
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870971/
https://www.ncbi.nlm.nih.gov/pubmed/35205580
http://dx.doi.org/10.3390/e24020286
work_keys_str_mv AT tadamayu missingvalueimputationmethodformulticlassmatrixdatabasedoncloseditemset
AT suzukinatsumi missingvalueimputationmethodformulticlassmatrixdatabasedoncloseditemset
AT okadayoshifumi missingvalueimputationmethodformulticlassmatrixdatabasedoncloseditemset