Cargando…

The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data

Clustering analysis of massive data in wireless multimedia sensor networks (WMSN) has become a hot topic. However, most data clustering algorithms have difficulty in obtaining latent nonlinear correlations of data features, resulting in a low clustering accuracy. In addition, it is difficult to extr...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Xiulan, Li, Hongyu, Zhang, Zufan, Gan, Chenquan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6413117/
https://www.ncbi.nlm.nih.gov/pubmed/30781499
http://dx.doi.org/10.3390/s19040809
_version_ 1783402764329549824
author Yu, Xiulan
Li, Hongyu
Zhang, Zufan
Gan, Chenquan
author_facet Yu, Xiulan
Li, Hongyu
Zhang, Zufan
Gan, Chenquan
author_sort Yu, Xiulan
collection PubMed
description Clustering analysis of massive data in wireless multimedia sensor networks (WMSN) has become a hot topic. However, most data clustering algorithms have difficulty in obtaining latent nonlinear correlations of data features, resulting in a low clustering accuracy. In addition, it is difficult to extract features from missing or corrupted data, so incomplete data are widely used in practical work. In this paper, the optimally designed variational autoencoder networks is proposed for extracting features of incomplete data and using high-order fuzzy c-means algorithm (HOFCM) to improve cluster performance of incomplete data. Specifically, the feature extraction model is improved by using variational autoencoder to learn the feature of incomplete data. To capture nonlinear correlations in different heterogeneous data patterns, tensor based fuzzy c-means algorithm is used to cluster low-dimensional features. The tensor distance is used as the distance measure to capture the unknown correlations of data as much as possible. Finally, in the case that the clustering results are obtained, the missing data can be restored by using the low-dimensional features. Experiments on real datasets show that the proposed algorithm not only can improve the clustering performance of incomplete data effectively, but also can fill in missing features and get better data reconstruction results.
format Online
Article
Text
id pubmed-6413117
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64131172019-04-03 The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data Yu, Xiulan Li, Hongyu Zhang, Zufan Gan, Chenquan Sensors (Basel) Article Clustering analysis of massive data in wireless multimedia sensor networks (WMSN) has become a hot topic. However, most data clustering algorithms have difficulty in obtaining latent nonlinear correlations of data features, resulting in a low clustering accuracy. In addition, it is difficult to extract features from missing or corrupted data, so incomplete data are widely used in practical work. In this paper, the optimally designed variational autoencoder networks is proposed for extracting features of incomplete data and using high-order fuzzy c-means algorithm (HOFCM) to improve cluster performance of incomplete data. Specifically, the feature extraction model is improved by using variational autoencoder to learn the feature of incomplete data. To capture nonlinear correlations in different heterogeneous data patterns, tensor based fuzzy c-means algorithm is used to cluster low-dimensional features. The tensor distance is used as the distance measure to capture the unknown correlations of data as much as possible. Finally, in the case that the clustering results are obtained, the missing data can be restored by using the low-dimensional features. Experiments on real datasets show that the proposed algorithm not only can improve the clustering performance of incomplete data effectively, but also can fill in missing features and get better data reconstruction results. MDPI 2019-02-16 /pmc/articles/PMC6413117/ /pubmed/30781499 http://dx.doi.org/10.3390/s19040809 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yu, Xiulan
Li, Hongyu
Zhang, Zufan
Gan, Chenquan
The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data
title The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data
title_full The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data
title_fullStr The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data
title_full_unstemmed The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data
title_short The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data
title_sort optimally designed variational autoencoder networks for clustering and recovery of incomplete multimedia data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6413117/
https://www.ncbi.nlm.nih.gov/pubmed/30781499
http://dx.doi.org/10.3390/s19040809
work_keys_str_mv AT yuxiulan theoptimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT lihongyu theoptimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT zhangzufan theoptimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT ganchenquan theoptimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT yuxiulan optimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT lihongyu optimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT zhangzufan optimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata
AT ganchenquan optimallydesignedvariationalautoencodernetworksforclusteringandrecoveryofincompletemultimediadata