Cargando…

Deep Multimodal Clustering with Cross Reconstruction

Recently, there has been surging interests in multimodal clustering. And extracting common features plays a critical role in these methods. However, since the ignorance of the fact that data in different modalities shares similar distributions in feature space, most works did not mining the inter-mo...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Xianchao, Tang, Xiaorui, Zong, Linlin, Liu, Xinyue, Mu, Jie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206244/
http://dx.doi.org/10.1007/978-3-030-47426-3_24
_version_ 1783530376416722944
author Zhang, Xianchao
Tang, Xiaorui
Zong, Linlin
Liu, Xinyue
Mu, Jie
author_facet Zhang, Xianchao
Tang, Xiaorui
Zong, Linlin
Liu, Xinyue
Mu, Jie
author_sort Zhang, Xianchao
collection PubMed
description Recently, there has been surging interests in multimodal clustering. And extracting common features plays a critical role in these methods. However, since the ignorance of the fact that data in different modalities shares similar distributions in feature space, most works did not mining the inter-modal distribution relationships completely, which eventually leads to unacceptable common features. To address this issue, we propose the deep multimodal clustering with cross reconstruction method, which firstly focuses on multimodal feature extraction in an unsupervised way and then clusters these extracted features. The proposed cross reconstruction aims to build latent connections among different modalities, which effectively reduces the distribution differences in feature space. The theoretical analysis shows that the cross reconstruction reduces the Wasserstein distance of multimodal feature distributions. Experimental results on six benchmark datasets demonstrate that our method achieves obviously improvement over several state-of-arts.
format Online
Article
Text
id pubmed-7206244
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72062442020-05-08 Deep Multimodal Clustering with Cross Reconstruction Zhang, Xianchao Tang, Xiaorui Zong, Linlin Liu, Xinyue Mu, Jie Advances in Knowledge Discovery and Data Mining Article Recently, there has been surging interests in multimodal clustering. And extracting common features plays a critical role in these methods. However, since the ignorance of the fact that data in different modalities shares similar distributions in feature space, most works did not mining the inter-modal distribution relationships completely, which eventually leads to unacceptable common features. To address this issue, we propose the deep multimodal clustering with cross reconstruction method, which firstly focuses on multimodal feature extraction in an unsupervised way and then clusters these extracted features. The proposed cross reconstruction aims to build latent connections among different modalities, which effectively reduces the distribution differences in feature space. The theoretical analysis shows that the cross reconstruction reduces the Wasserstein distance of multimodal feature distributions. Experimental results on six benchmark datasets demonstrate that our method achieves obviously improvement over several state-of-arts. 2020-04-17 /pmc/articles/PMC7206244/ http://dx.doi.org/10.1007/978-3-030-47426-3_24 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Zhang, Xianchao
Tang, Xiaorui
Zong, Linlin
Liu, Xinyue
Mu, Jie
Deep Multimodal Clustering with Cross Reconstruction
title Deep Multimodal Clustering with Cross Reconstruction
title_full Deep Multimodal Clustering with Cross Reconstruction
title_fullStr Deep Multimodal Clustering with Cross Reconstruction
title_full_unstemmed Deep Multimodal Clustering with Cross Reconstruction
title_short Deep Multimodal Clustering with Cross Reconstruction
title_sort deep multimodal clustering with cross reconstruction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206244/
http://dx.doi.org/10.1007/978-3-030-47426-3_24
work_keys_str_mv AT zhangxianchao deepmultimodalclusteringwithcrossreconstruction
AT tangxiaorui deepmultimodalclusteringwithcrossreconstruction
AT zonglinlin deepmultimodalclusteringwithcrossreconstruction
AT liuxinyue deepmultimodalclusteringwithcrossreconstruction
AT mujie deepmultimodalclusteringwithcrossreconstruction