Cargando…
Deep Multimodal Clustering with Cross Reconstruction
Recently, there has been surging interests in multimodal clustering. And extracting common features plays a critical role in these methods. However, since the ignorance of the fact that data in different modalities shares similar distributions in feature space, most works did not mining the inter-mo...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206244/ http://dx.doi.org/10.1007/978-3-030-47426-3_24 |
_version_ | 1783530376416722944 |
---|---|
author | Zhang, Xianchao Tang, Xiaorui Zong, Linlin Liu, Xinyue Mu, Jie |
author_facet | Zhang, Xianchao Tang, Xiaorui Zong, Linlin Liu, Xinyue Mu, Jie |
author_sort | Zhang, Xianchao |
collection | PubMed |
description | Recently, there has been surging interests in multimodal clustering. And extracting common features plays a critical role in these methods. However, since the ignorance of the fact that data in different modalities shares similar distributions in feature space, most works did not mining the inter-modal distribution relationships completely, which eventually leads to unacceptable common features. To address this issue, we propose the deep multimodal clustering with cross reconstruction method, which firstly focuses on multimodal feature extraction in an unsupervised way and then clusters these extracted features. The proposed cross reconstruction aims to build latent connections among different modalities, which effectively reduces the distribution differences in feature space. The theoretical analysis shows that the cross reconstruction reduces the Wasserstein distance of multimodal feature distributions. Experimental results on six benchmark datasets demonstrate that our method achieves obviously improvement over several state-of-arts. |
format | Online Article Text |
id | pubmed-7206244 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-72062442020-05-08 Deep Multimodal Clustering with Cross Reconstruction Zhang, Xianchao Tang, Xiaorui Zong, Linlin Liu, Xinyue Mu, Jie Advances in Knowledge Discovery and Data Mining Article Recently, there has been surging interests in multimodal clustering. And extracting common features plays a critical role in these methods. However, since the ignorance of the fact that data in different modalities shares similar distributions in feature space, most works did not mining the inter-modal distribution relationships completely, which eventually leads to unacceptable common features. To address this issue, we propose the deep multimodal clustering with cross reconstruction method, which firstly focuses on multimodal feature extraction in an unsupervised way and then clusters these extracted features. The proposed cross reconstruction aims to build latent connections among different modalities, which effectively reduces the distribution differences in feature space. The theoretical analysis shows that the cross reconstruction reduces the Wasserstein distance of multimodal feature distributions. Experimental results on six benchmark datasets demonstrate that our method achieves obviously improvement over several state-of-arts. 2020-04-17 /pmc/articles/PMC7206244/ http://dx.doi.org/10.1007/978-3-030-47426-3_24 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Zhang, Xianchao Tang, Xiaorui Zong, Linlin Liu, Xinyue Mu, Jie Deep Multimodal Clustering with Cross Reconstruction |
title | Deep Multimodal Clustering with Cross Reconstruction |
title_full | Deep Multimodal Clustering with Cross Reconstruction |
title_fullStr | Deep Multimodal Clustering with Cross Reconstruction |
title_full_unstemmed | Deep Multimodal Clustering with Cross Reconstruction |
title_short | Deep Multimodal Clustering with Cross Reconstruction |
title_sort | deep multimodal clustering with cross reconstruction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206244/ http://dx.doi.org/10.1007/978-3-030-47426-3_24 |
work_keys_str_mv | AT zhangxianchao deepmultimodalclusteringwithcrossreconstruction AT tangxiaorui deepmultimodalclusteringwithcrossreconstruction AT zonglinlin deepmultimodalclusteringwithcrossreconstruction AT liuxinyue deepmultimodalclusteringwithcrossreconstruction AT mujie deepmultimodalclusteringwithcrossreconstruction |