Cargando…

Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval

Due to the high efficiency of hashing technology and the high abstraction of deep networks, deep hashing has achieved appealing effectiveness and efficiency for large-scale cross-modal retrieval. However, how to efficiently measure the similarity of fine-grained multi-labels for multi-modal data and...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Shubai, Wu, Song, Wang, Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8176532/
https://www.ncbi.nlm.nih.gov/pubmed/34141884
http://dx.doi.org/10.7717/peerj-cs.552
_version_ 1783703274213343232
author Chen, Shubai
Wu, Song
Wang, Li
author_facet Chen, Shubai
Wu, Song
Wang, Li
author_sort Chen, Shubai
collection PubMed
description Due to the high efficiency of hashing technology and the high abstraction of deep networks, deep hashing has achieved appealing effectiveness and efficiency for large-scale cross-modal retrieval. However, how to efficiently measure the similarity of fine-grained multi-labels for multi-modal data and thoroughly explore the intermediate layers specific information of networks are still two challenges for high-performance cross-modal hashing retrieval. Thus, in this paper, we propose a novel Hierarchical Semantic Interaction-based Deep Hashing Network (HSIDHN) for large-scale cross-modal retrieval. In the proposed HSIDHN, the multi-scale and fusion operations are first applied to each layer of the network. A Bidirectional Bi-linear Interaction (BBI) policy is then designed to achieve the hierarchical semantic interaction among different layers, such that the capability of hash representations can be enhanced. Moreover, a dual-similarity measurement (“hard” similarity and “soft” similarity) is designed to calculate the semantic similarity of different modality data, aiming to better preserve the semantic correlation of multi-labels. Extensive experiment results on two large-scale public datasets have shown that the performance of our HSIDHN is competitive to state-of-the-art deep cross-modal hashing methods.
format Online
Article
Text
id pubmed-8176532
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-81765322021-06-16 Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval Chen, Shubai Wu, Song Wang, Li PeerJ Comput Sci Artificial Intelligence Due to the high efficiency of hashing technology and the high abstraction of deep networks, deep hashing has achieved appealing effectiveness and efficiency for large-scale cross-modal retrieval. However, how to efficiently measure the similarity of fine-grained multi-labels for multi-modal data and thoroughly explore the intermediate layers specific information of networks are still two challenges for high-performance cross-modal hashing retrieval. Thus, in this paper, we propose a novel Hierarchical Semantic Interaction-based Deep Hashing Network (HSIDHN) for large-scale cross-modal retrieval. In the proposed HSIDHN, the multi-scale and fusion operations are first applied to each layer of the network. A Bidirectional Bi-linear Interaction (BBI) policy is then designed to achieve the hierarchical semantic interaction among different layers, such that the capability of hash representations can be enhanced. Moreover, a dual-similarity measurement (“hard” similarity and “soft” similarity) is designed to calculate the semantic similarity of different modality data, aiming to better preserve the semantic correlation of multi-labels. Extensive experiment results on two large-scale public datasets have shown that the performance of our HSIDHN is competitive to state-of-the-art deep cross-modal hashing methods. PeerJ Inc. 2021-05-25 /pmc/articles/PMC8176532/ /pubmed/34141884 http://dx.doi.org/10.7717/peerj-cs.552 Text en © 2021 Chen et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Artificial Intelligence
Chen, Shubai
Wu, Song
Wang, Li
Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval
title Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval
title_full Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval
title_fullStr Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval
title_full_unstemmed Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval
title_short Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval
title_sort hierarchical semantic interaction-based deep hashing network for cross-modal retrieval
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8176532/
https://www.ncbi.nlm.nih.gov/pubmed/34141884
http://dx.doi.org/10.7717/peerj-cs.552
work_keys_str_mv AT chenshubai hierarchicalsemanticinteractionbaseddeephashingnetworkforcrossmodalretrieval
AT wusong hierarchicalsemanticinteractionbaseddeephashingnetworkforcrossmodalretrieval
AT wangli hierarchicalsemanticinteractionbaseddeephashingnetworkforcrossmodalretrieval