Cargando…
Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval
Due to the high efficiency of hashing technology and the high abstraction of deep networks, deep hashing has achieved appealing effectiveness and efficiency for large-scale cross-modal retrieval. However, how to efficiently measure the similarity of fine-grained multi-labels for multi-modal data and...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8176532/ https://www.ncbi.nlm.nih.gov/pubmed/34141884 http://dx.doi.org/10.7717/peerj-cs.552 |
_version_ | 1783703274213343232 |
---|---|
author | Chen, Shubai Wu, Song Wang, Li |
author_facet | Chen, Shubai Wu, Song Wang, Li |
author_sort | Chen, Shubai |
collection | PubMed |
description | Due to the high efficiency of hashing technology and the high abstraction of deep networks, deep hashing has achieved appealing effectiveness and efficiency for large-scale cross-modal retrieval. However, how to efficiently measure the similarity of fine-grained multi-labels for multi-modal data and thoroughly explore the intermediate layers specific information of networks are still two challenges for high-performance cross-modal hashing retrieval. Thus, in this paper, we propose a novel Hierarchical Semantic Interaction-based Deep Hashing Network (HSIDHN) for large-scale cross-modal retrieval. In the proposed HSIDHN, the multi-scale and fusion operations are first applied to each layer of the network. A Bidirectional Bi-linear Interaction (BBI) policy is then designed to achieve the hierarchical semantic interaction among different layers, such that the capability of hash representations can be enhanced. Moreover, a dual-similarity measurement (“hard” similarity and “soft” similarity) is designed to calculate the semantic similarity of different modality data, aiming to better preserve the semantic correlation of multi-labels. Extensive experiment results on two large-scale public datasets have shown that the performance of our HSIDHN is competitive to state-of-the-art deep cross-modal hashing methods. |
format | Online Article Text |
id | pubmed-8176532 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-81765322021-06-16 Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval Chen, Shubai Wu, Song Wang, Li PeerJ Comput Sci Artificial Intelligence Due to the high efficiency of hashing technology and the high abstraction of deep networks, deep hashing has achieved appealing effectiveness and efficiency for large-scale cross-modal retrieval. However, how to efficiently measure the similarity of fine-grained multi-labels for multi-modal data and thoroughly explore the intermediate layers specific information of networks are still two challenges for high-performance cross-modal hashing retrieval. Thus, in this paper, we propose a novel Hierarchical Semantic Interaction-based Deep Hashing Network (HSIDHN) for large-scale cross-modal retrieval. In the proposed HSIDHN, the multi-scale and fusion operations are first applied to each layer of the network. A Bidirectional Bi-linear Interaction (BBI) policy is then designed to achieve the hierarchical semantic interaction among different layers, such that the capability of hash representations can be enhanced. Moreover, a dual-similarity measurement (“hard” similarity and “soft” similarity) is designed to calculate the semantic similarity of different modality data, aiming to better preserve the semantic correlation of multi-labels. Extensive experiment results on two large-scale public datasets have shown that the performance of our HSIDHN is competitive to state-of-the-art deep cross-modal hashing methods. PeerJ Inc. 2021-05-25 /pmc/articles/PMC8176532/ /pubmed/34141884 http://dx.doi.org/10.7717/peerj-cs.552 Text en © 2021 Chen et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Artificial Intelligence Chen, Shubai Wu, Song Wang, Li Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval |
title | Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval |
title_full | Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval |
title_fullStr | Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval |
title_full_unstemmed | Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval |
title_short | Hierarchical semantic interaction-based deep hashing network for cross-modal retrieval |
title_sort | hierarchical semantic interaction-based deep hashing network for cross-modal retrieval |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8176532/ https://www.ncbi.nlm.nih.gov/pubmed/34141884 http://dx.doi.org/10.7717/peerj-cs.552 |
work_keys_str_mv | AT chenshubai hierarchicalsemanticinteractionbaseddeephashingnetworkforcrossmodalretrieval AT wusong hierarchicalsemanticinteractionbaseddeephashingnetworkforcrossmodalretrieval AT wangli hierarchicalsemanticinteractionbaseddeephashingnetworkforcrossmodalretrieval |