Cargando…
A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
Recently, with the popularization of camera tools such as mobile phones and the rise of various short video platforms, a lot of videos are being uploaded to the Internet at all times, for which a video retrieval system with fast retrieval speed and high precision is very necessary. Therefore, conten...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8124307/ https://www.ncbi.nlm.nih.gov/pubmed/33946745 http://dx.doi.org/10.3390/s21093094 |
_version_ | 1783693165660733440 |
---|---|
author | Chen, Hanqing Hu, Chunyan Lee, Feifei Lin, Chaowei Yao, Wei Chen, Lu Chen, Qiu |
author_facet | Chen, Hanqing Hu, Chunyan Lee, Feifei Lin, Chaowei Yao, Wei Chen, Lu Chen, Qiu |
author_sort | Chen, Hanqing |
collection | PubMed |
description | Recently, with the popularization of camera tools such as mobile phones and the rise of various short video platforms, a lot of videos are being uploaded to the Internet at all times, for which a video retrieval system with fast retrieval speed and high precision is very necessary. Therefore, content-based video retrieval (CBVR) has aroused the interest of many researchers. A typical CBVR system mainly contains the following two essential parts: video feature extraction and similarity comparison. Feature extraction of video is very challenging, previous video retrieval methods are mostly based on extracting features from single video frames, while resulting the loss of temporal information in the videos. Hashing methods are extensively used in multimedia information retrieval due to its retrieval efficiency, but most of them are currently only applied to image retrieval. In order to solve these problems in video retrieval, we build an end-to-end framework called deep supervised video hashing (DSVH), which employs a 3D convolutional neural network (CNN) to obtain spatial-temporal features of videos, then train a set of hash functions by supervised hashing to transfer the video features into binary space and get the compact binary codes of videos. Finally, we use triplet loss for network training. We conduct a lot of experiments on three public video datasets UCF-101, JHMDB and HMDB-51, and the results show that the proposed method has advantages over many state-of-the-art video retrieval methods. Compared with the DVH method, the mAP value of UCF-101 dataset is improved by 9.3%, and the minimum improvement on JHMDB dataset is also increased by 0.3%. At the same time, we also demonstrate the stability of the algorithm in the HMDB-51 dataset. |
format | Online Article Text |
id | pubmed-8124307 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-81243072021-05-17 A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval Chen, Hanqing Hu, Chunyan Lee, Feifei Lin, Chaowei Yao, Wei Chen, Lu Chen, Qiu Sensors (Basel) Article Recently, with the popularization of camera tools such as mobile phones and the rise of various short video platforms, a lot of videos are being uploaded to the Internet at all times, for which a video retrieval system with fast retrieval speed and high precision is very necessary. Therefore, content-based video retrieval (CBVR) has aroused the interest of many researchers. A typical CBVR system mainly contains the following two essential parts: video feature extraction and similarity comparison. Feature extraction of video is very challenging, previous video retrieval methods are mostly based on extracting features from single video frames, while resulting the loss of temporal information in the videos. Hashing methods are extensively used in multimedia information retrieval due to its retrieval efficiency, but most of them are currently only applied to image retrieval. In order to solve these problems in video retrieval, we build an end-to-end framework called deep supervised video hashing (DSVH), which employs a 3D convolutional neural network (CNN) to obtain spatial-temporal features of videos, then train a set of hash functions by supervised hashing to transfer the video features into binary space and get the compact binary codes of videos. Finally, we use triplet loss for network training. We conduct a lot of experiments on three public video datasets UCF-101, JHMDB and HMDB-51, and the results show that the proposed method has advantages over many state-of-the-art video retrieval methods. Compared with the DVH method, the mAP value of UCF-101 dataset is improved by 9.3%, and the minimum improvement on JHMDB dataset is also increased by 0.3%. At the same time, we also demonstrate the stability of the algorithm in the HMDB-51 dataset. MDPI 2021-04-29 /pmc/articles/PMC8124307/ /pubmed/33946745 http://dx.doi.org/10.3390/s21093094 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Chen, Hanqing Hu, Chunyan Lee, Feifei Lin, Chaowei Yao, Wei Chen, Lu Chen, Qiu A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval |
title | A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval |
title_full | A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval |
title_fullStr | A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval |
title_full_unstemmed | A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval |
title_short | A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval |
title_sort | supervised video hashing method based on a deep 3d convolutional neural network for large-scale video retrieval |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8124307/ https://www.ncbi.nlm.nih.gov/pubmed/33946745 http://dx.doi.org/10.3390/s21093094 |
work_keys_str_mv | AT chenhanqing asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT huchunyan asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT leefeifei asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT linchaowei asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT yaowei asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT chenlu asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT chenqiu asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT chenhanqing supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT huchunyan supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT leefeifei supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT linchaowei supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT yaowei supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT chenlu supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT chenqiu supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval |