Cargando…

A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval

Recently, with the popularization of camera tools such as mobile phones and the rise of various short video platforms, a lot of videos are being uploaded to the Internet at all times, for which a video retrieval system with fast retrieval speed and high precision is very necessary. Therefore, conten...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Hanqing, Hu, Chunyan, Lee, Feifei, Lin, Chaowei, Yao, Wei, Chen, Lu, Chen, Qiu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8124307/
https://www.ncbi.nlm.nih.gov/pubmed/33946745
http://dx.doi.org/10.3390/s21093094
_version_ 1783693165660733440
author Chen, Hanqing
Hu, Chunyan
Lee, Feifei
Lin, Chaowei
Yao, Wei
Chen, Lu
Chen, Qiu
author_facet Chen, Hanqing
Hu, Chunyan
Lee, Feifei
Lin, Chaowei
Yao, Wei
Chen, Lu
Chen, Qiu
author_sort Chen, Hanqing
collection PubMed
description Recently, with the popularization of camera tools such as mobile phones and the rise of various short video platforms, a lot of videos are being uploaded to the Internet at all times, for which a video retrieval system with fast retrieval speed and high precision is very necessary. Therefore, content-based video retrieval (CBVR) has aroused the interest of many researchers. A typical CBVR system mainly contains the following two essential parts: video feature extraction and similarity comparison. Feature extraction of video is very challenging, previous video retrieval methods are mostly based on extracting features from single video frames, while resulting the loss of temporal information in the videos. Hashing methods are extensively used in multimedia information retrieval due to its retrieval efficiency, but most of them are currently only applied to image retrieval. In order to solve these problems in video retrieval, we build an end-to-end framework called deep supervised video hashing (DSVH), which employs a 3D convolutional neural network (CNN) to obtain spatial-temporal features of videos, then train a set of hash functions by supervised hashing to transfer the video features into binary space and get the compact binary codes of videos. Finally, we use triplet loss for network training. We conduct a lot of experiments on three public video datasets UCF-101, JHMDB and HMDB-51, and the results show that the proposed method has advantages over many state-of-the-art video retrieval methods. Compared with the DVH method, the mAP value of UCF-101 dataset is improved by 9.3%, and the minimum improvement on JHMDB dataset is also increased by 0.3%. At the same time, we also demonstrate the stability of the algorithm in the HMDB-51 dataset.
format Online
Article
Text
id pubmed-8124307
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-81243072021-05-17 A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval Chen, Hanqing Hu, Chunyan Lee, Feifei Lin, Chaowei Yao, Wei Chen, Lu Chen, Qiu Sensors (Basel) Article Recently, with the popularization of camera tools such as mobile phones and the rise of various short video platforms, a lot of videos are being uploaded to the Internet at all times, for which a video retrieval system with fast retrieval speed and high precision is very necessary. Therefore, content-based video retrieval (CBVR) has aroused the interest of many researchers. A typical CBVR system mainly contains the following two essential parts: video feature extraction and similarity comparison. Feature extraction of video is very challenging, previous video retrieval methods are mostly based on extracting features from single video frames, while resulting the loss of temporal information in the videos. Hashing methods are extensively used in multimedia information retrieval due to its retrieval efficiency, but most of them are currently only applied to image retrieval. In order to solve these problems in video retrieval, we build an end-to-end framework called deep supervised video hashing (DSVH), which employs a 3D convolutional neural network (CNN) to obtain spatial-temporal features of videos, then train a set of hash functions by supervised hashing to transfer the video features into binary space and get the compact binary codes of videos. Finally, we use triplet loss for network training. We conduct a lot of experiments on three public video datasets UCF-101, JHMDB and HMDB-51, and the results show that the proposed method has advantages over many state-of-the-art video retrieval methods. Compared with the DVH method, the mAP value of UCF-101 dataset is improved by 9.3%, and the minimum improvement on JHMDB dataset is also increased by 0.3%. At the same time, we also demonstrate the stability of the algorithm in the HMDB-51 dataset. MDPI 2021-04-29 /pmc/articles/PMC8124307/ /pubmed/33946745 http://dx.doi.org/10.3390/s21093094 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chen, Hanqing
Hu, Chunyan
Lee, Feifei
Lin, Chaowei
Yao, Wei
Chen, Lu
Chen, Qiu
A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
title A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
title_full A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
title_fullStr A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
title_full_unstemmed A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
title_short A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
title_sort supervised video hashing method based on a deep 3d convolutional neural network for large-scale video retrieval
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8124307/
https://www.ncbi.nlm.nih.gov/pubmed/33946745
http://dx.doi.org/10.3390/s21093094
work_keys_str_mv AT chenhanqing asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval
AT huchunyan asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval
AT leefeifei asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval
AT linchaowei asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval
AT yaowei asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval
AT chenlu asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval
AT chenqiu asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval
AT chenhanqing supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval
AT huchunyan supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval
AT leefeifei supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval
AT linchaowei supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval
AT yaowei supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval
AT chenlu supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval
AT chenqiu supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval