Cargando…

A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval

Recently, with the popularization of camera tools such as mobile phones and the rise of various short video platforms, a lot of videos are being uploaded to the Internet at all times, for which a video retrieval system with fast retrieval speed and high precision is very necessary. Therefore, conten...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chen, Hanqing, Hu, Chunyan, Lee, Feifei, Lin, Chaowei, Yao, Wei, Chen, Lu, Chen, Qiu
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8124307/ https://www.ncbi.nlm.nih.gov/pubmed/33946745 http://dx.doi.org/10.3390/s21093094

_version_	1783693165660733440
author	Chen, Hanqing Hu, Chunyan Lee, Feifei Lin, Chaowei Yao, Wei Chen, Lu Chen, Qiu
author_facet	Chen, Hanqing Hu, Chunyan Lee, Feifei Lin, Chaowei Yao, Wei Chen, Lu Chen, Qiu
author_sort	Chen, Hanqing
collection	PubMed
description	Recently, with the popularization of camera tools such as mobile phones and the rise of various short video platforms, a lot of videos are being uploaded to the Internet at all times, for which a video retrieval system with fast retrieval speed and high precision is very necessary. Therefore, content-based video retrieval (CBVR) has aroused the interest of many researchers. A typical CBVR system mainly contains the following two essential parts: video feature extraction and similarity comparison. Feature extraction of video is very challenging, previous video retrieval methods are mostly based on extracting features from single video frames, while resulting the loss of temporal information in the videos. Hashing methods are extensively used in multimedia information retrieval due to its retrieval efficiency, but most of them are currently only applied to image retrieval. In order to solve these problems in video retrieval, we build an end-to-end framework called deep supervised video hashing (DSVH), which employs a 3D convolutional neural network (CNN) to obtain spatial-temporal features of videos, then train a set of hash functions by supervised hashing to transfer the video features into binary space and get the compact binary codes of videos. Finally, we use triplet loss for network training. We conduct a lot of experiments on three public video datasets UCF-101, JHMDB and HMDB-51, and the results show that the proposed method has advantages over many state-of-the-art video retrieval methods. Compared with the DVH method, the mAP value of UCF-101 dataset is improved by 9.3%, and the minimum improvement on JHMDB dataset is also increased by 0.3%. At the same time, we also demonstrate the stability of the algorithm in the HMDB-51 dataset.
format	Online Article Text
id	pubmed-8124307
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-81243072021-05-17 A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval Chen, Hanqing Hu, Chunyan Lee, Feifei Lin, Chaowei Yao, Wei Chen, Lu Chen, Qiu Sensors (Basel) Article Recently, with the popularization of camera tools such as mobile phones and the rise of various short video platforms, a lot of videos are being uploaded to the Internet at all times, for which a video retrieval system with fast retrieval speed and high precision is very necessary. Therefore, content-based video retrieval (CBVR) has aroused the interest of many researchers. A typical CBVR system mainly contains the following two essential parts: video feature extraction and similarity comparison. Feature extraction of video is very challenging, previous video retrieval methods are mostly based on extracting features from single video frames, while resulting the loss of temporal information in the videos. Hashing methods are extensively used in multimedia information retrieval due to its retrieval efficiency, but most of them are currently only applied to image retrieval. In order to solve these problems in video retrieval, we build an end-to-end framework called deep supervised video hashing (DSVH), which employs a 3D convolutional neural network (CNN) to obtain spatial-temporal features of videos, then train a set of hash functions by supervised hashing to transfer the video features into binary space and get the compact binary codes of videos. Finally, we use triplet loss for network training. We conduct a lot of experiments on three public video datasets UCF-101, JHMDB and HMDB-51, and the results show that the proposed method has advantages over many state-of-the-art video retrieval methods. Compared with the DVH method, the mAP value of UCF-101 dataset is improved by 9.3%, and the minimum improvement on JHMDB dataset is also increased by 0.3%. At the same time, we also demonstrate the stability of the algorithm in the HMDB-51 dataset. MDPI 2021-04-29 /pmc/articles/PMC8124307/ /pubmed/33946745 http://dx.doi.org/10.3390/s21093094 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Chen, Hanqing Hu, Chunyan Lee, Feifei Lin, Chaowei Yao, Wei Chen, Lu Chen, Qiu A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
title	A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
title_full	A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
title_fullStr	A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
title_full_unstemmed	A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
title_short	A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval
title_sort	supervised video hashing method based on a deep 3d convolutional neural network for large-scale video retrieval
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8124307/ https://www.ncbi.nlm.nih.gov/pubmed/33946745 http://dx.doi.org/10.3390/s21093094
work_keys_str_mv	AT chenhanqing asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT huchunyan asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT leefeifei asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT linchaowei asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT yaowei asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT chenlu asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT chenqiu asupervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT chenhanqing supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT huchunyan supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT leefeifei supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT linchaowei supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT yaowei supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT chenlu supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval AT chenqiu supervisedvideohashingmethodbasedonadeep3dconvolutionalneuralnetworkforlargescalevideoretrieval

A Supervised Video Hashing Method Based on a Deep 3D Convolutional Neural Network for Large-Scale Video Retrieval

Ejemplares similares