Cargando…

Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review

Video object and human action detection are applied in many fields, such as video surveillance, face recognition, etc. Video object detection includes object classification and object location within the frame. Human action recognition is the detection of human actions. Usually, video detection is m...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Dengshan, Wang, Rujing, Chen, Peng, Xie, Chengjun, Zhou, Qiong, Jia, Xiufang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8781209/
https://www.ncbi.nlm.nih.gov/pubmed/35056238
http://dx.doi.org/10.3390/mi13010072
_version_ 1784638035129794560
author Li, Dengshan
Wang, Rujing
Chen, Peng
Xie, Chengjun
Zhou, Qiong
Jia, Xiufang
author_facet Li, Dengshan
Wang, Rujing
Chen, Peng
Xie, Chengjun
Zhou, Qiong
Jia, Xiufang
author_sort Li, Dengshan
collection PubMed
description Video object and human action detection are applied in many fields, such as video surveillance, face recognition, etc. Video object detection includes object classification and object location within the frame. Human action recognition is the detection of human actions. Usually, video detection is more challenging than image detection, since video frames are often more blurry than images. Moreover, video detection often has other difficulties, such as video defocus, motion blur, part occlusion, etc. Nowadays, the video detection technology is able to implement real-time detection, or high-accurate detection of blurry video frames. In this paper, various video object and human action detection approaches are reviewed and discussed, many of them have performed state-of-the-art results. We mainly review and discuss the classic video detection methods with supervised learning. In addition, the frequently-used video object detection and human action recognition datasets are reviewed. Finally, a summarization of the video detection is represented, e.g., the video object and human action detection methods could be classified into frame-by-frame (frame-based) detection, extracting-key-frame detection and using-temporal-information detection; the methods of utilizing temporal information of adjacent video frames are mainly the optical flow method, Long Short-Term Memory and convolution among adjacent frames.
format Online
Article
Text
id pubmed-8781209
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87812092022-01-22 Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review Li, Dengshan Wang, Rujing Chen, Peng Xie, Chengjun Zhou, Qiong Jia, Xiufang Micromachines (Basel) Systematic Review Video object and human action detection are applied in many fields, such as video surveillance, face recognition, etc. Video object detection includes object classification and object location within the frame. Human action recognition is the detection of human actions. Usually, video detection is more challenging than image detection, since video frames are often more blurry than images. Moreover, video detection often has other difficulties, such as video defocus, motion blur, part occlusion, etc. Nowadays, the video detection technology is able to implement real-time detection, or high-accurate detection of blurry video frames. In this paper, various video object and human action detection approaches are reviewed and discussed, many of them have performed state-of-the-art results. We mainly review and discuss the classic video detection methods with supervised learning. In addition, the frequently-used video object detection and human action recognition datasets are reviewed. Finally, a summarization of the video detection is represented, e.g., the video object and human action detection methods could be classified into frame-by-frame (frame-based) detection, extracting-key-frame detection and using-temporal-information detection; the methods of utilizing temporal information of adjacent video frames are mainly the optical flow method, Long Short-Term Memory and convolution among adjacent frames. MDPI 2021-12-31 /pmc/articles/PMC8781209/ /pubmed/35056238 http://dx.doi.org/10.3390/mi13010072 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Systematic Review
Li, Dengshan
Wang, Rujing
Chen, Peng
Xie, Chengjun
Zhou, Qiong
Jia, Xiufang
Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review
title Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review
title_full Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review
title_fullStr Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review
title_full_unstemmed Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review
title_short Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review
title_sort visual feature learning on video object and human action detection: a systematic review
topic Systematic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8781209/
https://www.ncbi.nlm.nih.gov/pubmed/35056238
http://dx.doi.org/10.3390/mi13010072
work_keys_str_mv AT lidengshan visualfeaturelearningonvideoobjectandhumanactiondetectionasystematicreview
AT wangrujing visualfeaturelearningonvideoobjectandhumanactiondetectionasystematicreview
AT chenpeng visualfeaturelearningonvideoobjectandhumanactiondetectionasystematicreview
AT xiechengjun visualfeaturelearningonvideoobjectandhumanactiondetectionasystematicreview
AT zhouqiong visualfeaturelearningonvideoobjectandhumanactiondetectionasystematicreview
AT jiaxiufang visualfeaturelearningonvideoobjectandhumanactiondetectionasystematicreview