Cargando…

Abnormal behavior capture of video dynamic target based on 3D convolutional neural network

The use of computers to understand video content can accurately and quickly label various videos. Behavior recognition technology can help users filter the video by screening the content. However, this calculation mode, which is only sensitive to the features in a pixel neighborhood, cannot effectiv...

Descripción completa

Detalles Bibliográficos
Autor principal: Chen, Fei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9645454/
https://www.ncbi.nlm.nih.gov/pubmed/36386391
http://dx.doi.org/10.3389/fnbot.2022.1017748
_version_ 1784826967376265216
author Chen, Fei
author_facet Chen, Fei
author_sort Chen, Fei
collection PubMed
description The use of computers to understand video content can accurately and quickly label various videos. Behavior recognition technology can help users filter the video by screening the content. However, this calculation mode, which is only sensitive to the features in a pixel neighborhood, cannot effectively extract cross-frame long-range video features. In addition, the common long-range dependency capture methods are based on pixel pairs, which contain less semantic information and cannot accurately model dependencies. Based on this, this paper generates semantic units with rich semantic information in the form of neighborhood pixel aggregation and proposes a multi-semantic long-range dependency capture algorithm to solve this problem, which makes the established dependency relationship more accurate. At the same time, this paper proposes an early dependency transfer technology to speed up the reasoning speed of the multi-semantic long-range dependency capture algorithm. By embedding the proposed algorithm into the original convolutional neural network, and conducting sufficient performance tests and evaluations on different data sets, it is shown that the proposed algorithm outperforms other current algorithms in terms of recognition accuracy and achieves the optimal recognition effect, which can effectively enhance the long-range dependency capture ability and temporal modeling ability of the convolutional network, and improve the quality of video feature representation.
format Online
Article
Text
id pubmed-9645454
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-96454542022-11-15 Abnormal behavior capture of video dynamic target based on 3D convolutional neural network Chen, Fei Front Neurorobot Neuroscience The use of computers to understand video content can accurately and quickly label various videos. Behavior recognition technology can help users filter the video by screening the content. However, this calculation mode, which is only sensitive to the features in a pixel neighborhood, cannot effectively extract cross-frame long-range video features. In addition, the common long-range dependency capture methods are based on pixel pairs, which contain less semantic information and cannot accurately model dependencies. Based on this, this paper generates semantic units with rich semantic information in the form of neighborhood pixel aggregation and proposes a multi-semantic long-range dependency capture algorithm to solve this problem, which makes the established dependency relationship more accurate. At the same time, this paper proposes an early dependency transfer technology to speed up the reasoning speed of the multi-semantic long-range dependency capture algorithm. By embedding the proposed algorithm into the original convolutional neural network, and conducting sufficient performance tests and evaluations on different data sets, it is shown that the proposed algorithm outperforms other current algorithms in terms of recognition accuracy and achieves the optimal recognition effect, which can effectively enhance the long-range dependency capture ability and temporal modeling ability of the convolutional network, and improve the quality of video feature representation. Frontiers Media S.A. 2022-10-26 /pmc/articles/PMC9645454/ /pubmed/36386391 http://dx.doi.org/10.3389/fnbot.2022.1017748 Text en Copyright © 2022 Chen. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Chen, Fei
Abnormal behavior capture of video dynamic target based on 3D convolutional neural network
title Abnormal behavior capture of video dynamic target based on 3D convolutional neural network
title_full Abnormal behavior capture of video dynamic target based on 3D convolutional neural network
title_fullStr Abnormal behavior capture of video dynamic target based on 3D convolutional neural network
title_full_unstemmed Abnormal behavior capture of video dynamic target based on 3D convolutional neural network
title_short Abnormal behavior capture of video dynamic target based on 3D convolutional neural network
title_sort abnormal behavior capture of video dynamic target based on 3d convolutional neural network
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9645454/
https://www.ncbi.nlm.nih.gov/pubmed/36386391
http://dx.doi.org/10.3389/fnbot.2022.1017748
work_keys_str_mv AT chenfei abnormalbehaviorcaptureofvideodynamictargetbasedon3dconvolutionalneuralnetwork