Cargando…

Contextual Action Cues from Camera Sensor for Multi-Stream Action Recognition

In action recognition research, two primary types of information are appearance and motion information that is learned from RGB images through visual sensors. However, depending on the action characteristics, contextual information, such as the existence of specific objects or globally-shared inform...

Descripción completa

Detalles Bibliográficos
Autores principales: Hong, Jongkwang, Cho, Bora, Hong, Yong Won, Byun, Hyeran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6471330/
https://www.ncbi.nlm.nih.gov/pubmed/30897792
http://dx.doi.org/10.3390/s19061382
_version_ 1783412004101292032
author Hong, Jongkwang
Cho, Bora
Hong, Yong Won
Byun, Hyeran
author_facet Hong, Jongkwang
Cho, Bora
Hong, Yong Won
Byun, Hyeran
author_sort Hong, Jongkwang
collection PubMed
description In action recognition research, two primary types of information are appearance and motion information that is learned from RGB images through visual sensors. However, depending on the action characteristics, contextual information, such as the existence of specific objects or globally-shared information in the image, becomes vital information to define the action. For example, the existence of the ball is vital information distinguishing “kicking” from “running”. Furthermore, some actions share typical global abstract poses, which can be used as a key to classify actions. Based on these observations, we propose the multi-stream network model, which incorporates spatial, temporal, and contextual cues in the image for action recognition. We experimented on the proposed method using C3D or inflated 3D ConvNet (I3D) as a backbone network, regarding two different action recognition datasets. As a result, we observed overall improvement in accuracy, demonstrating the effectiveness of our proposed method.
format Online
Article
Text
id pubmed-6471330
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64713302019-04-26 Contextual Action Cues from Camera Sensor for Multi-Stream Action Recognition Hong, Jongkwang Cho, Bora Hong, Yong Won Byun, Hyeran Sensors (Basel) Article In action recognition research, two primary types of information are appearance and motion information that is learned from RGB images through visual sensors. However, depending on the action characteristics, contextual information, such as the existence of specific objects or globally-shared information in the image, becomes vital information to define the action. For example, the existence of the ball is vital information distinguishing “kicking” from “running”. Furthermore, some actions share typical global abstract poses, which can be used as a key to classify actions. Based on these observations, we propose the multi-stream network model, which incorporates spatial, temporal, and contextual cues in the image for action recognition. We experimented on the proposed method using C3D or inflated 3D ConvNet (I3D) as a backbone network, regarding two different action recognition datasets. As a result, we observed overall improvement in accuracy, demonstrating the effectiveness of our proposed method. MDPI 2019-03-20 /pmc/articles/PMC6471330/ /pubmed/30897792 http://dx.doi.org/10.3390/s19061382 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hong, Jongkwang
Cho, Bora
Hong, Yong Won
Byun, Hyeran
Contextual Action Cues from Camera Sensor for Multi-Stream Action Recognition
title Contextual Action Cues from Camera Sensor for Multi-Stream Action Recognition
title_full Contextual Action Cues from Camera Sensor for Multi-Stream Action Recognition
title_fullStr Contextual Action Cues from Camera Sensor for Multi-Stream Action Recognition
title_full_unstemmed Contextual Action Cues from Camera Sensor for Multi-Stream Action Recognition
title_short Contextual Action Cues from Camera Sensor for Multi-Stream Action Recognition
title_sort contextual action cues from camera sensor for multi-stream action recognition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6471330/
https://www.ncbi.nlm.nih.gov/pubmed/30897792
http://dx.doi.org/10.3390/s19061382
work_keys_str_mv AT hongjongkwang contextualactioncuesfromcamerasensorformultistreamactionrecognition
AT chobora contextualactioncuesfromcamerasensorformultistreamactionrecognition
AT hongyongwon contextualactioncuesfromcamerasensorformultistreamactionrecognition
AT byunhyeran contextualactioncuesfromcamerasensorformultistreamactionrecognition