Cargando…

Zero-Shot Action Recognition with Three-Stream Graph Convolutional Networks †

Large datasets are often used to improve the accuracy of action recognition. However, very large datasets are problematic as, for example, the annotation of large datasets is labor-intensive. This has encouraged research in zero-shot action recognition (ZSAR). Presently, most ZSAR methods recognize...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wu, Nan, Kawamoto, Kazuhiko
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8198984/ https://www.ncbi.nlm.nih.gov/pubmed/34070872 http://dx.doi.org/10.3390/s21113793

_version_	1783707269025759232
author	Wu, Nan Kawamoto, Kazuhiko
author_facet	Wu, Nan Kawamoto, Kazuhiko
author_sort	Wu, Nan
collection	PubMed
description	Large datasets are often used to improve the accuracy of action recognition. However, very large datasets are problematic as, for example, the annotation of large datasets is labor-intensive. This has encouraged research in zero-shot action recognition (ZSAR). Presently, most ZSAR methods recognize actions according to each video frame. These methods are affected by light, camera angle, and background, and most methods are unable to process time series data. The accuracy of the model is reduced owing to these reasons. In this paper, in order to solve these problems, we propose a three-stream graph convolutional network that processes both types of data. Our model has two parts. One part can process RGB data, which contains extensive useful information. The other part can process skeleton data, which is not affected by light and background. By combining these two outputs with a weighted sum, our model predicts the final results for ZSAR. Experiments conducted on three datasets demonstrate that our model has greater accuracy than a baseline model. Moreover, we also prove that our model can learn from human experience, which can make the model more accurate.
format	Online Article Text
id	pubmed-8198984
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-81989842021-06-14 Zero-Shot Action Recognition with Three-Stream Graph Convolutional Networks † Wu, Nan Kawamoto, Kazuhiko Sensors (Basel) Article Large datasets are often used to improve the accuracy of action recognition. However, very large datasets are problematic as, for example, the annotation of large datasets is labor-intensive. This has encouraged research in zero-shot action recognition (ZSAR). Presently, most ZSAR methods recognize actions according to each video frame. These methods are affected by light, camera angle, and background, and most methods are unable to process time series data. The accuracy of the model is reduced owing to these reasons. In this paper, in order to solve these problems, we propose a three-stream graph convolutional network that processes both types of data. Our model has two parts. One part can process RGB data, which contains extensive useful information. The other part can process skeleton data, which is not affected by light and background. By combining these two outputs with a weighted sum, our model predicts the final results for ZSAR. Experiments conducted on three datasets demonstrate that our model has greater accuracy than a baseline model. Moreover, we also prove that our model can learn from human experience, which can make the model more accurate. MDPI 2021-05-30 /pmc/articles/PMC8198984/ /pubmed/34070872 http://dx.doi.org/10.3390/s21113793 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Wu, Nan Kawamoto, Kazuhiko Zero-Shot Action Recognition with Three-Stream Graph Convolutional Networks †
title	Zero-Shot Action Recognition with Three-Stream Graph Convolutional Networks †
title_full	Zero-Shot Action Recognition with Three-Stream Graph Convolutional Networks †
title_fullStr	Zero-Shot Action Recognition with Three-Stream Graph Convolutional Networks †
title_full_unstemmed	Zero-Shot Action Recognition with Three-Stream Graph Convolutional Networks †
title_short	Zero-Shot Action Recognition with Three-Stream Graph Convolutional Networks †
title_sort	zero-shot action recognition with three-stream graph convolutional networks †
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8198984/ https://www.ncbi.nlm.nih.gov/pubmed/34070872 http://dx.doi.org/10.3390/s21113793
work_keys_str_mv	AT wunan zeroshotactionrecognitionwiththreestreamgraphconvolutionalnetworks AT kawamotokazuhiko zeroshotactionrecognitionwiththreestreamgraphconvolutionalnetworks

Zero-Shot Action Recognition with Three-Stream Graph Convolutional Networks †

Ejemplares similares