Cargando…

Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting

In this paper, we propose a multi-scene adaptive crowd counting method based on meta-knowledge and multi-task learning. In practice, surveillance cameras are stationarily deployed in various scenes. Considering the extensibility of a surveillance system, the ideal crowd counting method should have a...

Descripción completa

Detalles Bibliográficos
Autores principales:	Tang, Siqi, Pan, Zhisong, Hu, Guyu, Wu, Yang, Li, Yunbo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9104539/ https://www.ncbi.nlm.nih.gov/pubmed/35591010 http://dx.doi.org/10.3390/s22093320

_version_	1784707818766467072
author	Tang, Siqi Pan, Zhisong Hu, Guyu Wu, Yang Li, Yunbo
author_facet	Tang, Siqi Pan, Zhisong Hu, Guyu Wu, Yang Li, Yunbo
author_sort	Tang, Siqi
collection	PubMed
description	In this paper, we propose a multi-scene adaptive crowd counting method based on meta-knowledge and multi-task learning. In practice, surveillance cameras are stationarily deployed in various scenes. Considering the extensibility of a surveillance system, the ideal crowd counting method should have a strong generalization capability to be deployed in unknown scenes. On the other hand, given the diversity of scenes, it should also effectively suit each scene for better performance. These two objectives are contradictory, so we propose a coarse-to-fine pipeline including meta-knowledge network and multi-task learning. Specifically, at the coarse-grained stage, we propose a generic two-stream network for all existing scenes to encode meta-knowledge especially inter-frame temporal knowledge. At the fine-grained stage, the regression of the crowd density map to the overall number of people in each scene is considered a homogeneous subtask in a multi-task framework. A robust multi-task learning algorithm is applied to effectively learn scene-specific regression parameters for existing and new scenes, which further improve the accuracy of each specific scenes. Taking advantage of multi-task learning, the proposed method can be deployed to multiple new scenes without duplicated model training. Compared with two representative methods, namely AMSNet and MAML-counting, the proposed method reduces the MAE by 10.29% and 13.48%, respectively.
format	Online Article Text
id	pubmed-9104539
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-91045392022-05-14 Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting Tang, Siqi Pan, Zhisong Hu, Guyu Wu, Yang Li, Yunbo Sensors (Basel) Article In this paper, we propose a multi-scene adaptive crowd counting method based on meta-knowledge and multi-task learning. In practice, surveillance cameras are stationarily deployed in various scenes. Considering the extensibility of a surveillance system, the ideal crowd counting method should have a strong generalization capability to be deployed in unknown scenes. On the other hand, given the diversity of scenes, it should also effectively suit each scene for better performance. These two objectives are contradictory, so we propose a coarse-to-fine pipeline including meta-knowledge network and multi-task learning. Specifically, at the coarse-grained stage, we propose a generic two-stream network for all existing scenes to encode meta-knowledge especially inter-frame temporal knowledge. At the fine-grained stage, the regression of the crowd density map to the overall number of people in each scene is considered a homogeneous subtask in a multi-task framework. A robust multi-task learning algorithm is applied to effectively learn scene-specific regression parameters for existing and new scenes, which further improve the accuracy of each specific scenes. Taking advantage of multi-task learning, the proposed method can be deployed to multiple new scenes without duplicated model training. Compared with two representative methods, namely AMSNet and MAML-counting, the proposed method reduces the MAE by 10.29% and 13.48%, respectively. MDPI 2022-04-26 /pmc/articles/PMC9104539/ /pubmed/35591010 http://dx.doi.org/10.3390/s22093320 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Tang, Siqi Pan, Zhisong Hu, Guyu Wu, Yang Li, Yunbo Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting
title	Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting
title_full	Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting
title_fullStr	Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting
title_full_unstemmed	Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting
title_short	Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting
title_sort	meta-knowledge and multi-task learning-based multi-scene adaptive crowd counting
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9104539/ https://www.ncbi.nlm.nih.gov/pubmed/35591010 http://dx.doi.org/10.3390/s22093320
work_keys_str_mv	AT tangsiqi metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting AT panzhisong metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting AT huguyu metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting AT wuyang metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting AT liyunbo metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting

Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting

Ejemplares similares