Cargando…

Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting

In this paper, we propose a multi-scene adaptive crowd counting method based on meta-knowledge and multi-task learning. In practice, surveillance cameras are stationarily deployed in various scenes. Considering the extensibility of a surveillance system, the ideal crowd counting method should have a...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Siqi, Pan, Zhisong, Hu, Guyu, Wu, Yang, Li, Yunbo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9104539/
https://www.ncbi.nlm.nih.gov/pubmed/35591010
http://dx.doi.org/10.3390/s22093320
_version_ 1784707818766467072
author Tang, Siqi
Pan, Zhisong
Hu, Guyu
Wu, Yang
Li, Yunbo
author_facet Tang, Siqi
Pan, Zhisong
Hu, Guyu
Wu, Yang
Li, Yunbo
author_sort Tang, Siqi
collection PubMed
description In this paper, we propose a multi-scene adaptive crowd counting method based on meta-knowledge and multi-task learning. In practice, surveillance cameras are stationarily deployed in various scenes. Considering the extensibility of a surveillance system, the ideal crowd counting method should have a strong generalization capability to be deployed in unknown scenes. On the other hand, given the diversity of scenes, it should also effectively suit each scene for better performance. These two objectives are contradictory, so we propose a coarse-to-fine pipeline including meta-knowledge network and multi-task learning. Specifically, at the coarse-grained stage, we propose a generic two-stream network for all existing scenes to encode meta-knowledge especially inter-frame temporal knowledge. At the fine-grained stage, the regression of the crowd density map to the overall number of people in each scene is considered a homogeneous subtask in a multi-task framework. A robust multi-task learning algorithm is applied to effectively learn scene-specific regression parameters for existing and new scenes, which further improve the accuracy of each specific scenes. Taking advantage of multi-task learning, the proposed method can be deployed to multiple new scenes without duplicated model training. Compared with two representative methods, namely AMSNet and MAML-counting, the proposed method reduces the MAE by 10.29% and 13.48%, respectively.
format Online
Article
Text
id pubmed-9104539
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91045392022-05-14 Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting Tang, Siqi Pan, Zhisong Hu, Guyu Wu, Yang Li, Yunbo Sensors (Basel) Article In this paper, we propose a multi-scene adaptive crowd counting method based on meta-knowledge and multi-task learning. In practice, surveillance cameras are stationarily deployed in various scenes. Considering the extensibility of a surveillance system, the ideal crowd counting method should have a strong generalization capability to be deployed in unknown scenes. On the other hand, given the diversity of scenes, it should also effectively suit each scene for better performance. These two objectives are contradictory, so we propose a coarse-to-fine pipeline including meta-knowledge network and multi-task learning. Specifically, at the coarse-grained stage, we propose a generic two-stream network for all existing scenes to encode meta-knowledge especially inter-frame temporal knowledge. At the fine-grained stage, the regression of the crowd density map to the overall number of people in each scene is considered a homogeneous subtask in a multi-task framework. A robust multi-task learning algorithm is applied to effectively learn scene-specific regression parameters for existing and new scenes, which further improve the accuracy of each specific scenes. Taking advantage of multi-task learning, the proposed method can be deployed to multiple new scenes without duplicated model training. Compared with two representative methods, namely AMSNet and MAML-counting, the proposed method reduces the MAE by 10.29% and 13.48%, respectively. MDPI 2022-04-26 /pmc/articles/PMC9104539/ /pubmed/35591010 http://dx.doi.org/10.3390/s22093320 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tang, Siqi
Pan, Zhisong
Hu, Guyu
Wu, Yang
Li, Yunbo
Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting
title Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting
title_full Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting
title_fullStr Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting
title_full_unstemmed Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting
title_short Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting
title_sort meta-knowledge and multi-task learning-based multi-scene adaptive crowd counting
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9104539/
https://www.ncbi.nlm.nih.gov/pubmed/35591010
http://dx.doi.org/10.3390/s22093320
work_keys_str_mv AT tangsiqi metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting
AT panzhisong metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting
AT huguyu metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting
AT wuyang metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting
AT liyunbo metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting