Cargando…
Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting
In this paper, we propose a multi-scene adaptive crowd counting method based on meta-knowledge and multi-task learning. In practice, surveillance cameras are stationarily deployed in various scenes. Considering the extensibility of a surveillance system, the ideal crowd counting method should have a...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9104539/ https://www.ncbi.nlm.nih.gov/pubmed/35591010 http://dx.doi.org/10.3390/s22093320 |
_version_ | 1784707818766467072 |
---|---|
author | Tang, Siqi Pan, Zhisong Hu, Guyu Wu, Yang Li, Yunbo |
author_facet | Tang, Siqi Pan, Zhisong Hu, Guyu Wu, Yang Li, Yunbo |
author_sort | Tang, Siqi |
collection | PubMed |
description | In this paper, we propose a multi-scene adaptive crowd counting method based on meta-knowledge and multi-task learning. In practice, surveillance cameras are stationarily deployed in various scenes. Considering the extensibility of a surveillance system, the ideal crowd counting method should have a strong generalization capability to be deployed in unknown scenes. On the other hand, given the diversity of scenes, it should also effectively suit each scene for better performance. These two objectives are contradictory, so we propose a coarse-to-fine pipeline including meta-knowledge network and multi-task learning. Specifically, at the coarse-grained stage, we propose a generic two-stream network for all existing scenes to encode meta-knowledge especially inter-frame temporal knowledge. At the fine-grained stage, the regression of the crowd density map to the overall number of people in each scene is considered a homogeneous subtask in a multi-task framework. A robust multi-task learning algorithm is applied to effectively learn scene-specific regression parameters for existing and new scenes, which further improve the accuracy of each specific scenes. Taking advantage of multi-task learning, the proposed method can be deployed to multiple new scenes without duplicated model training. Compared with two representative methods, namely AMSNet and MAML-counting, the proposed method reduces the MAE by 10.29% and 13.48%, respectively. |
format | Online Article Text |
id | pubmed-9104539 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-91045392022-05-14 Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting Tang, Siqi Pan, Zhisong Hu, Guyu Wu, Yang Li, Yunbo Sensors (Basel) Article In this paper, we propose a multi-scene adaptive crowd counting method based on meta-knowledge and multi-task learning. In practice, surveillance cameras are stationarily deployed in various scenes. Considering the extensibility of a surveillance system, the ideal crowd counting method should have a strong generalization capability to be deployed in unknown scenes. On the other hand, given the diversity of scenes, it should also effectively suit each scene for better performance. These two objectives are contradictory, so we propose a coarse-to-fine pipeline including meta-knowledge network and multi-task learning. Specifically, at the coarse-grained stage, we propose a generic two-stream network for all existing scenes to encode meta-knowledge especially inter-frame temporal knowledge. At the fine-grained stage, the regression of the crowd density map to the overall number of people in each scene is considered a homogeneous subtask in a multi-task framework. A robust multi-task learning algorithm is applied to effectively learn scene-specific regression parameters for existing and new scenes, which further improve the accuracy of each specific scenes. Taking advantage of multi-task learning, the proposed method can be deployed to multiple new scenes without duplicated model training. Compared with two representative methods, namely AMSNet and MAML-counting, the proposed method reduces the MAE by 10.29% and 13.48%, respectively. MDPI 2022-04-26 /pmc/articles/PMC9104539/ /pubmed/35591010 http://dx.doi.org/10.3390/s22093320 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Tang, Siqi Pan, Zhisong Hu, Guyu Wu, Yang Li, Yunbo Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting |
title | Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting |
title_full | Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting |
title_fullStr | Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting |
title_full_unstemmed | Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting |
title_short | Meta-Knowledge and Multi-Task Learning-Based Multi-Scene Adaptive Crowd Counting |
title_sort | meta-knowledge and multi-task learning-based multi-scene adaptive crowd counting |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9104539/ https://www.ncbi.nlm.nih.gov/pubmed/35591010 http://dx.doi.org/10.3390/s22093320 |
work_keys_str_mv | AT tangsiqi metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting AT panzhisong metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting AT huguyu metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting AT wuyang metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting AT liyunbo metaknowledgeandmultitasklearningbasedmultisceneadaptivecrowdcounting |