Cargando…

Counting Crowds with Perspective Distortion Correction via Adaptive Learning

The goal of crowd counting is to estimate the number of people in the image. Presently, use regression to count people number became a mainstream method. It is worth noting that, with the development of convolutional neural networks (CNN), methods that are based on CNN have become a research hotspot...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sun, Yixuan, Jin, Jian, Wu, Xingjiao, Ma, Tianlong, Yang, Jing
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7374275/ https://www.ncbi.nlm.nih.gov/pubmed/32640552 http://dx.doi.org/10.3390/s20133781

_version_	1783561661455532032
author	Sun, Yixuan Jin, Jian Wu, Xingjiao Ma, Tianlong Yang, Jing
author_facet	Sun, Yixuan Jin, Jian Wu, Xingjiao Ma, Tianlong Yang, Jing
author_sort	Sun, Yixuan
collection	PubMed
description	The goal of crowd counting is to estimate the number of people in the image. Presently, use regression to count people number became a mainstream method. It is worth noting that, with the development of convolutional neural networks (CNN), methods that are based on CNN have become a research hotspot. It is a more interesting topic that how to locate the site of the person in the image than simply predicting the number of people in the image. The perspective transformation present is still a challenge, because perspective distortion will cause differences in the size of the crowd in the image. To devote perspective distortion and locate the site of the person more accuracy, we design a novel framework named Adaptive Learning Network (CAL). We use the VGG as the backbone. After each pooling layer is output, we collect the 1/2, 1/4, 1/8, and 1/16 features of the original image and combine them with the weights learned by an adaptive learning branch. The object of our adaptive learning branch is each image in the datasets. By combining the output features of different sizes of each image, the challenge of drastic changes in the size of the image crowd due to perspective transformation is reduced. We conducted experiments on four population counting data sets (i.e., ShanghaiTech Part A, ShanghaiTech Part B, UCF_CC_50 and UCF-QNRF), and the results show that our model has a good performance.
format	Online Article Text
id	pubmed-7374275
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-73742752020-08-05 Counting Crowds with Perspective Distortion Correction via Adaptive Learning Sun, Yixuan Jin, Jian Wu, Xingjiao Ma, Tianlong Yang, Jing Sensors (Basel) Article The goal of crowd counting is to estimate the number of people in the image. Presently, use regression to count people number became a mainstream method. It is worth noting that, with the development of convolutional neural networks (CNN), methods that are based on CNN have become a research hotspot. It is a more interesting topic that how to locate the site of the person in the image than simply predicting the number of people in the image. The perspective transformation present is still a challenge, because perspective distortion will cause differences in the size of the crowd in the image. To devote perspective distortion and locate the site of the person more accuracy, we design a novel framework named Adaptive Learning Network (CAL). We use the VGG as the backbone. After each pooling layer is output, we collect the 1/2, 1/4, 1/8, and 1/16 features of the original image and combine them with the weights learned by an adaptive learning branch. The object of our adaptive learning branch is each image in the datasets. By combining the output features of different sizes of each image, the challenge of drastic changes in the size of the image crowd due to perspective transformation is reduced. We conducted experiments on four population counting data sets (i.e., ShanghaiTech Part A, ShanghaiTech Part B, UCF_CC_50 and UCF-QNRF), and the results show that our model has a good performance. MDPI 2020-07-06 /pmc/articles/PMC7374275/ /pubmed/32640552 http://dx.doi.org/10.3390/s20133781 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Sun, Yixuan Jin, Jian Wu, Xingjiao Ma, Tianlong Yang, Jing Counting Crowds with Perspective Distortion Correction via Adaptive Learning
title	Counting Crowds with Perspective Distortion Correction via Adaptive Learning
title_full	Counting Crowds with Perspective Distortion Correction via Adaptive Learning
title_fullStr	Counting Crowds with Perspective Distortion Correction via Adaptive Learning
title_full_unstemmed	Counting Crowds with Perspective Distortion Correction via Adaptive Learning
title_short	Counting Crowds with Perspective Distortion Correction via Adaptive Learning
title_sort	counting crowds with perspective distortion correction via adaptive learning
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7374275/ https://www.ncbi.nlm.nih.gov/pubmed/32640552 http://dx.doi.org/10.3390/s20133781
work_keys_str_mv	AT sunyixuan countingcrowdswithperspectivedistortioncorrectionviaadaptivelearning AT jinjian countingcrowdswithperspectivedistortioncorrectionviaadaptivelearning AT wuxingjiao countingcrowdswithperspectivedistortioncorrectionviaadaptivelearning AT matianlong countingcrowdswithperspectivedistortioncorrectionviaadaptivelearning AT yangjing countingcrowdswithperspectivedistortioncorrectionviaadaptivelearning

Counting Crowds with Perspective Distortion Correction via Adaptive Learning

Ejemplares similares