Cargando…

Crowd Counting with Semantic Scene Segmentation in Helicopter Footage

Continually improving crowd counting neural networks have been developed in recent years. The accuracy of these networks has reached such high levels that further improvement is becoming very difficult. However, this high accuracy lacks deeper semantic information, such as social roles (e.g., studen...

Descripción completa

Detalles Bibliográficos
Autores principales: Csönde, Gergely, Sekimoto, Yoshihide, Kashiyama, Takehiro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506704/
https://www.ncbi.nlm.nih.gov/pubmed/32867289
http://dx.doi.org/10.3390/s20174855
_version_ 1783585075098550272
author Csönde, Gergely
Sekimoto, Yoshihide
Kashiyama, Takehiro
author_facet Csönde, Gergely
Sekimoto, Yoshihide
Kashiyama, Takehiro
author_sort Csönde, Gergely
collection PubMed
description Continually improving crowd counting neural networks have been developed in recent years. The accuracy of these networks has reached such high levels that further improvement is becoming very difficult. However, this high accuracy lacks deeper semantic information, such as social roles (e.g., student, company worker, or police officer) or location-based roles (e.g., pedestrian, tenant, or construction worker). Some of these can be learned from the same set of features as the human nature of an entity, whereas others require wider contextual information from the human surroundings. The primary end-goal of developing recognition software is to involve them in autonomous decision-making systems. Therefore, it must be foolproof, which is, it must have good semantic understanding of the input. In this study, we focus on counting pedestrians in helicopter footage and introduce a dataset created from helicopter videos for this purpose. We use semantic segmentation to extract the required additional contextual information from the surroundings of an entity. We demonstrate that it is possible to increase the pedestrian counting accuracy in this manner. Furthermore, we show that crowd counting and semantic segmentation can be simultaneously achieved, with comparable or even improved accuracy, by using the same crowd counting neural network for both tasks through hard parameter sharing. The presented method is generic and it can be applied to arbitrary crowd density estimation methods. A link to the dataset is available at the end of the paper.
format Online
Article
Text
id pubmed-7506704
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75067042020-09-26 Crowd Counting with Semantic Scene Segmentation in Helicopter Footage Csönde, Gergely Sekimoto, Yoshihide Kashiyama, Takehiro Sensors (Basel) Article Continually improving crowd counting neural networks have been developed in recent years. The accuracy of these networks has reached such high levels that further improvement is becoming very difficult. However, this high accuracy lacks deeper semantic information, such as social roles (e.g., student, company worker, or police officer) or location-based roles (e.g., pedestrian, tenant, or construction worker). Some of these can be learned from the same set of features as the human nature of an entity, whereas others require wider contextual information from the human surroundings. The primary end-goal of developing recognition software is to involve them in autonomous decision-making systems. Therefore, it must be foolproof, which is, it must have good semantic understanding of the input. In this study, we focus on counting pedestrians in helicopter footage and introduce a dataset created from helicopter videos for this purpose. We use semantic segmentation to extract the required additional contextual information from the surroundings of an entity. We demonstrate that it is possible to increase the pedestrian counting accuracy in this manner. Furthermore, we show that crowd counting and semantic segmentation can be simultaneously achieved, with comparable or even improved accuracy, by using the same crowd counting neural network for both tasks through hard parameter sharing. The presented method is generic and it can be applied to arbitrary crowd density estimation methods. A link to the dataset is available at the end of the paper. MDPI 2020-08-27 /pmc/articles/PMC7506704/ /pubmed/32867289 http://dx.doi.org/10.3390/s20174855 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Csönde, Gergely
Sekimoto, Yoshihide
Kashiyama, Takehiro
Crowd Counting with Semantic Scene Segmentation in Helicopter Footage
title Crowd Counting with Semantic Scene Segmentation in Helicopter Footage
title_full Crowd Counting with Semantic Scene Segmentation in Helicopter Footage
title_fullStr Crowd Counting with Semantic Scene Segmentation in Helicopter Footage
title_full_unstemmed Crowd Counting with Semantic Scene Segmentation in Helicopter Footage
title_short Crowd Counting with Semantic Scene Segmentation in Helicopter Footage
title_sort crowd counting with semantic scene segmentation in helicopter footage
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506704/
https://www.ncbi.nlm.nih.gov/pubmed/32867289
http://dx.doi.org/10.3390/s20174855
work_keys_str_mv AT csondegergely crowdcountingwithsemanticscenesegmentationinhelicopterfootage
AT sekimotoyoshihide crowdcountingwithsemanticscenesegmentationinhelicopterfootage
AT kashiyamatakehiro crowdcountingwithsemanticscenesegmentationinhelicopterfootage