Cargando…
Crowd Counting with Semantic Scene Segmentation in Helicopter Footage
Continually improving crowd counting neural networks have been developed in recent years. The accuracy of these networks has reached such high levels that further improvement is becoming very difficult. However, this high accuracy lacks deeper semantic information, such as social roles (e.g., studen...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506704/ https://www.ncbi.nlm.nih.gov/pubmed/32867289 http://dx.doi.org/10.3390/s20174855 |
_version_ | 1783585075098550272 |
---|---|
author | Csönde, Gergely Sekimoto, Yoshihide Kashiyama, Takehiro |
author_facet | Csönde, Gergely Sekimoto, Yoshihide Kashiyama, Takehiro |
author_sort | Csönde, Gergely |
collection | PubMed |
description | Continually improving crowd counting neural networks have been developed in recent years. The accuracy of these networks has reached such high levels that further improvement is becoming very difficult. However, this high accuracy lacks deeper semantic information, such as social roles (e.g., student, company worker, or police officer) or location-based roles (e.g., pedestrian, tenant, or construction worker). Some of these can be learned from the same set of features as the human nature of an entity, whereas others require wider contextual information from the human surroundings. The primary end-goal of developing recognition software is to involve them in autonomous decision-making systems. Therefore, it must be foolproof, which is, it must have good semantic understanding of the input. In this study, we focus on counting pedestrians in helicopter footage and introduce a dataset created from helicopter videos for this purpose. We use semantic segmentation to extract the required additional contextual information from the surroundings of an entity. We demonstrate that it is possible to increase the pedestrian counting accuracy in this manner. Furthermore, we show that crowd counting and semantic segmentation can be simultaneously achieved, with comparable or even improved accuracy, by using the same crowd counting neural network for both tasks through hard parameter sharing. The presented method is generic and it can be applied to arbitrary crowd density estimation methods. A link to the dataset is available at the end of the paper. |
format | Online Article Text |
id | pubmed-7506704 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75067042020-09-26 Crowd Counting with Semantic Scene Segmentation in Helicopter Footage Csönde, Gergely Sekimoto, Yoshihide Kashiyama, Takehiro Sensors (Basel) Article Continually improving crowd counting neural networks have been developed in recent years. The accuracy of these networks has reached such high levels that further improvement is becoming very difficult. However, this high accuracy lacks deeper semantic information, such as social roles (e.g., student, company worker, or police officer) or location-based roles (e.g., pedestrian, tenant, or construction worker). Some of these can be learned from the same set of features as the human nature of an entity, whereas others require wider contextual information from the human surroundings. The primary end-goal of developing recognition software is to involve them in autonomous decision-making systems. Therefore, it must be foolproof, which is, it must have good semantic understanding of the input. In this study, we focus on counting pedestrians in helicopter footage and introduce a dataset created from helicopter videos for this purpose. We use semantic segmentation to extract the required additional contextual information from the surroundings of an entity. We demonstrate that it is possible to increase the pedestrian counting accuracy in this manner. Furthermore, we show that crowd counting and semantic segmentation can be simultaneously achieved, with comparable or even improved accuracy, by using the same crowd counting neural network for both tasks through hard parameter sharing. The presented method is generic and it can be applied to arbitrary crowd density estimation methods. A link to the dataset is available at the end of the paper. MDPI 2020-08-27 /pmc/articles/PMC7506704/ /pubmed/32867289 http://dx.doi.org/10.3390/s20174855 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Csönde, Gergely Sekimoto, Yoshihide Kashiyama, Takehiro Crowd Counting with Semantic Scene Segmentation in Helicopter Footage |
title | Crowd Counting with Semantic Scene Segmentation in Helicopter Footage |
title_full | Crowd Counting with Semantic Scene Segmentation in Helicopter Footage |
title_fullStr | Crowd Counting with Semantic Scene Segmentation in Helicopter Footage |
title_full_unstemmed | Crowd Counting with Semantic Scene Segmentation in Helicopter Footage |
title_short | Crowd Counting with Semantic Scene Segmentation in Helicopter Footage |
title_sort | crowd counting with semantic scene segmentation in helicopter footage |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506704/ https://www.ncbi.nlm.nih.gov/pubmed/32867289 http://dx.doi.org/10.3390/s20174855 |
work_keys_str_mv | AT csondegergely crowdcountingwithsemanticscenesegmentationinhelicopterfootage AT sekimotoyoshihide crowdcountingwithsemanticscenesegmentationinhelicopterfootage AT kashiyamatakehiro crowdcountingwithsemanticscenesegmentationinhelicopterfootage |