Cargando…

Occluded Pedestrian-Attribute Recognition for Video Sensors Using Group Sparsity

Pedestrians are often obstructed by other objects or people in real-world vision sensors. These obstacles make pedestrian-attribute recognition (PAR) difficult; hence, occlusion processing for visual sensing is a key issue in PAR. To address this problem, we first formulate the identification of non...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Geonu, Yun, Kimin, Cho, Jungchan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9460213/
https://www.ncbi.nlm.nih.gov/pubmed/36081084
http://dx.doi.org/10.3390/s22176626
_version_ 1784786691961126912
author Lee, Geonu
Yun, Kimin
Cho, Jungchan
author_facet Lee, Geonu
Yun, Kimin
Cho, Jungchan
author_sort Lee, Geonu
collection PubMed
description Pedestrians are often obstructed by other objects or people in real-world vision sensors. These obstacles make pedestrian-attribute recognition (PAR) difficult; hence, occlusion processing for visual sensing is a key issue in PAR. To address this problem, we first formulate the identification of non-occluded frames as temporal attention based on the sparsity of a crowded video. In other words, a model for PAR is guided to prevent paying attention to the occluded frame. However, we deduced that this approach cannot include a correlation between attributes when occlusion occurs. For example, “boots” and “shoe color” cannot be recognized simultaneously when the foot is invisible. To address the uncorrelated attention issue, we propose a novel temporal-attention module based on group sparsity. Group sparsity is applied across attention weights in correlated attributes. Accordingly, physically-adjacent pedestrian attributes are grouped, and the attention weights of a group are forced to focus on the same frames. Experimental results indicate that the proposed method achieved 1.18% and 6.21% higher [Formula: see text]-scores than the advanced baseline method on the occlusion samples in DukeMTMC-VideoReID and MARS video-based PAR datasets, respectively.
format Online
Article
Text
id pubmed-9460213
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94602132022-09-10 Occluded Pedestrian-Attribute Recognition for Video Sensors Using Group Sparsity Lee, Geonu Yun, Kimin Cho, Jungchan Sensors (Basel) Article Pedestrians are often obstructed by other objects or people in real-world vision sensors. These obstacles make pedestrian-attribute recognition (PAR) difficult; hence, occlusion processing for visual sensing is a key issue in PAR. To address this problem, we first formulate the identification of non-occluded frames as temporal attention based on the sparsity of a crowded video. In other words, a model for PAR is guided to prevent paying attention to the occluded frame. However, we deduced that this approach cannot include a correlation between attributes when occlusion occurs. For example, “boots” and “shoe color” cannot be recognized simultaneously when the foot is invisible. To address the uncorrelated attention issue, we propose a novel temporal-attention module based on group sparsity. Group sparsity is applied across attention weights in correlated attributes. Accordingly, physically-adjacent pedestrian attributes are grouped, and the attention weights of a group are forced to focus on the same frames. Experimental results indicate that the proposed method achieved 1.18% and 6.21% higher [Formula: see text]-scores than the advanced baseline method on the occlusion samples in DukeMTMC-VideoReID and MARS video-based PAR datasets, respectively. MDPI 2022-09-01 /pmc/articles/PMC9460213/ /pubmed/36081084 http://dx.doi.org/10.3390/s22176626 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lee, Geonu
Yun, Kimin
Cho, Jungchan
Occluded Pedestrian-Attribute Recognition for Video Sensors Using Group Sparsity
title Occluded Pedestrian-Attribute Recognition for Video Sensors Using Group Sparsity
title_full Occluded Pedestrian-Attribute Recognition for Video Sensors Using Group Sparsity
title_fullStr Occluded Pedestrian-Attribute Recognition for Video Sensors Using Group Sparsity
title_full_unstemmed Occluded Pedestrian-Attribute Recognition for Video Sensors Using Group Sparsity
title_short Occluded Pedestrian-Attribute Recognition for Video Sensors Using Group Sparsity
title_sort occluded pedestrian-attribute recognition for video sensors using group sparsity
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9460213/
https://www.ncbi.nlm.nih.gov/pubmed/36081084
http://dx.doi.org/10.3390/s22176626
work_keys_str_mv AT leegeonu occludedpedestrianattributerecognitionforvideosensorsusinggroupsparsity
AT yunkimin occludedpedestrianattributerecognitionforvideosensorsusinggroupsparsity
AT chojungchan occludedpedestrianattributerecognitionforvideosensorsusinggroupsparsity