Cargando…

Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images

Solid developments have been seen in deep-learning-based pose estimation, but few works have explored performance in dense crowds, such as a classroom scene; furthermore, no specific knowledge is considered in the design of image augmentation for pose estimation. A masked autoencoder was shown to ha...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Shichang, Ma, Miao, Li, Haiyang, Ning, Hanyang, Wang, Min
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9655343/
https://www.ncbi.nlm.nih.gov/pubmed/36366027
http://dx.doi.org/10.3390/s22218331
_version_ 1784829162365648896
author Liu, Shichang
Ma, Miao
Li, Haiyang
Ning, Hanyang
Wang, Min
author_facet Liu, Shichang
Ma, Miao
Li, Haiyang
Ning, Hanyang
Wang, Min
author_sort Liu, Shichang
collection PubMed
description Solid developments have been seen in deep-learning-based pose estimation, but few works have explored performance in dense crowds, such as a classroom scene; furthermore, no specific knowledge is considered in the design of image augmentation for pose estimation. A masked autoencoder was shown to have a non-negligible capability in image reconstruction, where the masking mechanism that randomly drops patches forces the model to build unknown pixels from known pixels. Inspired by this self-supervised learning method, where the restoration of the feature loss induced by the mask is consistent with tackling the occlusion problem in classroom scenarios, we discovered that the transfer performance of the pre-trained weights could be used as a model-based augmentation to overcome the intractable occlusion in classroom pose estimation. In this study, we proposed a top-down pose estimation method that utilized the natural reconstruction capability of missing information of the MAE as an effective occluded image augmentation in a pose estimation task. The difference with the original MAE was that instead of using a 75% random mask ratio, we regarded the keypoint distribution probabilistic heatmap as a reference for masking, which we named Pose Mask. To test the performance of our method in heavily occluded classroom scenes, we collected a new dataset for pose estimation in classroom scenes named Class Pose and conducted many experiments, the results of which showed promising performance.
format Online
Article
Text
id pubmed-9655343
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96553432022-11-15 Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images Liu, Shichang Ma, Miao Li, Haiyang Ning, Hanyang Wang, Min Sensors (Basel) Article Solid developments have been seen in deep-learning-based pose estimation, but few works have explored performance in dense crowds, such as a classroom scene; furthermore, no specific knowledge is considered in the design of image augmentation for pose estimation. A masked autoencoder was shown to have a non-negligible capability in image reconstruction, where the masking mechanism that randomly drops patches forces the model to build unknown pixels from known pixels. Inspired by this self-supervised learning method, where the restoration of the feature loss induced by the mask is consistent with tackling the occlusion problem in classroom scenarios, we discovered that the transfer performance of the pre-trained weights could be used as a model-based augmentation to overcome the intractable occlusion in classroom pose estimation. In this study, we proposed a top-down pose estimation method that utilized the natural reconstruction capability of missing information of the MAE as an effective occluded image augmentation in a pose estimation task. The difference with the original MAE was that instead of using a 75% random mask ratio, we regarded the keypoint distribution probabilistic heatmap as a reference for masking, which we named Pose Mask. To test the performance of our method in heavily occluded classroom scenes, we collected a new dataset for pose estimation in classroom scenes named Class Pose and conducted many experiments, the results of which showed promising performance. MDPI 2022-10-30 /pmc/articles/PMC9655343/ /pubmed/36366027 http://dx.doi.org/10.3390/s22218331 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liu, Shichang
Ma, Miao
Li, Haiyang
Ning, Hanyang
Wang, Min
Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images
title Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images
title_full Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images
title_fullStr Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images
title_full_unstemmed Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images
title_short Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images
title_sort pose mask: a model-based augmentation method for 2d pose estimation in classroom scenes using surveillance images
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9655343/
https://www.ncbi.nlm.nih.gov/pubmed/36366027
http://dx.doi.org/10.3390/s22218331
work_keys_str_mv AT liushichang posemaskamodelbasedaugmentationmethodfor2dposeestimationinclassroomscenesusingsurveillanceimages
AT mamiao posemaskamodelbasedaugmentationmethodfor2dposeestimationinclassroomscenesusingsurveillanceimages
AT lihaiyang posemaskamodelbasedaugmentationmethodfor2dposeestimationinclassroomscenesusingsurveillanceimages
AT ninghanyang posemaskamodelbasedaugmentationmethodfor2dposeestimationinclassroomscenesusingsurveillanceimages
AT wangmin posemaskamodelbasedaugmentationmethodfor2dposeestimationinclassroomscenesusingsurveillanceimages