Cargando…
Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images
Solid developments have been seen in deep-learning-based pose estimation, but few works have explored performance in dense crowds, such as a classroom scene; furthermore, no specific knowledge is considered in the design of image augmentation for pose estimation. A masked autoencoder was shown to ha...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9655343/ https://www.ncbi.nlm.nih.gov/pubmed/36366027 http://dx.doi.org/10.3390/s22218331 |
_version_ | 1784829162365648896 |
---|---|
author | Liu, Shichang Ma, Miao Li, Haiyang Ning, Hanyang Wang, Min |
author_facet | Liu, Shichang Ma, Miao Li, Haiyang Ning, Hanyang Wang, Min |
author_sort | Liu, Shichang |
collection | PubMed |
description | Solid developments have been seen in deep-learning-based pose estimation, but few works have explored performance in dense crowds, such as a classroom scene; furthermore, no specific knowledge is considered in the design of image augmentation for pose estimation. A masked autoencoder was shown to have a non-negligible capability in image reconstruction, where the masking mechanism that randomly drops patches forces the model to build unknown pixels from known pixels. Inspired by this self-supervised learning method, where the restoration of the feature loss induced by the mask is consistent with tackling the occlusion problem in classroom scenarios, we discovered that the transfer performance of the pre-trained weights could be used as a model-based augmentation to overcome the intractable occlusion in classroom pose estimation. In this study, we proposed a top-down pose estimation method that utilized the natural reconstruction capability of missing information of the MAE as an effective occluded image augmentation in a pose estimation task. The difference with the original MAE was that instead of using a 75% random mask ratio, we regarded the keypoint distribution probabilistic heatmap as a reference for masking, which we named Pose Mask. To test the performance of our method in heavily occluded classroom scenes, we collected a new dataset for pose estimation in classroom scenes named Class Pose and conducted many experiments, the results of which showed promising performance. |
format | Online Article Text |
id | pubmed-9655343 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96553432022-11-15 Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images Liu, Shichang Ma, Miao Li, Haiyang Ning, Hanyang Wang, Min Sensors (Basel) Article Solid developments have been seen in deep-learning-based pose estimation, but few works have explored performance in dense crowds, such as a classroom scene; furthermore, no specific knowledge is considered in the design of image augmentation for pose estimation. A masked autoencoder was shown to have a non-negligible capability in image reconstruction, where the masking mechanism that randomly drops patches forces the model to build unknown pixels from known pixels. Inspired by this self-supervised learning method, where the restoration of the feature loss induced by the mask is consistent with tackling the occlusion problem in classroom scenarios, we discovered that the transfer performance of the pre-trained weights could be used as a model-based augmentation to overcome the intractable occlusion in classroom pose estimation. In this study, we proposed a top-down pose estimation method that utilized the natural reconstruction capability of missing information of the MAE as an effective occluded image augmentation in a pose estimation task. The difference with the original MAE was that instead of using a 75% random mask ratio, we regarded the keypoint distribution probabilistic heatmap as a reference for masking, which we named Pose Mask. To test the performance of our method in heavily occluded classroom scenes, we collected a new dataset for pose estimation in classroom scenes named Class Pose and conducted many experiments, the results of which showed promising performance. MDPI 2022-10-30 /pmc/articles/PMC9655343/ /pubmed/36366027 http://dx.doi.org/10.3390/s22218331 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Liu, Shichang Ma, Miao Li, Haiyang Ning, Hanyang Wang, Min Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images |
title | Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images |
title_full | Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images |
title_fullStr | Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images |
title_full_unstemmed | Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images |
title_short | Pose Mask: A Model-Based Augmentation Method for 2D Pose Estimation in Classroom Scenes Using Surveillance Images |
title_sort | pose mask: a model-based augmentation method for 2d pose estimation in classroom scenes using surveillance images |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9655343/ https://www.ncbi.nlm.nih.gov/pubmed/36366027 http://dx.doi.org/10.3390/s22218331 |
work_keys_str_mv | AT liushichang posemaskamodelbasedaugmentationmethodfor2dposeestimationinclassroomscenesusingsurveillanceimages AT mamiao posemaskamodelbasedaugmentationmethodfor2dposeestimationinclassroomscenesusingsurveillanceimages AT lihaiyang posemaskamodelbasedaugmentationmethodfor2dposeestimationinclassroomscenesusingsurveillanceimages AT ninghanyang posemaskamodelbasedaugmentationmethodfor2dposeestimationinclassroomscenesusingsurveillanceimages AT wangmin posemaskamodelbasedaugmentationmethodfor2dposeestimationinclassroomscenesusingsurveillanceimages |