Cargando…

DUA: A Domain-Unified Approach for Cross-Dataset 3D Human Pose Estimation

Human pose estimation is an important Computer Vision problem, whose goal is to estimate the human body through joints. Currently, methods that employ deep learning techniques excel in the task of 2D human pose estimation. However, the use of 3D poses can bring more accurate and robust results. Sinc...

Descripción completa

Detalles Bibliográficos
Autores principales: Manesco, João Renato Ribeiro, Berretti, Stefano, Marana, Aparecido Nilceu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10490197/
https://www.ncbi.nlm.nih.gov/pubmed/37687768
http://dx.doi.org/10.3390/s23177312
_version_ 1785103788181291008
author Manesco, João Renato Ribeiro
Berretti, Stefano
Marana, Aparecido Nilceu
author_facet Manesco, João Renato Ribeiro
Berretti, Stefano
Marana, Aparecido Nilceu
author_sort Manesco, João Renato Ribeiro
collection PubMed
description Human pose estimation is an important Computer Vision problem, whose goal is to estimate the human body through joints. Currently, methods that employ deep learning techniques excel in the task of 2D human pose estimation. However, the use of 3D poses can bring more accurate and robust results. Since 3D pose labels can only be acquired in restricted scenarios, fully convolutional methods tend to perform poorly on the task. One strategy to solve this problem is to use 2D pose estimators, to estimate 3D poses in two steps using 2D pose inputs. Due to database acquisition constraints, the performance improvement of this strategy can only be observed in controlled environments, therefore domain adaptation techniques can be used to increase the generalization capability of the system by inserting information from synthetic domains. In this work, we propose a novel method called Domain Unified approach, aimed at solving pose misalignment problems on a cross-dataset scenario, through a combination of three modules on top of the pose estimator: pose converter, uncertainty estimator, and domain classifier. Our method led to a 44.1mm (29.24%) error reduction, when training with the SURREAL synthetic dataset and evaluating with Human3.6M over a no-adaption scenario, achieving state-of-the-art performance.
format Online
Article
Text
id pubmed-10490197
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-104901972023-09-09 DUA: A Domain-Unified Approach for Cross-Dataset 3D Human Pose Estimation Manesco, João Renato Ribeiro Berretti, Stefano Marana, Aparecido Nilceu Sensors (Basel) Article Human pose estimation is an important Computer Vision problem, whose goal is to estimate the human body through joints. Currently, methods that employ deep learning techniques excel in the task of 2D human pose estimation. However, the use of 3D poses can bring more accurate and robust results. Since 3D pose labels can only be acquired in restricted scenarios, fully convolutional methods tend to perform poorly on the task. One strategy to solve this problem is to use 2D pose estimators, to estimate 3D poses in two steps using 2D pose inputs. Due to database acquisition constraints, the performance improvement of this strategy can only be observed in controlled environments, therefore domain adaptation techniques can be used to increase the generalization capability of the system by inserting information from synthetic domains. In this work, we propose a novel method called Domain Unified approach, aimed at solving pose misalignment problems on a cross-dataset scenario, through a combination of three modules on top of the pose estimator: pose converter, uncertainty estimator, and domain classifier. Our method led to a 44.1mm (29.24%) error reduction, when training with the SURREAL synthetic dataset and evaluating with Human3.6M over a no-adaption scenario, achieving state-of-the-art performance. MDPI 2023-08-22 /pmc/articles/PMC10490197/ /pubmed/37687768 http://dx.doi.org/10.3390/s23177312 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Manesco, João Renato Ribeiro
Berretti, Stefano
Marana, Aparecido Nilceu
DUA: A Domain-Unified Approach for Cross-Dataset 3D Human Pose Estimation
title DUA: A Domain-Unified Approach for Cross-Dataset 3D Human Pose Estimation
title_full DUA: A Domain-Unified Approach for Cross-Dataset 3D Human Pose Estimation
title_fullStr DUA: A Domain-Unified Approach for Cross-Dataset 3D Human Pose Estimation
title_full_unstemmed DUA: A Domain-Unified Approach for Cross-Dataset 3D Human Pose Estimation
title_short DUA: A Domain-Unified Approach for Cross-Dataset 3D Human Pose Estimation
title_sort dua: a domain-unified approach for cross-dataset 3d human pose estimation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10490197/
https://www.ncbi.nlm.nih.gov/pubmed/37687768
http://dx.doi.org/10.3390/s23177312
work_keys_str_mv AT manescojoaorenatoribeiro duaadomainunifiedapproachforcrossdataset3dhumanposeestimation
AT berrettistefano duaadomainunifiedapproachforcrossdataset3dhumanposeestimation
AT maranaaparecidonilceu duaadomainunifiedapproachforcrossdataset3dhumanposeestimation