Cargando…

An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks

We propose an efficient and novel architecture for 3D articulated human pose retrieval and reconstruction from 2D landmarks extracted from a 2D synthetic image, an annotated 2D image, an in-the-wild real RGB image or even a hand-drawn sketch. Given 2D joint positions in a single image, we devise a d...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yasin, Hashim, Krüger, Björn
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8038066/ https://www.ncbi.nlm.nih.gov/pubmed/33915719 http://dx.doi.org/10.3390/s21072415

_version_	1783677289357115392
author	Yasin, Hashim Krüger, Björn
author_facet	Yasin, Hashim Krüger, Björn
author_sort	Yasin, Hashim
collection	PubMed
description	We propose an efficient and novel architecture for 3D articulated human pose retrieval and reconstruction from 2D landmarks extracted from a 2D synthetic image, an annotated 2D image, an in-the-wild real RGB image or even a hand-drawn sketch. Given 2D joint positions in a single image, we devise a data-driven framework to infer the corresponding 3D human pose. To this end, we first normalize 3D human poses from Motion Capture (MoCap) dataset by eliminating translation, orientation, and the skeleton size discrepancies from the poses and then build a knowledge-base by projecting a subset of joints of the normalized 3D poses onto 2D image-planes by fully exploiting a variety of virtual cameras. With this approach, we not only transform 3D pose space to the normalized 2D pose space but also resolve the 2D-3D cross-domain retrieval task efficiently. The proposed architecture searches for poses from a MoCap dataset that are near to a given 2D query pose in a definite feature space made up of specific joint sets. These retrieved poses are then used to construct a weak perspective camera and a final 3D posture under the camera model that minimizes the reconstruction error. To estimate unknown camera parameters, we introduce a nonlinear, two-fold method. We exploit the retrieved similar poses and the viewing directions at which the MoCap dataset was sampled to minimize the projection error. Finally, we evaluate our approach thoroughly on a large number of heterogeneous 2D examples generated synthetically, 2D images with ground-truth, a variety of real in-the-wild internet images, and a proof of concept using 2D hand-drawn sketches of human poses. We conduct a pool of experiments to perform a quantitative study on PARSE dataset. We also show that the proposed system yields competitive, convincing results in comparison to other state-of-the-art methods.
format	Online Article Text
id	pubmed-8038066
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-80380662021-04-12 An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks Yasin, Hashim Krüger, Björn Sensors (Basel) Article We propose an efficient and novel architecture for 3D articulated human pose retrieval and reconstruction from 2D landmarks extracted from a 2D synthetic image, an annotated 2D image, an in-the-wild real RGB image or even a hand-drawn sketch. Given 2D joint positions in a single image, we devise a data-driven framework to infer the corresponding 3D human pose. To this end, we first normalize 3D human poses from Motion Capture (MoCap) dataset by eliminating translation, orientation, and the skeleton size discrepancies from the poses and then build a knowledge-base by projecting a subset of joints of the normalized 3D poses onto 2D image-planes by fully exploiting a variety of virtual cameras. With this approach, we not only transform 3D pose space to the normalized 2D pose space but also resolve the 2D-3D cross-domain retrieval task efficiently. The proposed architecture searches for poses from a MoCap dataset that are near to a given 2D query pose in a definite feature space made up of specific joint sets. These retrieved poses are then used to construct a weak perspective camera and a final 3D posture under the camera model that minimizes the reconstruction error. To estimate unknown camera parameters, we introduce a nonlinear, two-fold method. We exploit the retrieved similar poses and the viewing directions at which the MoCap dataset was sampled to minimize the projection error. Finally, we evaluate our approach thoroughly on a large number of heterogeneous 2D examples generated synthetically, 2D images with ground-truth, a variety of real in-the-wild internet images, and a proof of concept using 2D hand-drawn sketches of human poses. We conduct a pool of experiments to perform a quantitative study on PARSE dataset. We also show that the proposed system yields competitive, convincing results in comparison to other state-of-the-art methods. MDPI 2021-04-01 /pmc/articles/PMC8038066/ /pubmed/33915719 http://dx.doi.org/10.3390/s21072415 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Yasin, Hashim Krüger, Björn An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks
title	An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks
title_full	An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks
title_fullStr	An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks
title_full_unstemmed	An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks
title_short	An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks
title_sort	efficient 3d human pose retrieval and reconstruction from 2d image-based landmarks
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8038066/ https://www.ncbi.nlm.nih.gov/pubmed/33915719 http://dx.doi.org/10.3390/s21072415
work_keys_str_mv	AT yasinhashim anefficient3dhumanposeretrievalandreconstructionfrom2dimagebasedlandmarks AT krugerbjorn anefficient3dhumanposeretrievalandreconstructionfrom2dimagebasedlandmarks AT yasinhashim efficient3dhumanposeretrievalandreconstructionfrom2dimagebasedlandmarks AT krugerbjorn efficient3dhumanposeretrievalandreconstructionfrom2dimagebasedlandmarks

An Efficient 3D Human Pose Retrieval and Reconstruction from 2D Image-Based Landmarks

Ejemplares similares