Cargando…

LHPE-nets: A lightweight 2D and 3D human pose estimation model with well-structural deep networks and multi-view pose sample simplification method

The cross-view 3D human pose estimation model has made significant progress, it better completed the task of human joint positioning and skeleton modeling in 3D through multi-view fusion method. The multi-view 2D pose estimation part of this model is very important, but its training cost is also ver...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Hao, Sun, Ming-hui, Zhang, Hao, Dong, Li-yan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8865690/
https://www.ncbi.nlm.nih.gov/pubmed/35196346
http://dx.doi.org/10.1371/journal.pone.0264302
_version_ 1784655681645707264
author Wang, Hao
Sun, Ming-hui
Zhang, Hao
Dong, Li-yan
author_facet Wang, Hao
Sun, Ming-hui
Zhang, Hao
Dong, Li-yan
author_sort Wang, Hao
collection PubMed
description The cross-view 3D human pose estimation model has made significant progress, it better completed the task of human joint positioning and skeleton modeling in 3D through multi-view fusion method. The multi-view 2D pose estimation part of this model is very important, but its training cost is also very high. It uses some deep learning networks to generate heatmaps for each view. Therefore, in this article, we tested some new deep learning networks for pose estimation tasks. These deep networks include Mobilenetv2, Mobilenetv3, Efficientnetv2 and Resnet. Then, based on the performance and drawbacks of these networks, we built multiple deep learning networks with better performance. We call our network in this article LHPE-nets, which mainly includes Low-Span network and RDNS network. LHPE-nets uses a network structure with evenly distributed channels, inverted residuals, external residual blocks and a framework for processing small-resolution samples to achieve training saturation faster. And we also designed a static pose sample simplification method for 3D pose data. It implemented low-cost sample storage, and it was also convenient for models to read these samples. In the experiment, we used several recent models and two public estimation indicators. The experimental results show the superiority of this work in fast start-up and network lightweight, it is about 1-5 epochs faster than the Resnet-34 during training. And they also show the accuracy improvement of this work in estimating different joints, the estimated performance of approximately 60% of the joints is improved. Its performance in the overall human pose estimation exceeds other networks by more than 7mm. The experiment analyzes the network size, fast start-up and the performance in 2D and 3D pose estimation of the model in this paper in detail. Compared with other pose estimation models, its performance has also reached a higher level of application.
format Online
Article
Text
id pubmed-8865690
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-88656902022-02-24 LHPE-nets: A lightweight 2D and 3D human pose estimation model with well-structural deep networks and multi-view pose sample simplification method Wang, Hao Sun, Ming-hui Zhang, Hao Dong, Li-yan PLoS One Research Article The cross-view 3D human pose estimation model has made significant progress, it better completed the task of human joint positioning and skeleton modeling in 3D through multi-view fusion method. The multi-view 2D pose estimation part of this model is very important, but its training cost is also very high. It uses some deep learning networks to generate heatmaps for each view. Therefore, in this article, we tested some new deep learning networks for pose estimation tasks. These deep networks include Mobilenetv2, Mobilenetv3, Efficientnetv2 and Resnet. Then, based on the performance and drawbacks of these networks, we built multiple deep learning networks with better performance. We call our network in this article LHPE-nets, which mainly includes Low-Span network and RDNS network. LHPE-nets uses a network structure with evenly distributed channels, inverted residuals, external residual blocks and a framework for processing small-resolution samples to achieve training saturation faster. And we also designed a static pose sample simplification method for 3D pose data. It implemented low-cost sample storage, and it was also convenient for models to read these samples. In the experiment, we used several recent models and two public estimation indicators. The experimental results show the superiority of this work in fast start-up and network lightweight, it is about 1-5 epochs faster than the Resnet-34 during training. And they also show the accuracy improvement of this work in estimating different joints, the estimated performance of approximately 60% of the joints is improved. Its performance in the overall human pose estimation exceeds other networks by more than 7mm. The experiment analyzes the network size, fast start-up and the performance in 2D and 3D pose estimation of the model in this paper in detail. Compared with other pose estimation models, its performance has also reached a higher level of application. Public Library of Science 2022-02-23 /pmc/articles/PMC8865690/ /pubmed/35196346 http://dx.doi.org/10.1371/journal.pone.0264302 Text en © 2022 Wang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, Hao
Sun, Ming-hui
Zhang, Hao
Dong, Li-yan
LHPE-nets: A lightweight 2D and 3D human pose estimation model with well-structural deep networks and multi-view pose sample simplification method
title LHPE-nets: A lightweight 2D and 3D human pose estimation model with well-structural deep networks and multi-view pose sample simplification method
title_full LHPE-nets: A lightweight 2D and 3D human pose estimation model with well-structural deep networks and multi-view pose sample simplification method
title_fullStr LHPE-nets: A lightweight 2D and 3D human pose estimation model with well-structural deep networks and multi-view pose sample simplification method
title_full_unstemmed LHPE-nets: A lightweight 2D and 3D human pose estimation model with well-structural deep networks and multi-view pose sample simplification method
title_short LHPE-nets: A lightweight 2D and 3D human pose estimation model with well-structural deep networks and multi-view pose sample simplification method
title_sort lhpe-nets: a lightweight 2d and 3d human pose estimation model with well-structural deep networks and multi-view pose sample simplification method
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8865690/
https://www.ncbi.nlm.nih.gov/pubmed/35196346
http://dx.doi.org/10.1371/journal.pone.0264302
work_keys_str_mv AT wanghao lhpenetsalightweight2dand3dhumanposeestimationmodelwithwellstructuraldeepnetworksandmultiviewposesamplesimplificationmethod
AT sunminghui lhpenetsalightweight2dand3dhumanposeestimationmodelwithwellstructuraldeepnetworksandmultiviewposesamplesimplificationmethod
AT zhanghao lhpenetsalightweight2dand3dhumanposeestimationmodelwithwellstructuraldeepnetworksandmultiviewposesamplesimplificationmethod
AT dongliyan lhpenetsalightweight2dand3dhumanposeestimationmodelwithwellstructuraldeepnetworksandmultiviewposesamplesimplificationmethod