
In-Bed Pose Estimation: Deep Learning With Shallow Dataset

This paper presents a robust human posture and body parts detection method under a specific application scenario known as in-bed pose estimation. Although the human pose estimation for various computer vision (CV) applications has been studied extensively in the last few decades, the in-bed pose est...


Bibliographic Details
Format: Online Article Text
Language: English
Published: IEEE 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6360998/
https://www.ncbi.nlm.nih.gov/pubmed/30792942
http://dx.doi.org/10.1109/JTEHM.2019.2892970
_version_ 1783392623190343680
collection PubMed
description This paper presents a robust method for detecting human posture and body parts in a specific application scenario known as in-bed pose estimation. Although human pose estimation for various computer vision (CV) applications has been studied extensively over the last few decades, in-bed pose estimation using camera-based vision methods has been ignored by the CV community because it is assumed to be identical to general-purpose pose estimation problems. However, in-bed pose estimation has its own specialized aspects and comes with specific challenges, including notable differences in lighting conditions throughout the day and a pose distribution that differs from the common human surveillance viewpoint. In this paper, we demonstrate that these challenges significantly reduce the effectiveness of existing general-purpose pose estimation models. To address the lighting variation challenge, the infrared selective (IRS) image acquisition technique is proposed to provide uniform-quality data under various lighting conditions. In addition, to deal with the unconventional pose perspective, a 2-end histogram of oriented gradient (HOG) rectification method is presented. The deep learning framework has proved to be the most effective model for human pose estimation; however, the lack of a large public dataset for in-bed poses prevents us from training a large network from scratch. In this paper, we explored the idea of taking a convolutional neural network (CNN) model pre-trained on large public datasets of general human poses and fine-tuning it on our own shallow (limited in size and different in perspective and color) in-bed IRS dataset. We developed an IRS imaging system and collected IRS image data from several realistic life-size mannequins in a simulated hospital room environment.
A pre-trained CNN called convolutional pose machine (CPM) was fine-tuned for in-bed pose estimation by re-training its specific intermediate layers. With the HOG rectification method, the pose estimation performance of CPM improved significantly, by 26.4% under the probability of correct key-point (PCK) criterion at PCK0.1, compared to the model without such rectification. Even when tested only on well-aligned in-bed pose images, our fine-tuned model still surpassed the traditionally tuned CNN by a further 16.6% in pose estimation accuracy.
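The PCK figure quoted in the description can be illustrated with a minimal sketch. It assumes the common reading of PCK0.1: a predicted key-point counts as correct when it lies within 0.1 times a reference length of its ground-truth location. The record does not say which reference length the authors used, so `ref_length`, the function name `pck`, and the sample coordinates below are all hypothetical.

```python
import math


def pck(pred, truth, ref_length, alpha=0.1):
    """Fraction of key-points predicted within alpha * ref_length of the truth.

    pred, truth: equal-length lists of (x, y) pixel coordinates.
    ref_length:  normalizing length in pixels (an assumption here; the
                 record does not state the reference used in the paper).
    alpha:       tolerance as a fraction of ref_length (0.1 for PCK0.1).
    """
    if len(pred) != len(truth):
        raise ValueError("pred and truth must have the same number of key-points")
    if not pred:
        raise ValueError("at least one key-point is required")
    threshold = alpha * ref_length
    correct = sum(
        1
        for (px, py), (tx, ty) in zip(pred, truth)
        if math.hypot(px - tx, py - ty) <= threshold
    )
    return correct / len(pred)


# Made-up coordinates for four key-points, for illustration only.
truth = [(50.0, 40.0), (52.0, 80.0), (30.0, 120.0), (70.0, 120.0)]
pred = [(51.0, 41.0), (60.0, 95.0), (31.0, 118.0), (70.0, 150.0)]

# With ref_length=100 and alpha=0.1 the tolerance is 10 pixels.
score = pck(pred, truth, ref_length=100.0, alpha=0.1)
print(score)
```

In this toy case two of the four predictions fall inside the 10-pixel tolerance. A reported gain such as the 26.4% above would be the difference between two such scores computed over a full test set.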
format Online
Article
Text
id pubmed-6360998
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher IEEE
record_format MEDLINE/PubMed
spelling pubmed-6360998 2019-02-21 In-Bed Pose Estimation: Deep Learning With Shallow Dataset IEEE J Transl Eng Health Med Article IEEE 2019-01-14 /pmc/articles/PMC6360998/ /pubmed/30792942 http://dx.doi.org/10.1109/JTEHM.2019.2892970 Text en 2168-2372 © 2019 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
title In-Bed Pose Estimation: Deep Learning With Shallow Dataset
topic Article
work_keys_str_mv AT inbedposeestimationdeeplearningwithshallowdataset