Cargando…
Improving work detection by segmentation heuristics pre-training on factory operations video
The measurement of work time for individual tasks by using video has made a significant contribution to a framework for productivity improvement such as value stream mapping (VSM). In the past, the work time has been often measured manually, but this process is quite costly and labor-intensive. For...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9173629/ https://www.ncbi.nlm.nih.gov/pubmed/35671292 http://dx.doi.org/10.1371/journal.pone.0267457 |
_version_ | 1784722065286234112 |
---|---|
author | Kataoka, Shotaro Ito, Tetsuro Iwaka, Genki Oba, Masashi Nonaka, Hirofumi |
author_facet | Kataoka, Shotaro Ito, Tetsuro Iwaka, Genki Oba, Masashi Nonaka, Hirofumi |
author_sort | Kataoka, Shotaro |
collection | PubMed |
description | The measurement of work time for individual tasks by using video has made a significant contribution to a framework for productivity improvement such as value stream mapping (VSM). In the past, the work time has been often measured manually, but this process is quite costly and labor-intensive. For these reasons, automation of work analysis at the worksite is needed. There are two main methods for computing spatio-temporal information: by 3D-CNN, and by temporal computation using LSTM after feature extraction in the spatial domain by 2D-CNN. These methods has high computational cost but high model representational power, and the latter has low computational cost but relatively low model representational power. In the manufacturing industry, the use of local computers to make inferences is often required for practicality and confidentiality reasons, necessitating a low computational cost, and so the latter, a lightweight model, needs to have improved performance. Therefore, in this paper, we propose a method that pre-trains the image encoder module of a work detection model using an image segmentation model. This is based on the CNN-LSTM structure, which separates spatial and temporal computation and enables us to include heuristics such as workers’ body parts and work tools in the CNN module. Experimental results demonstrate that our pre-training method reduces over-fitting and provides a greater improvement in detection performance than pre-training on ImageNet. |
format | Online Article Text |
id | pubmed-9173629 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-91736292022-06-08 Improving work detection by segmentation heuristics pre-training on factory operations video Kataoka, Shotaro Ito, Tetsuro Iwaka, Genki Oba, Masashi Nonaka, Hirofumi PLoS One Research Article The measurement of work time for individual tasks by using video has made a significant contribution to a framework for productivity improvement such as value stream mapping (VSM). In the past, the work time has been often measured manually, but this process is quite costly and labor-intensive. For these reasons, automation of work analysis at the worksite is needed. There are two main methods for computing spatio-temporal information: by 3D-CNN, and by temporal computation using LSTM after feature extraction in the spatial domain by 2D-CNN. These methods has high computational cost but high model representational power, and the latter has low computational cost but relatively low model representational power. In the manufacturing industry, the use of local computers to make inferences is often required for practicality and confidentiality reasons, necessitating a low computational cost, and so the latter, a lightweight model, needs to have improved performance. Therefore, in this paper, we propose a method that pre-trains the image encoder module of a work detection model using an image segmentation model. This is based on the CNN-LSTM structure, which separates spatial and temporal computation and enables us to include heuristics such as workers’ body parts and work tools in the CNN module. Experimental results demonstrate that our pre-training method reduces over-fitting and provides a greater improvement in detection performance than pre-training on ImageNet. Public Library of Science 2022-06-07 /pmc/articles/PMC9173629/ /pubmed/35671292 http://dx.doi.org/10.1371/journal.pone.0267457 Text en © 2022 Kataoka et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Kataoka, Shotaro Ito, Tetsuro Iwaka, Genki Oba, Masashi Nonaka, Hirofumi Improving work detection by segmentation heuristics pre-training on factory operations video |
title | Improving work detection by segmentation heuristics pre-training on factory operations video |
title_full | Improving work detection by segmentation heuristics pre-training on factory operations video |
title_fullStr | Improving work detection by segmentation heuristics pre-training on factory operations video |
title_full_unstemmed | Improving work detection by segmentation heuristics pre-training on factory operations video |
title_short | Improving work detection by segmentation heuristics pre-training on factory operations video |
title_sort | improving work detection by segmentation heuristics pre-training on factory operations video |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9173629/ https://www.ncbi.nlm.nih.gov/pubmed/35671292 http://dx.doi.org/10.1371/journal.pone.0267457 |
work_keys_str_mv | AT kataokashotaro improvingworkdetectionbysegmentationheuristicspretrainingonfactoryoperationsvideo AT itotetsuro improvingworkdetectionbysegmentationheuristicspretrainingonfactoryoperationsvideo AT iwakagenki improvingworkdetectionbysegmentationheuristicspretrainingonfactoryoperationsvideo AT obamasashi improvingworkdetectionbysegmentationheuristicspretrainingonfactoryoperationsvideo AT nonakahirofumi improvingworkdetectionbysegmentationheuristicspretrainingonfactoryoperationsvideo |