Cargando…

Synthesising 2D Video from 3D Motion Data for Machine Learning Applications

To increase the utility of legacy, gold-standard, three-dimensional (3D) motion capture datasets for computer vision-based machine learning applications, this study proposed and validated a method to synthesise two-dimensional (2D) video image frames from historic 3D motion data. We applied the vide...

Descripción completa

Detalles Bibliográficos
Autores principales: Mundt, Marion, Oberlack, Henrike, Goldacre, Molly, Powles, Julia, Funken, Johannes, Morris, Corey, Potthast, Wolfgang, Alderson, Jacqueline
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9459679/
https://www.ncbi.nlm.nih.gov/pubmed/36080981
http://dx.doi.org/10.3390/s22176522
_version_ 1784786568532197376
author Mundt, Marion
Oberlack, Henrike
Goldacre, Molly
Powles, Julia
Funken, Johannes
Morris, Corey
Potthast, Wolfgang
Alderson, Jacqueline
author_facet Mundt, Marion
Oberlack, Henrike
Goldacre, Molly
Powles, Julia
Funken, Johannes
Morris, Corey
Potthast, Wolfgang
Alderson, Jacqueline
author_sort Mundt, Marion
collection PubMed
description To increase the utility of legacy, gold-standard, three-dimensional (3D) motion capture datasets for computer vision-based machine learning applications, this study proposed and validated a method to synthesise two-dimensional (2D) video image frames from historic 3D motion data. We applied the video-based human pose estimation model OpenPose to real (in situ) and synthesised 2D videos and compared anatomical landmark keypoint outputs, with trivial observed differences (2.11–3.49 mm). We further demonstrated the utility of the method in a downstream machine learning use-case in which we trained and then tested the validity of an artificial neural network (ANN) to estimate ground reaction forces (GRFs) using synthesised and real 2D videos. Training an ANN to estimate GRFs using eight OpenPose keypoints derived from synthesised 2D videos resulted in accurate waveform GRF estimations (r > 0.9; nRMSE < 14%). When compared with using the smaller number of real videos only, accuracy was improved by adding the synthetic views and enlarging the dataset. The results highlight the utility of the developed approach to enlarge small 2D video datasets, or to create 2D video images to accompany 3D motion capture datasets to make them accessible for machine learning applications.
format Online
Article
Text
id pubmed-9459679
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94596792022-09-10 Synthesising 2D Video from 3D Motion Data for Machine Learning Applications Mundt, Marion Oberlack, Henrike Goldacre, Molly Powles, Julia Funken, Johannes Morris, Corey Potthast, Wolfgang Alderson, Jacqueline Sensors (Basel) Article To increase the utility of legacy, gold-standard, three-dimensional (3D) motion capture datasets for computer vision-based machine learning applications, this study proposed and validated a method to synthesise two-dimensional (2D) video image frames from historic 3D motion data. We applied the video-based human pose estimation model OpenPose to real (in situ) and synthesised 2D videos and compared anatomical landmark keypoint outputs, with trivial observed differences (2.11–3.49 mm). We further demonstrated the utility of the method in a downstream machine learning use-case in which we trained and then tested the validity of an artificial neural network (ANN) to estimate ground reaction forces (GRFs) using synthesised and real 2D videos. Training an ANN to estimate GRFs using eight OpenPose keypoints derived from synthesised 2D videos resulted in accurate waveform GRF estimations (r > 0.9; nRMSE < 14%). When compared with using the smaller number of real videos only, accuracy was improved by adding the synthetic views and enlarging the dataset. The results highlight the utility of the developed approach to enlarge small 2D video datasets, or to create 2D video images to accompany 3D motion capture datasets to make them accessible for machine learning applications. MDPI 2022-08-29 /pmc/articles/PMC9459679/ /pubmed/36080981 http://dx.doi.org/10.3390/s22176522 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mundt, Marion
Oberlack, Henrike
Goldacre, Molly
Powles, Julia
Funken, Johannes
Morris, Corey
Potthast, Wolfgang
Alderson, Jacqueline
Synthesising 2D Video from 3D Motion Data for Machine Learning Applications
title Synthesising 2D Video from 3D Motion Data for Machine Learning Applications
title_full Synthesising 2D Video from 3D Motion Data for Machine Learning Applications
title_fullStr Synthesising 2D Video from 3D Motion Data for Machine Learning Applications
title_full_unstemmed Synthesising 2D Video from 3D Motion Data for Machine Learning Applications
title_short Synthesising 2D Video from 3D Motion Data for Machine Learning Applications
title_sort synthesising 2d video from 3d motion data for machine learning applications
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9459679/
https://www.ncbi.nlm.nih.gov/pubmed/36080981
http://dx.doi.org/10.3390/s22176522
work_keys_str_mv AT mundtmarion synthesising2dvideofrom3dmotiondataformachinelearningapplications
AT oberlackhenrike synthesising2dvideofrom3dmotiondataformachinelearningapplications
AT goldacremolly synthesising2dvideofrom3dmotiondataformachinelearningapplications
AT powlesjulia synthesising2dvideofrom3dmotiondataformachinelearningapplications
AT funkenjohannes synthesising2dvideofrom3dmotiondataformachinelearningapplications
AT morriscorey synthesising2dvideofrom3dmotiondataformachinelearningapplications
AT potthastwolfgang synthesising2dvideofrom3dmotiondataformachinelearningapplications
AT aldersonjacqueline synthesising2dvideofrom3dmotiondataformachinelearningapplications