Cargando…

Multi-Modal Deep Learning for Assessing Surgeon Technical Skill

This paper introduces a new dataset of a surgical knot-tying task, and a multi-modal deep learning model that achieves comparable performance to expert human raters on this skill assessment task. Seventy-two surgical trainees and faculty were recruited for the knot-tying task, and were recorded usin...

Descripción completa

Detalles Bibliográficos
Autores principales: Kasa, Kevin, Burns, David, Goldenberg, Mitchell G., Selim, Omar, Whyne, Cari, Hardisty, Michael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9571767/
https://www.ncbi.nlm.nih.gov/pubmed/36236424
http://dx.doi.org/10.3390/s22197328
_version_ 1784810445260980224
author Kasa, Kevin
Burns, David
Goldenberg, Mitchell G.
Selim, Omar
Whyne, Cari
Hardisty, Michael
author_facet Kasa, Kevin
Burns, David
Goldenberg, Mitchell G.
Selim, Omar
Whyne, Cari
Hardisty, Michael
author_sort Kasa, Kevin
collection PubMed
description This paper introduces a new dataset of a surgical knot-tying task, and a multi-modal deep learning model that achieves comparable performance to expert human raters on this skill assessment task. Seventy-two surgical trainees and faculty were recruited for the knot-tying task, and were recorded using video, kinematic, and image data. Three expert human raters conducted the skills assessment using the Objective Structured Assessment of Technical Skill (OSATS) Global Rating Scale (GRS). We also designed and developed three deep learning models: a ResNet-based image model, a ResNet-LSTM kinematic model, and a multi-modal model leveraging the image and time-series kinematic data. All three models demonstrate performance comparable to the expert human raters on most GRS domains. The multi-modal model demonstrates the best overall performance, as measured using the mean squared error (MSE) and intraclass correlation coefficient (ICC). This work is significant since it demonstrates that multi-modal deep learning has the potential to replicate human raters on a challenging human-performed knot-tying task. The study demonstrates an algorithm with state-of-the-art performance in surgical skill assessment. As objective assessment of technical skill continues to be a growing, but resource-heavy, element of surgical education, this study is an important step towards automated surgical skill assessment, ultimately leading to reduced burden on training faculty and institutes.
format Online
Article
Text
id pubmed-9571767
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95717672022-10-17 Multi-Modal Deep Learning for Assessing Surgeon Technical Skill Kasa, Kevin Burns, David Goldenberg, Mitchell G. Selim, Omar Whyne, Cari Hardisty, Michael Sensors (Basel) Article This paper introduces a new dataset of a surgical knot-tying task, and a multi-modal deep learning model that achieves comparable performance to expert human raters on this skill assessment task. Seventy-two surgical trainees and faculty were recruited for the knot-tying task, and were recorded using video, kinematic, and image data. Three expert human raters conducted the skills assessment using the Objective Structured Assessment of Technical Skill (OSATS) Global Rating Scale (GRS). We also designed and developed three deep learning models: a ResNet-based image model, a ResNet-LSTM kinematic model, and a multi-modal model leveraging the image and time-series kinematic data. All three models demonstrate performance comparable to the expert human raters on most GRS domains. The multi-modal model demonstrates the best overall performance, as measured using the mean squared error (MSE) and intraclass correlation coefficient (ICC). This work is significant since it demonstrates that multi-modal deep learning has the potential to replicate human raters on a challenging human-performed knot-tying task. The study demonstrates an algorithm with state-of-the-art performance in surgical skill assessment. As objective assessment of technical skill continues to be a growing, but resource-heavy, element of surgical education, this study is an important step towards automated surgical skill assessment, ultimately leading to reduced burden on training faculty and institutes. MDPI 2022-09-27 /pmc/articles/PMC9571767/ /pubmed/36236424 http://dx.doi.org/10.3390/s22197328 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kasa, Kevin
Burns, David
Goldenberg, Mitchell G.
Selim, Omar
Whyne, Cari
Hardisty, Michael
Multi-Modal Deep Learning for Assessing Surgeon Technical Skill
title Multi-Modal Deep Learning for Assessing Surgeon Technical Skill
title_full Multi-Modal Deep Learning for Assessing Surgeon Technical Skill
title_fullStr Multi-Modal Deep Learning for Assessing Surgeon Technical Skill
title_full_unstemmed Multi-Modal Deep Learning for Assessing Surgeon Technical Skill
title_short Multi-Modal Deep Learning for Assessing Surgeon Technical Skill
title_sort multi-modal deep learning for assessing surgeon technical skill
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9571767/
https://www.ncbi.nlm.nih.gov/pubmed/36236424
http://dx.doi.org/10.3390/s22197328
work_keys_str_mv AT kasakevin multimodaldeeplearningforassessingsurgeontechnicalskill
AT burnsdavid multimodaldeeplearningforassessingsurgeontechnicalskill
AT goldenbergmitchellg multimodaldeeplearningforassessingsurgeontechnicalskill
AT selimomar multimodaldeeplearningforassessingsurgeontechnicalskill
AT whynecari multimodaldeeplearningforassessingsurgeontechnicalskill
AT hardistymichael multimodaldeeplearningforassessingsurgeontechnicalskill