Cargando…
Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory
PURPOSE: Estimating the surgery length has the potential to be utilized as skill assessment, surgical training, or efficient surgical facility utilization especially if it is done in real-time as a remaining surgery duration (RSD). Surgical length reflects a certain level of efficiency and mastery o...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10161556/ https://www.ncbi.nlm.nih.gov/pubmed/37143041 http://dx.doi.org/10.1186/s12911-023-02160-0 |
_version_ | 1785037518255685632 |
---|---|
author | Wang, Bowen Li, Liangzhi Nakashima, Yuta Kawasaki, Ryo Nagahara, Hajime |
author_facet | Wang, Bowen Li, Liangzhi Nakashima, Yuta Kawasaki, Ryo Nagahara, Hajime |
author_sort | Wang, Bowen |
collection | PubMed |
description | PURPOSE: Estimating the surgery length has the potential to be utilized as skill assessment, surgical training, or efficient surgical facility utilization especially if it is done in real-time as a remaining surgery duration (RSD). Surgical length reflects a certain level of efficiency and mastery of the surgeon in a well-standardized surgery such as cataract surgery. In this paper, we design and develop a real-time RSD estimation method for cataract surgery that does not require manual labeling and is transferable with minimum fine-tuning. METHODS: A regression method consisting of convolutional neural networks (CNNs) and long short-term memory (LSTM) is designed for RSD estimation. The model is firstly trained and evaluated for the single main surgeon with a large number of surgeries. Then, the fine-tuning strategy is used to transfer the model to the data of the other two surgeons. Mean Absolute Error (MAE in seconds) was used to evaluate the performance of the RSD estimation. The proposed method is compared with the naïve method which is based on the statistic of the historical data. A transferability experiment is also set to demonstrate the generalizability of the method. RESULT: The mean surgical time for the sample videos was 318.7 s (s) (standard deviation 83.4 s) for the main surgeon for the initial training. In our experiments, the lowest MAE of 19.4 s (equal to about 6.4% of the mean surgical time) is achieved by our best-trained model for the independent test data of the main target surgeon. It reduces the MAE by 35.5 s (-10.2%) compared to the naïve method. The fine-tuning strategy transfers the model trained for the main target to the data of other surgeons with only a small number of training data (20% of the pre-training). The MAEs for the other two surgeons are 28.3 s and 30.6 s with the fine-tuning model, which decreased by -8.1 s and -7.5 s than the Per-surgeon model (average declining of -7.8 s and 1.3% of video duration). External validation study with Cataract-101 outperformed 3 reported methods of TimeLSTM, RSDNet, and CataNet. CONCLUSION: An approach to build a pre-trained model for estimating RSD estimation based on a single surgeon and then transfer to other surgeons demonstrated both low prediction error and good transferability with minimum fine-tuning videos. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02160-0. |
format | Online Article Text |
id | pubmed-10161556 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-101615562023-05-06 Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory Wang, Bowen Li, Liangzhi Nakashima, Yuta Kawasaki, Ryo Nagahara, Hajime BMC Med Inform Decis Mak Research PURPOSE: Estimating the surgery length has the potential to be utilized as skill assessment, surgical training, or efficient surgical facility utilization especially if it is done in real-time as a remaining surgery duration (RSD). Surgical length reflects a certain level of efficiency and mastery of the surgeon in a well-standardized surgery such as cataract surgery. In this paper, we design and develop a real-time RSD estimation method for cataract surgery that does not require manual labeling and is transferable with minimum fine-tuning. METHODS: A regression method consisting of convolutional neural networks (CNNs) and long short-term memory (LSTM) is designed for RSD estimation. The model is firstly trained and evaluated for the single main surgeon with a large number of surgeries. Then, the fine-tuning strategy is used to transfer the model to the data of the other two surgeons. Mean Absolute Error (MAE in seconds) was used to evaluate the performance of the RSD estimation. The proposed method is compared with the naïve method which is based on the statistic of the historical data. A transferability experiment is also set to demonstrate the generalizability of the method. RESULT: The mean surgical time for the sample videos was 318.7 s (s) (standard deviation 83.4 s) for the main surgeon for the initial training. In our experiments, the lowest MAE of 19.4 s (equal to about 6.4% of the mean surgical time) is achieved by our best-trained model for the independent test data of the main target surgeon. It reduces the MAE by 35.5 s (-10.2%) compared to the naïve method. The fine-tuning strategy transfers the model trained for the main target to the data of other surgeons with only a small number of training data (20% of the pre-training). The MAEs for the other two surgeons are 28.3 s and 30.6 s with the fine-tuning model, which decreased by -8.1 s and -7.5 s than the Per-surgeon model (average declining of -7.8 s and 1.3% of video duration). External validation study with Cataract-101 outperformed 3 reported methods of TimeLSTM, RSDNet, and CataNet. CONCLUSION: An approach to build a pre-trained model for estimating RSD estimation based on a single surgeon and then transfer to other surgeons demonstrated both low prediction error and good transferability with minimum fine-tuning videos. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02160-0. BioMed Central 2023-05-04 /pmc/articles/PMC10161556/ /pubmed/37143041 http://dx.doi.org/10.1186/s12911-023-02160-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Wang, Bowen Li, Liangzhi Nakashima, Yuta Kawasaki, Ryo Nagahara, Hajime Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory |
title | Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory |
title_full | Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory |
title_fullStr | Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory |
title_full_unstemmed | Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory |
title_short | Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory |
title_sort | real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10161556/ https://www.ncbi.nlm.nih.gov/pubmed/37143041 http://dx.doi.org/10.1186/s12911-023-02160-0 |
work_keys_str_mv | AT wangbowen realtimeestimationoftheremainingsurgerydurationforcataractsurgeryusingdeepconvolutionalneuralnetworksandlongshorttermmemory AT liliangzhi realtimeestimationoftheremainingsurgerydurationforcataractsurgeryusingdeepconvolutionalneuralnetworksandlongshorttermmemory AT nakashimayuta realtimeestimationoftheremainingsurgerydurationforcataractsurgeryusingdeepconvolutionalneuralnetworksandlongshorttermmemory AT kawasakiryo realtimeestimationoftheremainingsurgerydurationforcataractsurgeryusingdeepconvolutionalneuralnetworksandlongshorttermmemory AT nagaharahajime realtimeestimationoftheremainingsurgerydurationforcataractsurgeryusingdeepconvolutionalneuralnetworksandlongshorttermmemory |