Cargando…

Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques

IMPORTANCE: Competence in cataract surgery is a public health necessity, and videos of cataract surgery are routinely available to educators and trainees but currently are of limited use in training. Machine learning and deep learning techniques can yield tools that efficiently segment videos of cat...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Felix, Silva Croso, Gianluca, Kim, Tae Soo, Song, Ziang, Parker, Felix, Hager, Gregory D., Reiter, Austin, Vedula, S. Swaroop, Ali, Haider, Sikder, Shameema
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Medical Association 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6450320/
https://www.ncbi.nlm.nih.gov/pubmed/30951163
http://dx.doi.org/10.1001/jamanetworkopen.2019.1860
_version_ 1783409011240992768
author Yu, Felix
Silva Croso, Gianluca
Kim, Tae Soo
Song, Ziang
Parker, Felix
Hager, Gregory D.
Reiter, Austin
Vedula, S. Swaroop
Ali, Haider
Sikder, Shameema
author_facet Yu, Felix
Silva Croso, Gianluca
Kim, Tae Soo
Song, Ziang
Parker, Felix
Hager, Gregory D.
Reiter, Austin
Vedula, S. Swaroop
Ali, Haider
Sikder, Shameema
author_sort Yu, Felix
collection PubMed
description IMPORTANCE: Competence in cataract surgery is a public health necessity, and videos of cataract surgery are routinely available to educators and trainees but currently are of limited use in training. Machine learning and deep learning techniques can yield tools that efficiently segment videos of cataract surgery into constituent phases for subsequent automated skill assessment and feedback. OBJECTIVE: To evaluate machine learning and deep learning algorithms for automated phase classification of manually presegmented phases in videos of cataract surgery. DESIGN, SETTING, AND PARTICIPANTS: This was a cross-sectional study using a data set of videos from a convenience sample of 100 cataract procedures performed by faculty and trainee surgeons in an ophthalmology residency program from July 2011 to December 2017. Demographic characteristics for surgeons and patients were not captured. Ten standard labels in the procedure and 14 instruments used during surgery were manually annotated, which served as the ground truth. EXPOSURES: Five algorithms with different input data: (1) a support vector machine input with cross-sectional instrument label data; (2) a recurrent neural network (RNN) input with a time series of instrument labels; (3) a convolutional neural network (CNN) input with cross-sectional image data; (4) a CNN-RNN input with a time series of images; and (5) a CNN-RNN input with time series of images and instrument labels. Each algorithm was evaluated with 5-fold cross-validation. MAIN OUTCOMES AND MEASURES: Accuracy, area under the receiver operating characteristic curve, sensitivity, specificity, and precision. RESULTS: Unweighted accuracy for the 5 algorithms ranged between 0.915 and 0.959. Area under the receiver operating characteristic curve for the 5 algorithms ranged between 0.712 and 0.773, with small differences among them. The area under the receiver operating characteristic curve for the image-only CNN-RNN (0.752) was significantly greater than that of the CNN with cross-sectional image data (0.712) (difference, −0.040; 95% CI, −0.049 to −0.033) and the CNN-RNN with images and instrument labels (0.737) (difference, 0.016; 95% CI, 0.014 to 0.018). While specificity was uniformly high for all phases with all 5 algorithms (range, 0.877 to 0.999), sensitivity ranged between 0.005 (95% CI, 0.000 to 0.015) for the support vector machine for wound closure (corneal hydration) and 0.974 (95% CI, 0.957 to 0.991) for the RNN for main incision. Precision ranged between 0.283 and 0.963. CONCLUSIONS AND RELEVANCE: Time series modeling of instrument labels and video images using deep learning techniques may yield potentially useful tools for the automated detection of phases in cataract surgery procedures.
format Online
Article
Text
id pubmed-6450320
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher American Medical Association
record_format MEDLINE/PubMed
spelling pubmed-64503202019-04-24 Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques Yu, Felix Silva Croso, Gianluca Kim, Tae Soo Song, Ziang Parker, Felix Hager, Gregory D. Reiter, Austin Vedula, S. Swaroop Ali, Haider Sikder, Shameema JAMA Netw Open Original Investigation IMPORTANCE: Competence in cataract surgery is a public health necessity, and videos of cataract surgery are routinely available to educators and trainees but currently are of limited use in training. Machine learning and deep learning techniques can yield tools that efficiently segment videos of cataract surgery into constituent phases for subsequent automated skill assessment and feedback. OBJECTIVE: To evaluate machine learning and deep learning algorithms for automated phase classification of manually presegmented phases in videos of cataract surgery. DESIGN, SETTING, AND PARTICIPANTS: This was a cross-sectional study using a data set of videos from a convenience sample of 100 cataract procedures performed by faculty and trainee surgeons in an ophthalmology residency program from July 2011 to December 2017. Demographic characteristics for surgeons and patients were not captured. Ten standard labels in the procedure and 14 instruments used during surgery were manually annotated, which served as the ground truth. EXPOSURES: Five algorithms with different input data: (1) a support vector machine input with cross-sectional instrument label data; (2) a recurrent neural network (RNN) input with a time series of instrument labels; (3) a convolutional neural network (CNN) input with cross-sectional image data; (4) a CNN-RNN input with a time series of images; and (5) a CNN-RNN input with time series of images and instrument labels. Each algorithm was evaluated with 5-fold cross-validation. MAIN OUTCOMES AND MEASURES: Accuracy, area under the receiver operating characteristic curve, sensitivity, specificity, and precision. RESULTS: Unweighted accuracy for the 5 algorithms ranged between 0.915 and 0.959. Area under the receiver operating characteristic curve for the 5 algorithms ranged between 0.712 and 0.773, with small differences among them. The area under the receiver operating characteristic curve for the image-only CNN-RNN (0.752) was significantly greater than that of the CNN with cross-sectional image data (0.712) (difference, −0.040; 95% CI, −0.049 to −0.033) and the CNN-RNN with images and instrument labels (0.737) (difference, 0.016; 95% CI, 0.014 to 0.018). While specificity was uniformly high for all phases with all 5 algorithms (range, 0.877 to 0.999), sensitivity ranged between 0.005 (95% CI, 0.000 to 0.015) for the support vector machine for wound closure (corneal hydration) and 0.974 (95% CI, 0.957 to 0.991) for the RNN for main incision. Precision ranged between 0.283 and 0.963. CONCLUSIONS AND RELEVANCE: Time series modeling of instrument labels and video images using deep learning techniques may yield potentially useful tools for the automated detection of phases in cataract surgery procedures. American Medical Association 2019-04-05 /pmc/articles/PMC6450320/ /pubmed/30951163 http://dx.doi.org/10.1001/jamanetworkopen.2019.1860 Text en Copyright 2019 Yu F et al. JAMA Network Open. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the CC-BY License.
spellingShingle Original Investigation
Yu, Felix
Silva Croso, Gianluca
Kim, Tae Soo
Song, Ziang
Parker, Felix
Hager, Gregory D.
Reiter, Austin
Vedula, S. Swaroop
Ali, Haider
Sikder, Shameema
Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques
title Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques
title_full Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques
title_fullStr Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques
title_full_unstemmed Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques
title_short Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques
title_sort assessment of automated identification of phases in videos of cataract surgery using machine learning and deep learning techniques
topic Original Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6450320/
https://www.ncbi.nlm.nih.gov/pubmed/30951163
http://dx.doi.org/10.1001/jamanetworkopen.2019.1860
work_keys_str_mv AT yufelix assessmentofautomatedidentificationofphasesinvideosofcataractsurgeryusingmachinelearninganddeeplearningtechniques
AT silvacrosogianluca assessmentofautomatedidentificationofphasesinvideosofcataractsurgeryusingmachinelearninganddeeplearningtechniques
AT kimtaesoo assessmentofautomatedidentificationofphasesinvideosofcataractsurgeryusingmachinelearninganddeeplearningtechniques
AT songziang assessmentofautomatedidentificationofphasesinvideosofcataractsurgeryusingmachinelearninganddeeplearningtechniques
AT parkerfelix assessmentofautomatedidentificationofphasesinvideosofcataractsurgeryusingmachinelearninganddeeplearningtechniques
AT hagergregoryd assessmentofautomatedidentificationofphasesinvideosofcataractsurgeryusingmachinelearninganddeeplearningtechniques
AT reiteraustin assessmentofautomatedidentificationofphasesinvideosofcataractsurgeryusingmachinelearninganddeeplearningtechniques
AT vedulasswaroop assessmentofautomatedidentificationofphasesinvideosofcataractsurgeryusingmachinelearninganddeeplearningtechniques
AT alihaider assessmentofautomatedidentificationofphasesinvideosofcataractsurgeryusingmachinelearninganddeeplearningtechniques
AT sikdershameema assessmentofautomatedidentificationofphasesinvideosofcataractsurgeryusingmachinelearninganddeeplearningtechniques