Cargando…

Amalgamation of cloud-based colonoscopy videos with patient-level metadata to facilitate large-scale machine learning

Background and study aims  Storage of full-length endoscopic procedures is becoming increasingly popular. To facilitate large-scale machine learning (ML) focused on clinical outcomes, these videos must be merged with the patient-level data in the electronic health record (EHR). Our aim was to presen...

Descripción completa

Detalles Bibliográficos
Autores principales: Keswani, Rajesh N., Byrd, Daniel, Garcia Vicente, Florencia, Heller, J. Alex, Klug, Matthew, Mazumder, Nikhilesh R., Wood, Jordan, Yang, Anthony D., Etemadi, Mozziyar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Georg Thieme Verlag KG 2021
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7857968/
https://www.ncbi.nlm.nih.gov/pubmed/33553586
http://dx.doi.org/10.1055/a-1326-1289
_version_ 1783646552891326464
author Keswani, Rajesh N.
Byrd, Daniel
Garcia Vicente, Florencia
Heller, J. Alex
Klug, Matthew
Mazumder, Nikhilesh R.
Wood, Jordan
Yang, Anthony D.
Etemadi, Mozziyar
author_facet Keswani, Rajesh N.
Byrd, Daniel
Garcia Vicente, Florencia
Heller, J. Alex
Klug, Matthew
Mazumder, Nikhilesh R.
Wood, Jordan
Yang, Anthony D.
Etemadi, Mozziyar
author_sort Keswani, Rajesh N.
collection PubMed
description Background and study aims  Storage of full-length endoscopic procedures is becoming increasingly popular. To facilitate large-scale machine learning (ML) focused on clinical outcomes, these videos must be merged with the patient-level data in the electronic health record (EHR). Our aim was to present a method of accurately linking patient-level EHR data with cloud stored colonoscopy videos. Methods  This study was conducted at a single academic medical center. Most procedure videos are automatically uploaded to the cloud server but are identified only by procedure time and procedure room. We developed and then tested an algorithm to match recorded videos with corresponding exams in the EHR based upon procedure time and room and subsequently extract frames of interest. Results  Among 28,611 total colonoscopies performed over the study period, 21,170 colonoscopy videos in 20,420 unique patients (54.2 % male, median age 58) were matched to EHR data. Of 100 randomly sampled videos, appropriate matching was manually confirmed in all. In total, these videos represented 489,721 minutes of colonoscopy performed by 50 endoscopists (median 214 colonoscopies per endoscopist). The most common procedure indications were polyp screening (47.3 %), surveillance (28.9 %) and inflammatory bowel disease (9.4 %). From these videos, we extracted procedure highlights (identified by image capture; mean 8.5 per colonoscopy) and surrounding frames. Conclusions  We report the successful merging of a large database of endoscopy videos stored with limited identifiers to rich patient-level data in a highly accurate manner. This technique facilitates the development of ML algorithms based upon relevant patient outcomes.
format Online
Article
Text
id pubmed-7857968
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Georg Thieme Verlag KG
record_format MEDLINE/PubMed
spelling pubmed-78579682021-02-05 Amalgamation of cloud-based colonoscopy videos with patient-level metadata to facilitate large-scale machine learning Keswani, Rajesh N. Byrd, Daniel Garcia Vicente, Florencia Heller, J. Alex Klug, Matthew Mazumder, Nikhilesh R. Wood, Jordan Yang, Anthony D. Etemadi, Mozziyar Endosc Int Open Background and study aims  Storage of full-length endoscopic procedures is becoming increasingly popular. To facilitate large-scale machine learning (ML) focused on clinical outcomes, these videos must be merged with the patient-level data in the electronic health record (EHR). Our aim was to present a method of accurately linking patient-level EHR data with cloud stored colonoscopy videos. Methods  This study was conducted at a single academic medical center. Most procedure videos are automatically uploaded to the cloud server but are identified only by procedure time and procedure room. We developed and then tested an algorithm to match recorded videos with corresponding exams in the EHR based upon procedure time and room and subsequently extract frames of interest. Results  Among 28,611 total colonoscopies performed over the study period, 21,170 colonoscopy videos in 20,420 unique patients (54.2 % male, median age 58) were matched to EHR data. Of 100 randomly sampled videos, appropriate matching was manually confirmed in all. In total, these videos represented 489,721 minutes of colonoscopy performed by 50 endoscopists (median 214 colonoscopies per endoscopist). The most common procedure indications were polyp screening (47.3 %), surveillance (28.9 %) and inflammatory bowel disease (9.4 %). From these videos, we extracted procedure highlights (identified by image capture; mean 8.5 per colonoscopy) and surrounding frames. Conclusions  We report the successful merging of a large database of endoscopy videos stored with limited identifiers to rich patient-level data in a highly accurate manner. This technique facilitates the development of ML algorithms based upon relevant patient outcomes. Georg Thieme Verlag KG 2021-02 2021-02-03 /pmc/articles/PMC7857968/ /pubmed/33553586 http://dx.doi.org/10.1055/a-1326-1289 Text en The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commecial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/) https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License, which permits unrestricted reproduction and distribution, for non-commercial purposes only; and use and reproduction, but not distribution, of adapted material for non-commercial purposes only, provided the original work is properly cited.
spellingShingle Keswani, Rajesh N.
Byrd, Daniel
Garcia Vicente, Florencia
Heller, J. Alex
Klug, Matthew
Mazumder, Nikhilesh R.
Wood, Jordan
Yang, Anthony D.
Etemadi, Mozziyar
Amalgamation of cloud-based colonoscopy videos with patient-level metadata to facilitate large-scale machine learning
title Amalgamation of cloud-based colonoscopy videos with patient-level metadata to facilitate large-scale machine learning
title_full Amalgamation of cloud-based colonoscopy videos with patient-level metadata to facilitate large-scale machine learning
title_fullStr Amalgamation of cloud-based colonoscopy videos with patient-level metadata to facilitate large-scale machine learning
title_full_unstemmed Amalgamation of cloud-based colonoscopy videos with patient-level metadata to facilitate large-scale machine learning
title_short Amalgamation of cloud-based colonoscopy videos with patient-level metadata to facilitate large-scale machine learning
title_sort amalgamation of cloud-based colonoscopy videos with patient-level metadata to facilitate large-scale machine learning
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7857968/
https://www.ncbi.nlm.nih.gov/pubmed/33553586
http://dx.doi.org/10.1055/a-1326-1289
work_keys_str_mv AT keswanirajeshn amalgamationofcloudbasedcolonoscopyvideoswithpatientlevelmetadatatofacilitatelargescalemachinelearning
AT byrddaniel amalgamationofcloudbasedcolonoscopyvideoswithpatientlevelmetadatatofacilitatelargescalemachinelearning
AT garciavicenteflorencia amalgamationofcloudbasedcolonoscopyvideoswithpatientlevelmetadatatofacilitatelargescalemachinelearning
AT hellerjalex amalgamationofcloudbasedcolonoscopyvideoswithpatientlevelmetadatatofacilitatelargescalemachinelearning
AT klugmatthew amalgamationofcloudbasedcolonoscopyvideoswithpatientlevelmetadatatofacilitatelargescalemachinelearning
AT mazumdernikhileshr amalgamationofcloudbasedcolonoscopyvideoswithpatientlevelmetadatatofacilitatelargescalemachinelearning
AT woodjordan amalgamationofcloudbasedcolonoscopyvideoswithpatientlevelmetadatatofacilitatelargescalemachinelearning
AT yanganthonyd amalgamationofcloudbasedcolonoscopyvideoswithpatientlevelmetadatatofacilitatelargescalemachinelearning
AT etemadimozziyar amalgamationofcloudbasedcolonoscopyvideoswithpatientlevelmetadatatofacilitatelargescalemachinelearning