Cargando…

Training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data

INTRODUCTION: The Mayo Clinic Endoscopic Subscore is a commonly used grading system to assess the severity of ulcerative colitis. Correctly grading colonoscopies using the Mayo Clinic Endoscopic Subscore is a challenging task, with suboptimal rates of interrater and intrarater variability observed e...

Descripción completa

Detalles Bibliográficos
Autores principales: Gutierrez Becker, Benjamin, Arcadu, Filippo, Thalhammer, Andreas, Gamez Serna, Citlalli, Feehan, Owen, Drawnel, Faye, Oh, Young S., Prunotto, Marco
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7917417/
https://www.ncbi.nlm.nih.gov/pubmed/33718871
http://dx.doi.org/10.1177/2631774521990623
_version_ 1783657694717018112
author Gutierrez Becker, Benjamin
Arcadu, Filippo
Thalhammer, Andreas
Gamez Serna, Citlalli
Feehan, Owen
Drawnel, Faye
Oh, Young S.
Prunotto, Marco
author_facet Gutierrez Becker, Benjamin
Arcadu, Filippo
Thalhammer, Andreas
Gamez Serna, Citlalli
Feehan, Owen
Drawnel, Faye
Oh, Young S.
Prunotto, Marco
author_sort Gutierrez Becker, Benjamin
collection PubMed
description INTRODUCTION: The Mayo Clinic Endoscopic Subscore is a commonly used grading system to assess the severity of ulcerative colitis. Correctly grading colonoscopies using the Mayo Clinic Endoscopic Subscore is a challenging task, with suboptimal rates of interrater and intrarater variability observed even among experienced and sufficiently trained experts. In recent years, several machine learning algorithms have been proposed in an effort to improve the standardization and reproducibility of Mayo Clinic Endoscopic Subscore grading. METHODS: Here we propose an end-to-end fully automated system based on deep learning to predict a binary version of the Mayo Clinic Endoscopic Subscore directly from raw colonoscopy videos. Differently from previous studies, the proposed method mimics the assessment done in practice by a gastroenterologist, that is, traversing the whole colonoscopy video, identifying visually informative regions and computing an overall Mayo Clinic Endoscopic Subscore. The proposed deep learning–based system has been trained and deployed on raw colonoscopies using Mayo Clinic Endoscopic Subscore ground truth provided only at the colon section level, without manually selecting frames driving the severity scoring of ulcerative colitis. RESULTS AND CONCLUSION: Our evaluation on 1672 endoscopic videos obtained from a multisite data set obtained from the etrolizumab Phase II Eucalyptus and Phase III Hickory and Laurel clinical trials, show that our proposed methodology can grade endoscopic videos with a high degree of accuracy and robustness (Area Under the Receiver Operating Characteristic Curve = 0.84 for Mayo Clinic Endoscopic Subscore ⩾ 1, 0.85 for Mayo Clinic Endoscopic Subscore ⩾ 2 and 0.85 for Mayo Clinic Endoscopic Subscore ⩾ 3) and reduced amounts of manual annotation. PLAIN LANGUAGE SUMMARY: Patient, caregiver and provider thoughts on educational materials about prescribing and medication safety Artificial intelligence can be used to automatically assess full endoscopic videos and estimate the severity of ulcerative colitis. In this work, we present an artificial intelligence algorithm for the automatic grading of ulcerative colitis in full endoscopic videos. Our artificial intelligence models were trained and evaluated on a large and diverse set of colonoscopy videos obtained from concluded clinical trials. We demonstrate not only that artificial intelligence is able to accurately grade full endoscopic videos, but also that using diverse data sets obtained from multiple sites is critical to train robust AI models that could potentially be deployed on real-world data.
format Online
Article
Text
id pubmed-7917417
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-79174172021-03-11 Training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data Gutierrez Becker, Benjamin Arcadu, Filippo Thalhammer, Andreas Gamez Serna, Citlalli Feehan, Owen Drawnel, Faye Oh, Young S. Prunotto, Marco Ther Adv Gastrointest Endosc Artificial Intelligence in Gastrointestinal Endoscopy INTRODUCTION: The Mayo Clinic Endoscopic Subscore is a commonly used grading system to assess the severity of ulcerative colitis. Correctly grading colonoscopies using the Mayo Clinic Endoscopic Subscore is a challenging task, with suboptimal rates of interrater and intrarater variability observed even among experienced and sufficiently trained experts. In recent years, several machine learning algorithms have been proposed in an effort to improve the standardization and reproducibility of Mayo Clinic Endoscopic Subscore grading. METHODS: Here we propose an end-to-end fully automated system based on deep learning to predict a binary version of the Mayo Clinic Endoscopic Subscore directly from raw colonoscopy videos. Differently from previous studies, the proposed method mimics the assessment done in practice by a gastroenterologist, that is, traversing the whole colonoscopy video, identifying visually informative regions and computing an overall Mayo Clinic Endoscopic Subscore. The proposed deep learning–based system has been trained and deployed on raw colonoscopies using Mayo Clinic Endoscopic Subscore ground truth provided only at the colon section level, without manually selecting frames driving the severity scoring of ulcerative colitis. RESULTS AND CONCLUSION: Our evaluation on 1672 endoscopic videos obtained from a multisite data set obtained from the etrolizumab Phase II Eucalyptus and Phase III Hickory and Laurel clinical trials, show that our proposed methodology can grade endoscopic videos with a high degree of accuracy and robustness (Area Under the Receiver Operating Characteristic Curve = 0.84 for Mayo Clinic Endoscopic Subscore ⩾ 1, 0.85 for Mayo Clinic Endoscopic Subscore ⩾ 2 and 0.85 for Mayo Clinic Endoscopic Subscore ⩾ 3) and reduced amounts of manual annotation. PLAIN LANGUAGE SUMMARY: Patient, caregiver and provider thoughts on educational materials about prescribing and medication safety Artificial intelligence can be used to automatically assess full endoscopic videos and estimate the severity of ulcerative colitis. In this work, we present an artificial intelligence algorithm for the automatic grading of ulcerative colitis in full endoscopic videos. Our artificial intelligence models were trained and evaluated on a large and diverse set of colonoscopy videos obtained from concluded clinical trials. We demonstrate not only that artificial intelligence is able to accurately grade full endoscopic videos, but also that using diverse data sets obtained from multiple sites is critical to train robust AI models that could potentially be deployed on real-world data. SAGE Publications 2021-02-25 /pmc/articles/PMC7917417/ /pubmed/33718871 http://dx.doi.org/10.1177/2631774521990623 Text en © The Author(s), 2021 https://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Artificial Intelligence in Gastrointestinal Endoscopy
Gutierrez Becker, Benjamin
Arcadu, Filippo
Thalhammer, Andreas
Gamez Serna, Citlalli
Feehan, Owen
Drawnel, Faye
Oh, Young S.
Prunotto, Marco
Training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data
title Training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data
title_full Training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data
title_fullStr Training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data
title_full_unstemmed Training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data
title_short Training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data
title_sort training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data
topic Artificial Intelligence in Gastrointestinal Endoscopy
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7917417/
https://www.ncbi.nlm.nih.gov/pubmed/33718871
http://dx.doi.org/10.1177/2631774521990623
work_keys_str_mv AT gutierrezbeckerbenjamin traininganddeployingadeeplearningmodelforendoscopicseveritygradinginulcerativecolitisusingmulticenterclinicaltrialdata
AT arcadufilippo traininganddeployingadeeplearningmodelforendoscopicseveritygradinginulcerativecolitisusingmulticenterclinicaltrialdata
AT thalhammerandreas traininganddeployingadeeplearningmodelforendoscopicseveritygradinginulcerativecolitisusingmulticenterclinicaltrialdata
AT gamezsernacitlalli traininganddeployingadeeplearningmodelforendoscopicseveritygradinginulcerativecolitisusingmulticenterclinicaltrialdata
AT feehanowen traininganddeployingadeeplearningmodelforendoscopicseveritygradinginulcerativecolitisusingmulticenterclinicaltrialdata
AT drawnelfaye traininganddeployingadeeplearningmodelforendoscopicseveritygradinginulcerativecolitisusingmulticenterclinicaltrialdata
AT ohyoungs traininganddeployingadeeplearningmodelforendoscopicseveritygradinginulcerativecolitisusingmulticenterclinicaltrialdata
AT prunottomarco traininganddeployingadeeplearningmodelforendoscopicseveritygradinginulcerativecolitisusingmulticenterclinicaltrialdata