Cargando…
Training a computer-aided polyp detection system to detect sessile serrated adenomas using public domain colonoscopy videos
Background Colorectal cancer (CRC) is a major public health burden worldwide, and colonoscopy is the most commonly used CRC screening tool. Still, there is variability in adenoma detection rate (ADR) among endoscopists. Recent studies have reported improved ADR using deep learning models trained on...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Georg Thieme Verlag KG
2020
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7541193/ https://www.ncbi.nlm.nih.gov/pubmed/33043112 http://dx.doi.org/10.1055/a-1229-3927 |
_version_ | 1783591356626632704 |
---|---|
author | Li, Taibo Glissen Brown, Jeremy R. Tsourides, Kelovoulos Mahmud, Nadim Cohen, Jonah M. Berzin, Tyler M. |
author_facet | Li, Taibo Glissen Brown, Jeremy R. Tsourides, Kelovoulos Mahmud, Nadim Cohen, Jonah M. Berzin, Tyler M. |
author_sort | Li, Taibo |
collection | PubMed |
description | Background Colorectal cancer (CRC) is a major public health burden worldwide, and colonoscopy is the most commonly used CRC screening tool. Still, there is variability in adenoma detection rate (ADR) among endoscopists. Recent studies have reported improved ADR using deep learning models trained on videos curated largely from private in-house datasets. Few have focused on the detection of sessile serrated adenomas (SSAs), which are the most challenging target clinically. Methods We identified 23 colonoscopy videos available in the public domain and for which pathology data were provided, totaling 390 minutes of footage. Expert endoscopists annotated segments of video with adenomatous polyps, from which we captured 509 polyp-positive and 6,875 polyp-free frames. Via data augmentation, we generated 15,270 adenomatous polyp-positive images, of which 2,310 were SSAs, and 20,625 polyp-negative images. We used the CNN AlexNet and fine-tuned its parameters using 90 % of the images, before testing its performance on the remaining 10 % of images unseen by the model. Results We trained the model on 32,305 images and tested performance on 3,590 images with the same proportion of SSA, non-SSA polyp-positive, and polyp-negative images. The overall accuracy of the model was 0.86, with a sensitivity of 0.73 and a specificity of 0.96. Positive predictive value was 0.93 and negative predictive value was 0.96. The area under the curve was 0.94. SSAs were detected in 93 % of SSA-positive images. Conclusions Using a relatively small set of publicly-available colonoscopy data, we obtained sizable training and validation sets of endoscopic images using data augmentation, and achieved an excellent performance in adenomatous polyp detection. |
format | Online Article Text |
id | pubmed-7541193 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Georg Thieme Verlag KG |
record_format | MEDLINE/PubMed |
spelling | pubmed-75411932020-10-09 Training a computer-aided polyp detection system to detect sessile serrated adenomas using public domain colonoscopy videos Li, Taibo Glissen Brown, Jeremy R. Tsourides, Kelovoulos Mahmud, Nadim Cohen, Jonah M. Berzin, Tyler M. Endosc Int Open Background Colorectal cancer (CRC) is a major public health burden worldwide, and colonoscopy is the most commonly used CRC screening tool. Still, there is variability in adenoma detection rate (ADR) among endoscopists. Recent studies have reported improved ADR using deep learning models trained on videos curated largely from private in-house datasets. Few have focused on the detection of sessile serrated adenomas (SSAs), which are the most challenging target clinically. Methods We identified 23 colonoscopy videos available in the public domain and for which pathology data were provided, totaling 390 minutes of footage. Expert endoscopists annotated segments of video with adenomatous polyps, from which we captured 509 polyp-positive and 6,875 polyp-free frames. Via data augmentation, we generated 15,270 adenomatous polyp-positive images, of which 2,310 were SSAs, and 20,625 polyp-negative images. We used the CNN AlexNet and fine-tuned its parameters using 90 % of the images, before testing its performance on the remaining 10 % of images unseen by the model. Results We trained the model on 32,305 images and tested performance on 3,590 images with the same proportion of SSA, non-SSA polyp-positive, and polyp-negative images. The overall accuracy of the model was 0.86, with a sensitivity of 0.73 and a specificity of 0.96. Positive predictive value was 0.93 and negative predictive value was 0.96. The area under the curve was 0.94. SSAs were detected in 93 % of SSA-positive images. Conclusions Using a relatively small set of publicly-available colonoscopy data, we obtained sizable training and validation sets of endoscopic images using data augmentation, and achieved an excellent performance in adenomatous polyp detection. Georg Thieme Verlag KG 2020-10 2020-10-07 /pmc/articles/PMC7541193/ /pubmed/33043112 http://dx.doi.org/10.1055/a-1229-3927 Text en The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commecial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/) https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License, which permits unrestricted reproduction and distribution, for non-commercial purposes only; and use and reproduction, but not distribution, of adapted material for non-commercial purposes only, provided the original work is properly cited. |
spellingShingle | Li, Taibo Glissen Brown, Jeremy R. Tsourides, Kelovoulos Mahmud, Nadim Cohen, Jonah M. Berzin, Tyler M. Training a computer-aided polyp detection system to detect sessile serrated adenomas using public domain colonoscopy videos |
title | Training a computer-aided polyp detection system to detect sessile serrated adenomas using public domain colonoscopy videos |
title_full | Training a computer-aided polyp detection system to detect sessile serrated adenomas using public domain colonoscopy videos |
title_fullStr | Training a computer-aided polyp detection system to detect sessile serrated adenomas using public domain colonoscopy videos |
title_full_unstemmed | Training a computer-aided polyp detection system to detect sessile serrated adenomas using public domain colonoscopy videos |
title_short | Training a computer-aided polyp detection system to detect sessile serrated adenomas using public domain colonoscopy videos |
title_sort | training a computer-aided polyp detection system to detect sessile serrated adenomas using public domain colonoscopy videos |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7541193/ https://www.ncbi.nlm.nih.gov/pubmed/33043112 http://dx.doi.org/10.1055/a-1229-3927 |
work_keys_str_mv | AT litaibo trainingacomputeraidedpolypdetectionsystemtodetectsessileserratedadenomasusingpublicdomaincolonoscopyvideos AT glissenbrownjeremyr trainingacomputeraidedpolypdetectionsystemtodetectsessileserratedadenomasusingpublicdomaincolonoscopyvideos AT tsourideskelovoulos trainingacomputeraidedpolypdetectionsystemtodetectsessileserratedadenomasusingpublicdomaincolonoscopyvideos AT mahmudnadim trainingacomputeraidedpolypdetectionsystemtodetectsessileserratedadenomasusingpublicdomaincolonoscopyvideos AT cohenjonahm trainingacomputeraidedpolypdetectionsystemtodetectsessileserratedadenomasusingpublicdomaincolonoscopyvideos AT berzintylerm trainingacomputeraidedpolypdetectionsystemtodetectsessileserratedadenomasusingpublicdomaincolonoscopyvideos |