Cargando…
Technical and imaging factors influencing performance of deep learning systems for diabetic retinopathy
Deep learning (DL) has been shown to be effective in developing diabetic retinopathy (DR) algorithms, possibly tackling financial and manpower challenges hindering implementation of DR screening. However, our systematic review of the literature reveals few studies studied the impact of different fac...
Autores principales: | , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7090044/ https://www.ncbi.nlm.nih.gov/pubmed/32219181 http://dx.doi.org/10.1038/s41746-020-0247-1 |
_version_ | 1783509849095536640 |
---|---|
author | Yip, Michelle Y. T. Lim, Gilbert Lim, Zhan Wei Nguyen, Quang D. Chong, Crystal C. Y. Yu, Marco Bellemo, Valentina Xie, Yuchen Lee, Xin Qi Hamzah, Haslina Ho, Jinyi Tan, Tien-En Sabanayagam, Charumathi Grzybowski, Andrzej Tan, Gavin S. W. Hsu, Wynne Lee, Mong Li Wong, Tien Yin Ting, Daniel S. W. |
author_facet | Yip, Michelle Y. T. Lim, Gilbert Lim, Zhan Wei Nguyen, Quang D. Chong, Crystal C. Y. Yu, Marco Bellemo, Valentina Xie, Yuchen Lee, Xin Qi Hamzah, Haslina Ho, Jinyi Tan, Tien-En Sabanayagam, Charumathi Grzybowski, Andrzej Tan, Gavin S. W. Hsu, Wynne Lee, Mong Li Wong, Tien Yin Ting, Daniel S. W. |
author_sort | Yip, Michelle Y. T. |
collection | PubMed |
description | Deep learning (DL) has been shown to be effective in developing diabetic retinopathy (DR) algorithms, possibly tackling financial and manpower challenges hindering implementation of DR screening. However, our systematic review of the literature reveals few studies studied the impact of different factors on these DL algorithms, that are important for clinical deployment in real-world settings. Using 455,491 retinal images, we evaluated two technical and three image-related factors in detection of referable DR. For technical factors, the performances of four DL models (VGGNet, ResNet, DenseNet, Ensemble) and two computational frameworks (Caffe, TensorFlow) were evaluated while for image-related factors, we evaluated image compression levels (reducing image size, 350, 300, 250, 200, 150 KB), number of fields (7-field, 2-field, 1-field) and media clarity (pseudophakic vs phakic). In detection of referable DR, four DL models showed comparable diagnostic performance (AUC 0.936-0.944). To develop the VGGNet model, two computational frameworks had similar AUC (0.936). The DL performance dropped when image size decreased below 250 KB (AUC 0.936, 0.900, p < 0.001). The DL performance performed better when there were increased number of fields (dataset 1: 2-field vs 1-field—AUC 0.936 vs 0.908, p < 0.001; dataset 2: 7-field vs 2-field vs 1-field, AUC 0.949 vs 0.911 vs 0.895). DL performed better in the pseudophakic than phakic eyes (AUC 0.918 vs 0.833, p < 0.001). Various image-related factors play more significant roles than technical factors in determining the diagnostic performance, suggesting the importance of having robust training and testing datasets for DL training and deployment in the real-world settings. |
format | Online Article Text |
id | pubmed-7090044 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-70900442020-03-26 Technical and imaging factors influencing performance of deep learning systems for diabetic retinopathy Yip, Michelle Y. T. Lim, Gilbert Lim, Zhan Wei Nguyen, Quang D. Chong, Crystal C. Y. Yu, Marco Bellemo, Valentina Xie, Yuchen Lee, Xin Qi Hamzah, Haslina Ho, Jinyi Tan, Tien-En Sabanayagam, Charumathi Grzybowski, Andrzej Tan, Gavin S. W. Hsu, Wynne Lee, Mong Li Wong, Tien Yin Ting, Daniel S. W. NPJ Digit Med Article Deep learning (DL) has been shown to be effective in developing diabetic retinopathy (DR) algorithms, possibly tackling financial and manpower challenges hindering implementation of DR screening. However, our systematic review of the literature reveals few studies studied the impact of different factors on these DL algorithms, that are important for clinical deployment in real-world settings. Using 455,491 retinal images, we evaluated two technical and three image-related factors in detection of referable DR. For technical factors, the performances of four DL models (VGGNet, ResNet, DenseNet, Ensemble) and two computational frameworks (Caffe, TensorFlow) were evaluated while for image-related factors, we evaluated image compression levels (reducing image size, 350, 300, 250, 200, 150 KB), number of fields (7-field, 2-field, 1-field) and media clarity (pseudophakic vs phakic). In detection of referable DR, four DL models showed comparable diagnostic performance (AUC 0.936-0.944). To develop the VGGNet model, two computational frameworks had similar AUC (0.936). The DL performance dropped when image size decreased below 250 KB (AUC 0.936, 0.900, p < 0.001). The DL performance performed better when there were increased number of fields (dataset 1: 2-field vs 1-field—AUC 0.936 vs 0.908, p < 0.001; dataset 2: 7-field vs 2-field vs 1-field, AUC 0.949 vs 0.911 vs 0.895). DL performed better in the pseudophakic than phakic eyes (AUC 0.918 vs 0.833, p < 0.001). Various image-related factors play more significant roles than technical factors in determining the diagnostic performance, suggesting the importance of having robust training and testing datasets for DL training and deployment in the real-world settings. Nature Publishing Group UK 2020-03-23 /pmc/articles/PMC7090044/ /pubmed/32219181 http://dx.doi.org/10.1038/s41746-020-0247-1 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Yip, Michelle Y. T. Lim, Gilbert Lim, Zhan Wei Nguyen, Quang D. Chong, Crystal C. Y. Yu, Marco Bellemo, Valentina Xie, Yuchen Lee, Xin Qi Hamzah, Haslina Ho, Jinyi Tan, Tien-En Sabanayagam, Charumathi Grzybowski, Andrzej Tan, Gavin S. W. Hsu, Wynne Lee, Mong Li Wong, Tien Yin Ting, Daniel S. W. Technical and imaging factors influencing performance of deep learning systems for diabetic retinopathy |
title | Technical and imaging factors influencing performance of deep learning systems for diabetic retinopathy |
title_full | Technical and imaging factors influencing performance of deep learning systems for diabetic retinopathy |
title_fullStr | Technical and imaging factors influencing performance of deep learning systems for diabetic retinopathy |
title_full_unstemmed | Technical and imaging factors influencing performance of deep learning systems for diabetic retinopathy |
title_short | Technical and imaging factors influencing performance of deep learning systems for diabetic retinopathy |
title_sort | technical and imaging factors influencing performance of deep learning systems for diabetic retinopathy |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7090044/ https://www.ncbi.nlm.nih.gov/pubmed/32219181 http://dx.doi.org/10.1038/s41746-020-0247-1 |
work_keys_str_mv | AT yipmichelleyt technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT limgilbert technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT limzhanwei technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT nguyenquangd technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT chongcrystalcy technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT yumarco technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT bellemovalentina technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT xieyuchen technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT leexinqi technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT hamzahhaslina technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT hojinyi technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT tantienen technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT sabanayagamcharumathi technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT grzybowskiandrzej technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT tangavinsw technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT hsuwynne technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT leemongli technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT wongtienyin technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy AT tingdanielsw technicalandimagingfactorsinfluencingperformanceofdeeplearningsystemsfordiabeticretinopathy |