Cargando…
Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program
Deep learning algorithms have been used to detect diabetic retinopathy (DR) with specialist-level accuracy. This study aims to validate one such algorithm on a large-scale clinical population, and compare the algorithm performance with that of human graders. A total of 25,326 gradable retinal images...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6550283/ https://www.ncbi.nlm.nih.gov/pubmed/31304372 http://dx.doi.org/10.1038/s41746-019-0099-8 |
_version_ | 1783424167322845184 |
---|---|
author | Ruamviboonsuk, Paisan Krause, Jonathan Chotcomwongse, Peranut Sayres, Rory Raman, Rajiv Widner, Kasumi Campana, Bilson J. L. Phene, Sonia Hemarat, Kornwipa Tadarati, Mongkol Silpa-Archa, Sukhum Limwattanayingyong, Jirawut Rao, Chetan Kuruvilla, Oscar Jung, Jesse Tan, Jeffrey Orprayoon, Surapong Kangwanwongpaisan, Chawawat Sukumalpaiboon, Ramase Luengchaichawang, Chainarong Fuangkaew, Jitumporn Kongsap, Pipat Chualinpha, Lamyong Saree, Sarawuth Kawinpanitan, Srirut Mitvongsa, Korntip Lawanasakol, Siriporn Thepchatri, Chaiyasit Wongpichedchai, Lalita Corrado, Greg S. Peng, Lily Webster, Dale R. |
author_facet | Ruamviboonsuk, Paisan Krause, Jonathan Chotcomwongse, Peranut Sayres, Rory Raman, Rajiv Widner, Kasumi Campana, Bilson J. L. Phene, Sonia Hemarat, Kornwipa Tadarati, Mongkol Silpa-Archa, Sukhum Limwattanayingyong, Jirawut Rao, Chetan Kuruvilla, Oscar Jung, Jesse Tan, Jeffrey Orprayoon, Surapong Kangwanwongpaisan, Chawawat Sukumalpaiboon, Ramase Luengchaichawang, Chainarong Fuangkaew, Jitumporn Kongsap, Pipat Chualinpha, Lamyong Saree, Sarawuth Kawinpanitan, Srirut Mitvongsa, Korntip Lawanasakol, Siriporn Thepchatri, Chaiyasit Wongpichedchai, Lalita Corrado, Greg S. Peng, Lily Webster, Dale R. |
author_sort | Ruamviboonsuk, Paisan |
collection | PubMed |
description | Deep learning algorithms have been used to detect diabetic retinopathy (DR) with specialist-level accuracy. This study aims to validate one such algorithm on a large-scale clinical population, and compare the algorithm performance with that of human graders. A total of 25,326 gradable retinal images of patients with diabetes from the community-based, nationwide screening program of DR in Thailand were analyzed for DR severity and referable diabetic macular edema (DME). Grades adjudicated by a panel of international retinal specialists served as the reference standard. Relative to human graders, for detecting referable DR (moderate NPDR or worse), the deep learning algorithm had significantly higher sensitivity (0.97 vs. 0.74, p < 0.001), and a slightly lower specificity (0.96 vs. 0.98, p < 0.001). Higher sensitivity of the algorithm was also observed for each of the categories of severe or worse NPDR, PDR, and DME (p < 0.001 for all comparisons). The quadratic-weighted kappa for determination of DR severity levels by the algorithm and human graders was 0.85 and 0.78 respectively (p < 0.001 for the difference). Across different severity levels of DR for determining referable disease, deep learning significantly reduced the false negative rate (by 23%) at the cost of slightly higher false positive rates (2%). Deep learning algorithms may serve as a valuable tool for DR screening. |
format | Online Article Text |
id | pubmed-6550283 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-65502832019-07-12 Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program Ruamviboonsuk, Paisan Krause, Jonathan Chotcomwongse, Peranut Sayres, Rory Raman, Rajiv Widner, Kasumi Campana, Bilson J. L. Phene, Sonia Hemarat, Kornwipa Tadarati, Mongkol Silpa-Archa, Sukhum Limwattanayingyong, Jirawut Rao, Chetan Kuruvilla, Oscar Jung, Jesse Tan, Jeffrey Orprayoon, Surapong Kangwanwongpaisan, Chawawat Sukumalpaiboon, Ramase Luengchaichawang, Chainarong Fuangkaew, Jitumporn Kongsap, Pipat Chualinpha, Lamyong Saree, Sarawuth Kawinpanitan, Srirut Mitvongsa, Korntip Lawanasakol, Siriporn Thepchatri, Chaiyasit Wongpichedchai, Lalita Corrado, Greg S. Peng, Lily Webster, Dale R. NPJ Digit Med Article Deep learning algorithms have been used to detect diabetic retinopathy (DR) with specialist-level accuracy. This study aims to validate one such algorithm on a large-scale clinical population, and compare the algorithm performance with that of human graders. A total of 25,326 gradable retinal images of patients with diabetes from the community-based, nationwide screening program of DR in Thailand were analyzed for DR severity and referable diabetic macular edema (DME). Grades adjudicated by a panel of international retinal specialists served as the reference standard. Relative to human graders, for detecting referable DR (moderate NPDR or worse), the deep learning algorithm had significantly higher sensitivity (0.97 vs. 0.74, p < 0.001), and a slightly lower specificity (0.96 vs. 0.98, p < 0.001). Higher sensitivity of the algorithm was also observed for each of the categories of severe or worse NPDR, PDR, and DME (p < 0.001 for all comparisons). The quadratic-weighted kappa for determination of DR severity levels by the algorithm and human graders was 0.85 and 0.78 respectively (p < 0.001 for the difference). Across different severity levels of DR for determining referable disease, deep learning significantly reduced the false negative rate (by 23%) at the cost of slightly higher false positive rates (2%). Deep learning algorithms may serve as a valuable tool for DR screening. Nature Publishing Group UK 2019-04-10 /pmc/articles/PMC6550283/ /pubmed/31304372 http://dx.doi.org/10.1038/s41746-019-0099-8 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Ruamviboonsuk, Paisan Krause, Jonathan Chotcomwongse, Peranut Sayres, Rory Raman, Rajiv Widner, Kasumi Campana, Bilson J. L. Phene, Sonia Hemarat, Kornwipa Tadarati, Mongkol Silpa-Archa, Sukhum Limwattanayingyong, Jirawut Rao, Chetan Kuruvilla, Oscar Jung, Jesse Tan, Jeffrey Orprayoon, Surapong Kangwanwongpaisan, Chawawat Sukumalpaiboon, Ramase Luengchaichawang, Chainarong Fuangkaew, Jitumporn Kongsap, Pipat Chualinpha, Lamyong Saree, Sarawuth Kawinpanitan, Srirut Mitvongsa, Korntip Lawanasakol, Siriporn Thepchatri, Chaiyasit Wongpichedchai, Lalita Corrado, Greg S. Peng, Lily Webster, Dale R. Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program |
title | Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program |
title_full | Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program |
title_fullStr | Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program |
title_full_unstemmed | Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program |
title_short | Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program |
title_sort | deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6550283/ https://www.ncbi.nlm.nih.gov/pubmed/31304372 http://dx.doi.org/10.1038/s41746-019-0099-8 |
work_keys_str_mv | AT ruamviboonsukpaisan deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT krausejonathan deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT chotcomwongseperanut deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT sayresrory deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT ramanrajiv deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT widnerkasumi deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT campanabilsonjl deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT phenesonia deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT hemaratkornwipa deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT tadaratimongkol deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT silpaarchasukhum deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT limwattanayingyongjirawut deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT raochetan deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT kuruvillaoscar deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT jungjesse deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT tanjeffrey deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT orprayoonsurapong deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT kangwanwongpaisanchawawat deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT sukumalpaiboonramase deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT luengchaichawangchainarong deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT fuangkaewjitumporn deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT kongsappipat deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT chualinphalamyong deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT sareesarawuth deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT kawinpanitansrirut deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT mitvongsakorntip deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT lawanasakolsiriporn deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT thepchatrichaiyasit deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT wongpichedchailalita deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT corradogregs deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT penglily deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram AT websterdaler deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram |