Cargando…

Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program

Deep learning algorithms have been used to detect diabetic retinopathy (DR) with specialist-level accuracy. This study aims to validate one such algorithm on a large-scale clinical population, and compare the algorithm performance with that of human graders. A total of 25,326 gradable retinal images...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruamviboonsuk, Paisan, Krause, Jonathan, Chotcomwongse, Peranut, Sayres, Rory, Raman, Rajiv, Widner, Kasumi, Campana, Bilson J. L., Phene, Sonia, Hemarat, Kornwipa, Tadarati, Mongkol, Silpa-Archa, Sukhum, Limwattanayingyong, Jirawut, Rao, Chetan, Kuruvilla, Oscar, Jung, Jesse, Tan, Jeffrey, Orprayoon, Surapong, Kangwanwongpaisan, Chawawat, Sukumalpaiboon, Ramase, Luengchaichawang, Chainarong, Fuangkaew, Jitumporn, Kongsap, Pipat, Chualinpha, Lamyong, Saree, Sarawuth, Kawinpanitan, Srirut, Mitvongsa, Korntip, Lawanasakol, Siriporn, Thepchatri, Chaiyasit, Wongpichedchai, Lalita, Corrado, Greg S., Peng, Lily, Webster, Dale R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6550283/
https://www.ncbi.nlm.nih.gov/pubmed/31304372
http://dx.doi.org/10.1038/s41746-019-0099-8
_version_ 1783424167322845184
author Ruamviboonsuk, Paisan
Krause, Jonathan
Chotcomwongse, Peranut
Sayres, Rory
Raman, Rajiv
Widner, Kasumi
Campana, Bilson J. L.
Phene, Sonia
Hemarat, Kornwipa
Tadarati, Mongkol
Silpa-Archa, Sukhum
Limwattanayingyong, Jirawut
Rao, Chetan
Kuruvilla, Oscar
Jung, Jesse
Tan, Jeffrey
Orprayoon, Surapong
Kangwanwongpaisan, Chawawat
Sukumalpaiboon, Ramase
Luengchaichawang, Chainarong
Fuangkaew, Jitumporn
Kongsap, Pipat
Chualinpha, Lamyong
Saree, Sarawuth
Kawinpanitan, Srirut
Mitvongsa, Korntip
Lawanasakol, Siriporn
Thepchatri, Chaiyasit
Wongpichedchai, Lalita
Corrado, Greg S.
Peng, Lily
Webster, Dale R.
author_facet Ruamviboonsuk, Paisan
Krause, Jonathan
Chotcomwongse, Peranut
Sayres, Rory
Raman, Rajiv
Widner, Kasumi
Campana, Bilson J. L.
Phene, Sonia
Hemarat, Kornwipa
Tadarati, Mongkol
Silpa-Archa, Sukhum
Limwattanayingyong, Jirawut
Rao, Chetan
Kuruvilla, Oscar
Jung, Jesse
Tan, Jeffrey
Orprayoon, Surapong
Kangwanwongpaisan, Chawawat
Sukumalpaiboon, Ramase
Luengchaichawang, Chainarong
Fuangkaew, Jitumporn
Kongsap, Pipat
Chualinpha, Lamyong
Saree, Sarawuth
Kawinpanitan, Srirut
Mitvongsa, Korntip
Lawanasakol, Siriporn
Thepchatri, Chaiyasit
Wongpichedchai, Lalita
Corrado, Greg S.
Peng, Lily
Webster, Dale R.
author_sort Ruamviboonsuk, Paisan
collection PubMed
description Deep learning algorithms have been used to detect diabetic retinopathy (DR) with specialist-level accuracy. This study aims to validate one such algorithm on a large-scale clinical population, and compare the algorithm performance with that of human graders. A total of 25,326 gradable retinal images of patients with diabetes from the community-based, nationwide screening program of DR in Thailand were analyzed for DR severity and referable diabetic macular edema (DME). Grades adjudicated by a panel of international retinal specialists served as the reference standard. Relative to human graders, for detecting referable DR (moderate NPDR or worse), the deep learning algorithm had significantly higher sensitivity (0.97 vs. 0.74, p < 0.001), and a slightly lower specificity (0.96 vs. 0.98, p < 0.001). Higher sensitivity of the algorithm was also observed for each of the categories of severe or worse NPDR, PDR, and DME (p < 0.001 for all comparisons). The quadratic-weighted kappa for determination of DR severity levels by the algorithm and human graders was 0.85 and 0.78 respectively (p < 0.001 for the difference). Across different severity levels of DR for determining referable disease, deep learning significantly reduced the false negative rate (by 23%) at the cost of slightly higher false positive rates (2%). Deep learning algorithms may serve as a valuable tool for DR screening.
format Online
Article
Text
id pubmed-6550283
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-65502832019-07-12 Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program Ruamviboonsuk, Paisan Krause, Jonathan Chotcomwongse, Peranut Sayres, Rory Raman, Rajiv Widner, Kasumi Campana, Bilson J. L. Phene, Sonia Hemarat, Kornwipa Tadarati, Mongkol Silpa-Archa, Sukhum Limwattanayingyong, Jirawut Rao, Chetan Kuruvilla, Oscar Jung, Jesse Tan, Jeffrey Orprayoon, Surapong Kangwanwongpaisan, Chawawat Sukumalpaiboon, Ramase Luengchaichawang, Chainarong Fuangkaew, Jitumporn Kongsap, Pipat Chualinpha, Lamyong Saree, Sarawuth Kawinpanitan, Srirut Mitvongsa, Korntip Lawanasakol, Siriporn Thepchatri, Chaiyasit Wongpichedchai, Lalita Corrado, Greg S. Peng, Lily Webster, Dale R. NPJ Digit Med Article Deep learning algorithms have been used to detect diabetic retinopathy (DR) with specialist-level accuracy. This study aims to validate one such algorithm on a large-scale clinical population, and compare the algorithm performance with that of human graders. A total of 25,326 gradable retinal images of patients with diabetes from the community-based, nationwide screening program of DR in Thailand were analyzed for DR severity and referable diabetic macular edema (DME). Grades adjudicated by a panel of international retinal specialists served as the reference standard. Relative to human graders, for detecting referable DR (moderate NPDR or worse), the deep learning algorithm had significantly higher sensitivity (0.97 vs. 0.74, p < 0.001), and a slightly lower specificity (0.96 vs. 0.98, p < 0.001). Higher sensitivity of the algorithm was also observed for each of the categories of severe or worse NPDR, PDR, and DME (p < 0.001 for all comparisons). The quadratic-weighted kappa for determination of DR severity levels by the algorithm and human graders was 0.85 and 0.78 respectively (p < 0.001 for the difference). Across different severity levels of DR for determining referable disease, deep learning significantly reduced the false negative rate (by 23%) at the cost of slightly higher false positive rates (2%). Deep learning algorithms may serve as a valuable tool for DR screening. Nature Publishing Group UK 2019-04-10 /pmc/articles/PMC6550283/ /pubmed/31304372 http://dx.doi.org/10.1038/s41746-019-0099-8 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Ruamviboonsuk, Paisan
Krause, Jonathan
Chotcomwongse, Peranut
Sayres, Rory
Raman, Rajiv
Widner, Kasumi
Campana, Bilson J. L.
Phene, Sonia
Hemarat, Kornwipa
Tadarati, Mongkol
Silpa-Archa, Sukhum
Limwattanayingyong, Jirawut
Rao, Chetan
Kuruvilla, Oscar
Jung, Jesse
Tan, Jeffrey
Orprayoon, Surapong
Kangwanwongpaisan, Chawawat
Sukumalpaiboon, Ramase
Luengchaichawang, Chainarong
Fuangkaew, Jitumporn
Kongsap, Pipat
Chualinpha, Lamyong
Saree, Sarawuth
Kawinpanitan, Srirut
Mitvongsa, Korntip
Lawanasakol, Siriporn
Thepchatri, Chaiyasit
Wongpichedchai, Lalita
Corrado, Greg S.
Peng, Lily
Webster, Dale R.
Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program
title Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program
title_full Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program
title_fullStr Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program
title_full_unstemmed Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program
title_short Deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program
title_sort deep learning versus human graders for classifying diabetic retinopathy severity in a nationwide screening program
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6550283/
https://www.ncbi.nlm.nih.gov/pubmed/31304372
http://dx.doi.org/10.1038/s41746-019-0099-8
work_keys_str_mv AT ruamviboonsukpaisan deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT krausejonathan deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT chotcomwongseperanut deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT sayresrory deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT ramanrajiv deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT widnerkasumi deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT campanabilsonjl deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT phenesonia deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT hemaratkornwipa deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT tadaratimongkol deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT silpaarchasukhum deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT limwattanayingyongjirawut deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT raochetan deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT kuruvillaoscar deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT jungjesse deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT tanjeffrey deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT orprayoonsurapong deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT kangwanwongpaisanchawawat deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT sukumalpaiboonramase deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT luengchaichawangchainarong deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT fuangkaewjitumporn deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT kongsappipat deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT chualinphalamyong deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT sareesarawuth deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT kawinpanitansrirut deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT mitvongsakorntip deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT lawanasakolsiriporn deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT thepchatrichaiyasit deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT wongpichedchailalita deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT corradogregs deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT penglily deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram
AT websterdaler deeplearningversushumangradersforclassifyingdiabeticretinopathyseverityinanationwidescreeningprogram