Cargando…

Effects of Study Population, Labeling and Training on Glaucoma Detection Using Deep Learning Algorithms

PURPOSE: To compare performance of independently developed deep learning algorithms for detecting glaucoma from fundus photographs and to evaluate strategies for incorporating new data into models. METHODS: Two fundus photograph datasets from the Diagnostic Innovations in Glaucoma Study/African Desc...

Descripción completa

Detalles Bibliográficos
Autores principales: Christopher, Mark, Nakahara, Kenichi, Bowd, Christopher, Proudfoot, James A., Belghith, Akram, Goldbaum, Michael H., Rezapour, Jasmin, Weinreb, Robert N., Fazio, Massimo A., Girkin, Christopher A., Liebmann, Jeffrey M., De Moraes, Gustavo, Murata, Hiroshi, Tokumo, Kana, Shibata, Naoto, Fujino, Yuri, Matsuura, Masato, Kiuchi, Yoshiaki, Tanito, Masaki, Asaoka, Ryo, Zangwill, Linda M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Association for Research in Vision and Ophthalmology 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7396194/
https://www.ncbi.nlm.nih.gov/pubmed/32818088
http://dx.doi.org/10.1167/tvst.9.2.27
_version_ 1783565541233917952
author Christopher, Mark
Nakahara, Kenichi
Bowd, Christopher
Proudfoot, James A.
Belghith, Akram
Goldbaum, Michael H.
Rezapour, Jasmin
Weinreb, Robert N.
Fazio, Massimo A.
Girkin, Christopher A.
Liebmann, Jeffrey M.
De Moraes, Gustavo
Murata, Hiroshi
Tokumo, Kana
Shibata, Naoto
Fujino, Yuri
Matsuura, Masato
Kiuchi, Yoshiaki
Tanito, Masaki
Asaoka, Ryo
Zangwill, Linda M.
author_facet Christopher, Mark
Nakahara, Kenichi
Bowd, Christopher
Proudfoot, James A.
Belghith, Akram
Goldbaum, Michael H.
Rezapour, Jasmin
Weinreb, Robert N.
Fazio, Massimo A.
Girkin, Christopher A.
Liebmann, Jeffrey M.
De Moraes, Gustavo
Murata, Hiroshi
Tokumo, Kana
Shibata, Naoto
Fujino, Yuri
Matsuura, Masato
Kiuchi, Yoshiaki
Tanito, Masaki
Asaoka, Ryo
Zangwill, Linda M.
author_sort Christopher, Mark
collection PubMed
description PURPOSE: To compare performance of independently developed deep learning algorithms for detecting glaucoma from fundus photographs and to evaluate strategies for incorporating new data into models. METHODS: Two fundus photograph datasets from the Diagnostic Innovations in Glaucoma Study/African Descent and Glaucoma Evaluation Study and Matsue Red Cross Hospital were used to independently develop deep learning algorithms for detection of glaucoma at the University of California, San Diego, and the University of Tokyo. We compared three versions of the University of California, San Diego, and University of Tokyo models: original (no retraining), sequential (retraining only on new data), and combined (training on combined data). Independent datasets were used to test the algorithms. RESULTS: The original University of California, San Diego and University of Tokyo models performed similarly (area under the receiver operating characteristic curve = 0.96 and 0.97, respectively) for detection of glaucoma in the Matsue Red Cross Hospital dataset, but not the Diagnostic Innovations in Glaucoma Study/African Descent and Glaucoma Evaluation Study data (0.79 and 0.92; P < .001), respectively. Model performance was higher when classifying moderate-to-severe compared with mild disease (area under the receiver operating characteristic curve = 0.98 and 0.91; P < .001), respectively. Models trained with the combined strategy generally had better performance across all datasets than the original strategy. CONCLUSIONS: Deep learning glaucoma detection can achieve high accuracy across diverse datasets with appropriate training strategies. Because model performance was influenced by the severity of disease, labeling, training strategies, and population characteristics, reporting accuracy stratified by relevant covariates is important for cross study comparisons. TRANSLATIONAL RELEVANCE: High sensitivity and specificity of deep learning algorithms for moderate-to-severe glaucoma across diverse populations suggest a role for artificial intelligence in the detection of glaucoma in primary care.
format Online
Article
Text
id pubmed-7396194
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher The Association for Research in Vision and Ophthalmology
record_format MEDLINE/PubMed
spelling pubmed-73961942020-08-17 Effects of Study Population, Labeling and Training on Glaucoma Detection Using Deep Learning Algorithms Christopher, Mark Nakahara, Kenichi Bowd, Christopher Proudfoot, James A. Belghith, Akram Goldbaum, Michael H. Rezapour, Jasmin Weinreb, Robert N. Fazio, Massimo A. Girkin, Christopher A. Liebmann, Jeffrey M. De Moraes, Gustavo Murata, Hiroshi Tokumo, Kana Shibata, Naoto Fujino, Yuri Matsuura, Masato Kiuchi, Yoshiaki Tanito, Masaki Asaoka, Ryo Zangwill, Linda M. Transl Vis Sci Technol Special Issue PURPOSE: To compare performance of independently developed deep learning algorithms for detecting glaucoma from fundus photographs and to evaluate strategies for incorporating new data into models. METHODS: Two fundus photograph datasets from the Diagnostic Innovations in Glaucoma Study/African Descent and Glaucoma Evaluation Study and Matsue Red Cross Hospital were used to independently develop deep learning algorithms for detection of glaucoma at the University of California, San Diego, and the University of Tokyo. We compared three versions of the University of California, San Diego, and University of Tokyo models: original (no retraining), sequential (retraining only on new data), and combined (training on combined data). Independent datasets were used to test the algorithms. RESULTS: The original University of California, San Diego and University of Tokyo models performed similarly (area under the receiver operating characteristic curve = 0.96 and 0.97, respectively) for detection of glaucoma in the Matsue Red Cross Hospital dataset, but not the Diagnostic Innovations in Glaucoma Study/African Descent and Glaucoma Evaluation Study data (0.79 and 0.92; P < .001), respectively. Model performance was higher when classifying moderate-to-severe compared with mild disease (area under the receiver operating characteristic curve = 0.98 and 0.91; P < .001), respectively. Models trained with the combined strategy generally had better performance across all datasets than the original strategy. CONCLUSIONS: Deep learning glaucoma detection can achieve high accuracy across diverse datasets with appropriate training strategies. Because model performance was influenced by the severity of disease, labeling, training strategies, and population characteristics, reporting accuracy stratified by relevant covariates is important for cross study comparisons. TRANSLATIONAL RELEVANCE: High sensitivity and specificity of deep learning algorithms for moderate-to-severe glaucoma across diverse populations suggest a role for artificial intelligence in the detection of glaucoma in primary care. The Association for Research in Vision and Ophthalmology 2020-04-28 /pmc/articles/PMC7396194/ /pubmed/32818088 http://dx.doi.org/10.1167/tvst.9.2.27 Text en Copyright 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
spellingShingle Special Issue
Christopher, Mark
Nakahara, Kenichi
Bowd, Christopher
Proudfoot, James A.
Belghith, Akram
Goldbaum, Michael H.
Rezapour, Jasmin
Weinreb, Robert N.
Fazio, Massimo A.
Girkin, Christopher A.
Liebmann, Jeffrey M.
De Moraes, Gustavo
Murata, Hiroshi
Tokumo, Kana
Shibata, Naoto
Fujino, Yuri
Matsuura, Masato
Kiuchi, Yoshiaki
Tanito, Masaki
Asaoka, Ryo
Zangwill, Linda M.
Effects of Study Population, Labeling and Training on Glaucoma Detection Using Deep Learning Algorithms
title Effects of Study Population, Labeling and Training on Glaucoma Detection Using Deep Learning Algorithms
title_full Effects of Study Population, Labeling and Training on Glaucoma Detection Using Deep Learning Algorithms
title_fullStr Effects of Study Population, Labeling and Training on Glaucoma Detection Using Deep Learning Algorithms
title_full_unstemmed Effects of Study Population, Labeling and Training on Glaucoma Detection Using Deep Learning Algorithms
title_short Effects of Study Population, Labeling and Training on Glaucoma Detection Using Deep Learning Algorithms
title_sort effects of study population, labeling and training on glaucoma detection using deep learning algorithms
topic Special Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7396194/
https://www.ncbi.nlm.nih.gov/pubmed/32818088
http://dx.doi.org/10.1167/tvst.9.2.27
work_keys_str_mv AT christophermark effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT nakaharakenichi effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT bowdchristopher effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT proudfootjamesa effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT belghithakram effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT goldbaummichaelh effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT rezapourjasmin effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT weinrebrobertn effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT faziomassimoa effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT girkinchristophera effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT liebmannjeffreym effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT demoraesgustavo effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT muratahiroshi effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT tokumokana effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT shibatanaoto effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT fujinoyuri effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT matsuuramasato effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT kiuchiyoshiaki effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT tanitomasaki effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT asaokaryo effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms
AT zangwilllindam effectsofstudypopulationlabelingandtrainingonglaucomadetectionusingdeeplearningalgorithms