Cargando…

Validating the Generalizability of Ophthalmic Artificial Intelligence Models on Real-World Clinical Data

PURPOSE: This study aims to investigate generalizability of deep learning (DL) models trained on commonly used public fundus images to an instance of real-world data (RWD) for glaucoma diagnosis. METHODS: We used Illinois Eye and Ear Infirmary fundus data set as an instance of RWD in addition to six...

Descripción completa

Detalles Bibliográficos
Autores principales:	Rashidisabet, Homa, Sethi, Abhishek, Jindarak, Ponpawee, Edmonds, James, Chan, R. V. Paul, Leiderman, Yannek I., Vajaranant, Thasarat Sutabutr, Yi, Darvin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	The Association for Research in Vision and Ophthalmology 2023
Materias:	Artificial Intelligence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10629532/ https://www.ncbi.nlm.nih.gov/pubmed/37922149 http://dx.doi.org/10.1167/tvst.12.11.8

_version_	1785131980862521344
author	Rashidisabet, Homa Sethi, Abhishek Jindarak, Ponpawee Edmonds, James Chan, R. V. Paul Leiderman, Yannek I. Vajaranant, Thasarat Sutabutr Yi, Darvin
author_facet	Rashidisabet, Homa Sethi, Abhishek Jindarak, Ponpawee Edmonds, James Chan, R. V. Paul Leiderman, Yannek I. Vajaranant, Thasarat Sutabutr Yi, Darvin
author_sort	Rashidisabet, Homa
collection	PubMed
description	PURPOSE: This study aims to investigate generalizability of deep learning (DL) models trained on commonly used public fundus images to an instance of real-world data (RWD) for glaucoma diagnosis. METHODS: We used Illinois Eye and Ear Infirmary fundus data set as an instance of RWD in addition to six publicly available fundus data sets. We compared the performance of DL-trained models on public data and RWD for glaucoma classification and optic disc (OD) segmentation tasks. For each task, we created models trained on each data set, respectively, and each model was tested on both data sets. We further examined each model's decision-making process and learned embeddings for the glaucoma classification task. RESULTS: Using public data for the test set, public-trained models outperformed RWD-trained models in OD segmentation and glaucoma classification with a mean intersection over union of 96.3% and mean area under the receiver operating characteristic curve of 95.0%, respectively. Using the RWD test set, the performance of public models decreased by 8.0% and 18.4% to 85.6% and 76.6% for OD segmentation and glaucoma classification tasks, respectively. RWD models outperformed public models on RWD test sets by 2.0% and 9.5%, respectively, in OD segmentation and glaucoma classification tasks. CONCLUSIONS: DL models trained on commonly used public data have limited ability to generalize to RWD for classifying glaucoma. They perform similarly to RWD models for OD segmentation. TRANSLATIONAL RELEVANCE: RWD is a potential solution for improving generalizability of DL models and enabling clinical translations in the care of prevalent blinding ophthalmic conditions, such as glaucoma.
format	Online Article Text
id	pubmed-10629532
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	The Association for Research in Vision and Ophthalmology
record_format	MEDLINE/PubMed
spelling	pubmed-106295322023-11-08 Validating the Generalizability of Ophthalmic Artificial Intelligence Models on Real-World Clinical Data Rashidisabet, Homa Sethi, Abhishek Jindarak, Ponpawee Edmonds, James Chan, R. V. Paul Leiderman, Yannek I. Vajaranant, Thasarat Sutabutr Yi, Darvin Transl Vis Sci Technol Artificial Intelligence PURPOSE: This study aims to investigate generalizability of deep learning (DL) models trained on commonly used public fundus images to an instance of real-world data (RWD) for glaucoma diagnosis. METHODS: We used Illinois Eye and Ear Infirmary fundus data set as an instance of RWD in addition to six publicly available fundus data sets. We compared the performance of DL-trained models on public data and RWD for glaucoma classification and optic disc (OD) segmentation tasks. For each task, we created models trained on each data set, respectively, and each model was tested on both data sets. We further examined each model's decision-making process and learned embeddings for the glaucoma classification task. RESULTS: Using public data for the test set, public-trained models outperformed RWD-trained models in OD segmentation and glaucoma classification with a mean intersection over union of 96.3% and mean area under the receiver operating characteristic curve of 95.0%, respectively. Using the RWD test set, the performance of public models decreased by 8.0% and 18.4% to 85.6% and 76.6% for OD segmentation and glaucoma classification tasks, respectively. RWD models outperformed public models on RWD test sets by 2.0% and 9.5%, respectively, in OD segmentation and glaucoma classification tasks. CONCLUSIONS: DL models trained on commonly used public data have limited ability to generalize to RWD for classifying glaucoma. They perform similarly to RWD models for OD segmentation. TRANSLATIONAL RELEVANCE: RWD is a potential solution for improving generalizability of DL models and enabling clinical translations in the care of prevalent blinding ophthalmic conditions, such as glaucoma. The Association for Research in Vision and Ophthalmology 2023-11-03 /pmc/articles/PMC10629532/ /pubmed/37922149 http://dx.doi.org/10.1167/tvst.12.11.8 Text en Copyright 2023 The Authors https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License.
spellingShingle	Artificial Intelligence Rashidisabet, Homa Sethi, Abhishek Jindarak, Ponpawee Edmonds, James Chan, R. V. Paul Leiderman, Yannek I. Vajaranant, Thasarat Sutabutr Yi, Darvin Validating the Generalizability of Ophthalmic Artificial Intelligence Models on Real-World Clinical Data
title	Validating the Generalizability of Ophthalmic Artificial Intelligence Models on Real-World Clinical Data
title_full	Validating the Generalizability of Ophthalmic Artificial Intelligence Models on Real-World Clinical Data
title_fullStr	Validating the Generalizability of Ophthalmic Artificial Intelligence Models on Real-World Clinical Data
title_full_unstemmed	Validating the Generalizability of Ophthalmic Artificial Intelligence Models on Real-World Clinical Data
title_short	Validating the Generalizability of Ophthalmic Artificial Intelligence Models on Real-World Clinical Data
title_sort	validating the generalizability of ophthalmic artificial intelligence models on real-world clinical data
topic	Artificial Intelligence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10629532/ https://www.ncbi.nlm.nih.gov/pubmed/37922149 http://dx.doi.org/10.1167/tvst.12.11.8
work_keys_str_mv	AT rashidisabethoma validatingthegeneralizabilityofophthalmicartificialintelligencemodelsonrealworldclinicaldata AT sethiabhishek validatingthegeneralizabilityofophthalmicartificialintelligencemodelsonrealworldclinicaldata AT jindarakponpawee validatingthegeneralizabilityofophthalmicartificialintelligencemodelsonrealworldclinicaldata AT edmondsjames validatingthegeneralizabilityofophthalmicartificialintelligencemodelsonrealworldclinicaldata AT chanrvpaul validatingthegeneralizabilityofophthalmicartificialintelligencemodelsonrealworldclinicaldata AT leidermanyanneki validatingthegeneralizabilityofophthalmicartificialintelligencemodelsonrealworldclinicaldata AT vajaranantthasaratsutabutr validatingthegeneralizabilityofophthalmicartificialintelligencemodelsonrealworldclinicaldata AT yidarvin validatingthegeneralizabilityofophthalmicartificialintelligencemodelsonrealworldclinicaldata

Validating the Generalizability of Ophthalmic Artificial Intelligence Models on Real-World Clinical Data

Ejemplares similares