Cargando…

Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review

PURPOSE: Artificial intelligence (AI) offers considerable promise for retinopathy of prematurity (ROP) screening and diagnosis. The development of deep-learning algorithms to detect the presence of disease may contribute to sufficient screening, early detection, and timely treatment for this prevent...

Descripción completa

Detalles Bibliográficos
Autores principales: Bai, Amelia, Carty, Christopher, Dai, Shuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Wolters Kluwer - Medknow 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9583359/
https://www.ncbi.nlm.nih.gov/pubmed/36276252
http://dx.doi.org/10.4103/sjopt.sjopt_219_21
_version_ 1784813056015990784
author Bai, Amelia
Carty, Christopher
Dai, Shuan
author_facet Bai, Amelia
Carty, Christopher
Dai, Shuan
author_sort Bai, Amelia
collection PubMed
description PURPOSE: Artificial intelligence (AI) offers considerable promise for retinopathy of prematurity (ROP) screening and diagnosis. The development of deep-learning algorithms to detect the presence of disease may contribute to sufficient screening, early detection, and timely treatment for this preventable blinding disease. This review aimed to systematically examine the literature in AI algorithms in detecting ROP. Specifically, we focused on the performance of deep-learning algorithms through sensitivity, specificity, and area under the receiver operating curve (AUROC) for both the detection and grade of ROP. METHODS: We searched Medline OVID, PubMed, Web of Science, and Embase for studies published from January 1, 2012, to September 20, 2021. Studies evaluating the diagnostic performance of deep-learning models based on retinal fundus images with expert ophthalmologists' judgment as reference standard were included. Studies which did not investigate the presence or absence of disease were excluded. Risk of bias was assessed using the QUADAS-2 tool. RESULTS: Twelve studies out of the 175 studies identified were included. Five studies measured the performance of detecting the presence of ROP and seven studies determined the presence of plus disease. The average AUROC out of 11 studies was 0.98. The average sensitivity and specificity for detecting ROP was 95.72% and 98.15%, respectively, and for detecting plus disease was 91.13% and 95.92%, respectively. CONCLUSION: The diagnostic performance of deep-learning algorithms in published studies was high. Few studies presented externally validated results or compared performance to expert human graders. Large scale prospective validation alongside robust study design could improve future studies.
format Online
Article
Text
id pubmed-9583359
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Wolters Kluwer - Medknow
record_format MEDLINE/PubMed
spelling pubmed-95833592022-10-21 Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review Bai, Amelia Carty, Christopher Dai, Shuan Saudi J Ophthalmol Original Article PURPOSE: Artificial intelligence (AI) offers considerable promise for retinopathy of prematurity (ROP) screening and diagnosis. The development of deep-learning algorithms to detect the presence of disease may contribute to sufficient screening, early detection, and timely treatment for this preventable blinding disease. This review aimed to systematically examine the literature in AI algorithms in detecting ROP. Specifically, we focused on the performance of deep-learning algorithms through sensitivity, specificity, and area under the receiver operating curve (AUROC) for both the detection and grade of ROP. METHODS: We searched Medline OVID, PubMed, Web of Science, and Embase for studies published from January 1, 2012, to September 20, 2021. Studies evaluating the diagnostic performance of deep-learning models based on retinal fundus images with expert ophthalmologists' judgment as reference standard were included. Studies which did not investigate the presence or absence of disease were excluded. Risk of bias was assessed using the QUADAS-2 tool. RESULTS: Twelve studies out of the 175 studies identified were included. Five studies measured the performance of detecting the presence of ROP and seven studies determined the presence of plus disease. The average AUROC out of 11 studies was 0.98. The average sensitivity and specificity for detecting ROP was 95.72% and 98.15%, respectively, and for detecting plus disease was 91.13% and 95.92%, respectively. CONCLUSION: The diagnostic performance of deep-learning algorithms in published studies was high. Few studies presented externally validated results or compared performance to expert human graders. Large scale prospective validation alongside robust study design could improve future studies. Wolters Kluwer - Medknow 2022-10-14 /pmc/articles/PMC9583359/ /pubmed/36276252 http://dx.doi.org/10.4103/sjopt.sjopt_219_21 Text en Copyright: © 2022 Saudi Journal of Ophthalmology https://creativecommons.org/licenses/by-nc-sa/4.0/This is an open access journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as appropriate credit is given and the new creations are licensed under the identical terms.
spellingShingle Original Article
Bai, Amelia
Carty, Christopher
Dai, Shuan
Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review
title Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review
title_full Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review
title_fullStr Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review
title_full_unstemmed Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review
title_short Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review
title_sort performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: a systematic review
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9583359/
https://www.ncbi.nlm.nih.gov/pubmed/36276252
http://dx.doi.org/10.4103/sjopt.sjopt_219_21
work_keys_str_mv AT baiamelia performanceofdeeplearningartificialintelligencealgorithmsindetectingretinopathyofprematurityasystematicreview
AT cartychristopher performanceofdeeplearningartificialintelligencealgorithmsindetectingretinopathyofprematurityasystematicreview
AT daishuan performanceofdeeplearningartificialintelligencealgorithmsindetectingretinopathyofprematurityasystematicreview