Cargando…
Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review
PURPOSE: Artificial intelligence (AI) offers considerable promise for retinopathy of prematurity (ROP) screening and diagnosis. The development of deep-learning algorithms to detect the presence of disease may contribute to sufficient screening, early detection, and timely treatment for this prevent...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Wolters Kluwer - Medknow
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9583359/ https://www.ncbi.nlm.nih.gov/pubmed/36276252 http://dx.doi.org/10.4103/sjopt.sjopt_219_21 |
_version_ | 1784813056015990784 |
---|---|
author | Bai, Amelia Carty, Christopher Dai, Shuan |
author_facet | Bai, Amelia Carty, Christopher Dai, Shuan |
author_sort | Bai, Amelia |
collection | PubMed |
description | PURPOSE: Artificial intelligence (AI) offers considerable promise for retinopathy of prematurity (ROP) screening and diagnosis. The development of deep-learning algorithms to detect the presence of disease may contribute to sufficient screening, early detection, and timely treatment for this preventable blinding disease. This review aimed to systematically examine the literature in AI algorithms in detecting ROP. Specifically, we focused on the performance of deep-learning algorithms through sensitivity, specificity, and area under the receiver operating curve (AUROC) for both the detection and grade of ROP. METHODS: We searched Medline OVID, PubMed, Web of Science, and Embase for studies published from January 1, 2012, to September 20, 2021. Studies evaluating the diagnostic performance of deep-learning models based on retinal fundus images with expert ophthalmologists' judgment as reference standard were included. Studies which did not investigate the presence or absence of disease were excluded. Risk of bias was assessed using the QUADAS-2 tool. RESULTS: Twelve studies out of the 175 studies identified were included. Five studies measured the performance of detecting the presence of ROP and seven studies determined the presence of plus disease. The average AUROC out of 11 studies was 0.98. The average sensitivity and specificity for detecting ROP was 95.72% and 98.15%, respectively, and for detecting plus disease was 91.13% and 95.92%, respectively. CONCLUSION: The diagnostic performance of deep-learning algorithms in published studies was high. Few studies presented externally validated results or compared performance to expert human graders. Large scale prospective validation alongside robust study design could improve future studies. |
format | Online Article Text |
id | pubmed-9583359 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Wolters Kluwer - Medknow |
record_format | MEDLINE/PubMed |
spelling | pubmed-95833592022-10-21 Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review Bai, Amelia Carty, Christopher Dai, Shuan Saudi J Ophthalmol Original Article PURPOSE: Artificial intelligence (AI) offers considerable promise for retinopathy of prematurity (ROP) screening and diagnosis. The development of deep-learning algorithms to detect the presence of disease may contribute to sufficient screening, early detection, and timely treatment for this preventable blinding disease. This review aimed to systematically examine the literature in AI algorithms in detecting ROP. Specifically, we focused on the performance of deep-learning algorithms through sensitivity, specificity, and area under the receiver operating curve (AUROC) for both the detection and grade of ROP. METHODS: We searched Medline OVID, PubMed, Web of Science, and Embase for studies published from January 1, 2012, to September 20, 2021. Studies evaluating the diagnostic performance of deep-learning models based on retinal fundus images with expert ophthalmologists' judgment as reference standard were included. Studies which did not investigate the presence or absence of disease were excluded. Risk of bias was assessed using the QUADAS-2 tool. RESULTS: Twelve studies out of the 175 studies identified were included. Five studies measured the performance of detecting the presence of ROP and seven studies determined the presence of plus disease. The average AUROC out of 11 studies was 0.98. The average sensitivity and specificity for detecting ROP was 95.72% and 98.15%, respectively, and for detecting plus disease was 91.13% and 95.92%, respectively. CONCLUSION: The diagnostic performance of deep-learning algorithms in published studies was high. Few studies presented externally validated results or compared performance to expert human graders. Large scale prospective validation alongside robust study design could improve future studies. Wolters Kluwer - Medknow 2022-10-14 /pmc/articles/PMC9583359/ /pubmed/36276252 http://dx.doi.org/10.4103/sjopt.sjopt_219_21 Text en Copyright: © 2022 Saudi Journal of Ophthalmology https://creativecommons.org/licenses/by-nc-sa/4.0/This is an open access journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as appropriate credit is given and the new creations are licensed under the identical terms. |
spellingShingle | Original Article Bai, Amelia Carty, Christopher Dai, Shuan Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review |
title | Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review |
title_full | Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review |
title_fullStr | Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review |
title_full_unstemmed | Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review |
title_short | Performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: A systematic review |
title_sort | performance of deep-learning artificial intelligence algorithms in detecting retinopathy of prematurity: a systematic review |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9583359/ https://www.ncbi.nlm.nih.gov/pubmed/36276252 http://dx.doi.org/10.4103/sjopt.sjopt_219_21 |
work_keys_str_mv | AT baiamelia performanceofdeeplearningartificialintelligencealgorithmsindetectingretinopathyofprematurityasystematicreview AT cartychristopher performanceofdeeplearningartificialintelligencealgorithmsindetectingretinopathyofprematurityasystematicreview AT daishuan performanceofdeeplearningartificialintelligencealgorithmsindetectingretinopathyofprematurityasystematicreview |