Cargando…

Fairness and generalisability in deep learning of retinopathy of prematurity screening algorithms: a literature review

BACKGROUND: Retinopathy of prematurity (ROP) is a vasoproliferative disease responsible for more than 30 000 blind children worldwide. Its diagnosis and treatment are challenging due to the lack of specialists, divergent diagnostic concordance and variation in classification standards. While artific...

Descripción completa

Detalles Bibliográficos
Autores principales: Nakayama, Luis Filipe, Mitchell, William Greig, Ribeiro, Lucas Zago, Dychiao, Robyn Gayle, Phanphruk, Warachaya, Celi, Leo Anthony, Kalua, Khumbo, Santiago, Alvina Pauline Dy, Regatieri, Caio Vinicius Saito, Moraes, Nilva Simeren Bueno
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10414056/
https://www.ncbi.nlm.nih.gov/pubmed/37558406
http://dx.doi.org/10.1136/bmjophth-2022-001216
_version_ 1785087263242190848
author Nakayama, Luis Filipe
Mitchell, William Greig
Ribeiro, Lucas Zago
Dychiao, Robyn Gayle
Phanphruk, Warachaya
Celi, Leo Anthony
Kalua, Khumbo
Santiago, Alvina Pauline Dy
Regatieri, Caio Vinicius Saito
Moraes, Nilva Simeren Bueno
author_facet Nakayama, Luis Filipe
Mitchell, William Greig
Ribeiro, Lucas Zago
Dychiao, Robyn Gayle
Phanphruk, Warachaya
Celi, Leo Anthony
Kalua, Khumbo
Santiago, Alvina Pauline Dy
Regatieri, Caio Vinicius Saito
Moraes, Nilva Simeren Bueno
author_sort Nakayama, Luis Filipe
collection PubMed
description BACKGROUND: Retinopathy of prematurity (ROP) is a vasoproliferative disease responsible for more than 30 000 blind children worldwide. Its diagnosis and treatment are challenging due to the lack of specialists, divergent diagnostic concordance and variation in classification standards. While artificial intelligence (AI) can address the shortage of professionals and provide more cost-effective management, its development needs fairness, generalisability and bias controls prior to deployment to avoid producing harmful unpredictable results. This review aims to compare AI and ROP study’s characteristics, fairness and generalisability efforts. METHODS: Our review yielded 220 articles, of which 18 were included after full-text assessment. The articles were classified into ROP severity grading, plus detection, detecting treatment requiring, ROP prediction and detection of retinal zones. RESULTS: All the article’s authors and included patients are from middle-income and high-income countries, with no low-income countries, South America, Australia and Africa Continents representation. Code is available in two articles and in one on request, while data are not available in any article. 88.9% of the studies use the same retinal camera. In two articles, patients’ sex was described, but none applied a bias control in their models. CONCLUSION: The reviewed articles included 180 228 images and reported good metrics, but fairness, generalisability and bias control remained limited. Reproducibility is also a critical limitation, with few articles sharing codes and none sharing data. Fair and generalisable ROP and AI studies are needed that include diverse datasets, data and code sharing, collaborative research, and bias control to avoid unpredictable and harmful deployments.
format Online
Article
Text
id pubmed-10414056
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-104140562023-08-11 Fairness and generalisability in deep learning of retinopathy of prematurity screening algorithms: a literature review Nakayama, Luis Filipe Mitchell, William Greig Ribeiro, Lucas Zago Dychiao, Robyn Gayle Phanphruk, Warachaya Celi, Leo Anthony Kalua, Khumbo Santiago, Alvina Pauline Dy Regatieri, Caio Vinicius Saito Moraes, Nilva Simeren Bueno BMJ Open Ophthalmol Systematic Review BACKGROUND: Retinopathy of prematurity (ROP) is a vasoproliferative disease responsible for more than 30 000 blind children worldwide. Its diagnosis and treatment are challenging due to the lack of specialists, divergent diagnostic concordance and variation in classification standards. While artificial intelligence (AI) can address the shortage of professionals and provide more cost-effective management, its development needs fairness, generalisability and bias controls prior to deployment to avoid producing harmful unpredictable results. This review aims to compare AI and ROP study’s characteristics, fairness and generalisability efforts. METHODS: Our review yielded 220 articles, of which 18 were included after full-text assessment. The articles were classified into ROP severity grading, plus detection, detecting treatment requiring, ROP prediction and detection of retinal zones. RESULTS: All the article’s authors and included patients are from middle-income and high-income countries, with no low-income countries, South America, Australia and Africa Continents representation. Code is available in two articles and in one on request, while data are not available in any article. 88.9% of the studies use the same retinal camera. In two articles, patients’ sex was described, but none applied a bias control in their models. CONCLUSION: The reviewed articles included 180 228 images and reported good metrics, but fairness, generalisability and bias control remained limited. Reproducibility is also a critical limitation, with few articles sharing codes and none sharing data. Fair and generalisable ROP and AI studies are needed that include diverse datasets, data and code sharing, collaborative research, and bias control to avoid unpredictable and harmful deployments. BMJ Publishing Group 2023-08-09 /pmc/articles/PMC10414056/ /pubmed/37558406 http://dx.doi.org/10.1136/bmjophth-2022-001216 Text en © Author(s) (or their employer(s)) 2023. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Systematic Review
Nakayama, Luis Filipe
Mitchell, William Greig
Ribeiro, Lucas Zago
Dychiao, Robyn Gayle
Phanphruk, Warachaya
Celi, Leo Anthony
Kalua, Khumbo
Santiago, Alvina Pauline Dy
Regatieri, Caio Vinicius Saito
Moraes, Nilva Simeren Bueno
Fairness and generalisability in deep learning of retinopathy of prematurity screening algorithms: a literature review
title Fairness and generalisability in deep learning of retinopathy of prematurity screening algorithms: a literature review
title_full Fairness and generalisability in deep learning of retinopathy of prematurity screening algorithms: a literature review
title_fullStr Fairness and generalisability in deep learning of retinopathy of prematurity screening algorithms: a literature review
title_full_unstemmed Fairness and generalisability in deep learning of retinopathy of prematurity screening algorithms: a literature review
title_short Fairness and generalisability in deep learning of retinopathy of prematurity screening algorithms: a literature review
title_sort fairness and generalisability in deep learning of retinopathy of prematurity screening algorithms: a literature review
topic Systematic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10414056/
https://www.ncbi.nlm.nih.gov/pubmed/37558406
http://dx.doi.org/10.1136/bmjophth-2022-001216
work_keys_str_mv AT nakayamaluisfilipe fairnessandgeneralisabilityindeeplearningofretinopathyofprematurityscreeningalgorithmsaliteraturereview
AT mitchellwilliamgreig fairnessandgeneralisabilityindeeplearningofretinopathyofprematurityscreeningalgorithmsaliteraturereview
AT ribeirolucaszago fairnessandgeneralisabilityindeeplearningofretinopathyofprematurityscreeningalgorithmsaliteraturereview
AT dychiaorobyngayle fairnessandgeneralisabilityindeeplearningofretinopathyofprematurityscreeningalgorithmsaliteraturereview
AT phanphrukwarachaya fairnessandgeneralisabilityindeeplearningofretinopathyofprematurityscreeningalgorithmsaliteraturereview
AT celileoanthony fairnessandgeneralisabilityindeeplearningofretinopathyofprematurityscreeningalgorithmsaliteraturereview
AT kaluakhumbo fairnessandgeneralisabilityindeeplearningofretinopathyofprematurityscreeningalgorithmsaliteraturereview
AT santiagoalvinapaulinedy fairnessandgeneralisabilityindeeplearningofretinopathyofprematurityscreeningalgorithmsaliteraturereview
AT regatiericaioviniciussaito fairnessandgeneralisabilityindeeplearningofretinopathyofprematurityscreeningalgorithmsaliteraturereview
AT moraesnilvasimerenbueno fairnessandgeneralisabilityindeeplearningofretinopathyofprematurityscreeningalgorithmsaliteraturereview