Cargando…

Towards population-independent, multi-disease detection in fundus photographs

Independent validation studies of automatic diabetic retinopathy screening systems have recently shown a drop of screening performance on external data. Beyond diabetic retinopathy, this study investigates the generalizability of deep learning (DL) algorithms for screening various ocular anomalies i...

Descripción completa

Detalles Bibliográficos
Autores principales: Matta, Sarah, Lamard, Mathieu, Conze, Pierre-Henri, Le Guilcher, Alexandre, Lecat, Clément, Carette, Romuald, Basset, Fabien, Massin, Pascale, Rottier, Jean-Bernard, Cochener, Béatrice, Quellec, Gwenolé
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10352300/
https://www.ncbi.nlm.nih.gov/pubmed/37460629
http://dx.doi.org/10.1038/s41598-023-38610-y
_version_ 1785074482907447296
author Matta, Sarah
Lamard, Mathieu
Conze, Pierre-Henri
Le Guilcher, Alexandre
Lecat, Clément
Carette, Romuald
Basset, Fabien
Massin, Pascale
Rottier, Jean-Bernard
Cochener, Béatrice
Quellec, Gwenolé
author_facet Matta, Sarah
Lamard, Mathieu
Conze, Pierre-Henri
Le Guilcher, Alexandre
Lecat, Clément
Carette, Romuald
Basset, Fabien
Massin, Pascale
Rottier, Jean-Bernard
Cochener, Béatrice
Quellec, Gwenolé
author_sort Matta, Sarah
collection PubMed
description Independent validation studies of automatic diabetic retinopathy screening systems have recently shown a drop of screening performance on external data. Beyond diabetic retinopathy, this study investigates the generalizability of deep learning (DL) algorithms for screening various ocular anomalies in fundus photographs, across heterogeneous populations and imaging protocols. The following datasets are considered: OPHDIAT (France, diabetic population), OphtaMaine (France, general population), RIADD (India, general population) and ODIR (China, general population). Two multi-disease DL algorithms were developed: a Single-Dataset (SD) network, trained on the largest dataset (OPHDIAT), and a Multiple-Dataset (MD) network, trained on multiple datasets simultaneously. To assess their generalizability, both algorithms were evaluated whenever training and test data originate from overlapping datasets or from disjoint datasets. The SD network achieved a mean per-disease area under the receiver operating characteristic curve (mAUC) of 0.9571 on OPHDIAT. However, it generalized poorly to the other three datasets (mAUC < 0.9). When all four datasets were involved in training, the MD network significantly outperformed the SD network (p = 0.0058), indicating improved generality. However, in leave-one-dataset-out experiments, performance of the MD network was significantly lower on populations unseen during training than on populations involved in training (p < 0.0001), indicating imperfect generalizability.
format Online
Article
Text
id pubmed-10352300
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-103523002023-07-19 Towards population-independent, multi-disease detection in fundus photographs Matta, Sarah Lamard, Mathieu Conze, Pierre-Henri Le Guilcher, Alexandre Lecat, Clément Carette, Romuald Basset, Fabien Massin, Pascale Rottier, Jean-Bernard Cochener, Béatrice Quellec, Gwenolé Sci Rep Article Independent validation studies of automatic diabetic retinopathy screening systems have recently shown a drop of screening performance on external data. Beyond diabetic retinopathy, this study investigates the generalizability of deep learning (DL) algorithms for screening various ocular anomalies in fundus photographs, across heterogeneous populations and imaging protocols. The following datasets are considered: OPHDIAT (France, diabetic population), OphtaMaine (France, general population), RIADD (India, general population) and ODIR (China, general population). Two multi-disease DL algorithms were developed: a Single-Dataset (SD) network, trained on the largest dataset (OPHDIAT), and a Multiple-Dataset (MD) network, trained on multiple datasets simultaneously. To assess their generalizability, both algorithms were evaluated whenever training and test data originate from overlapping datasets or from disjoint datasets. The SD network achieved a mean per-disease area under the receiver operating characteristic curve (mAUC) of 0.9571 on OPHDIAT. However, it generalized poorly to the other three datasets (mAUC < 0.9). When all four datasets were involved in training, the MD network significantly outperformed the SD network (p = 0.0058), indicating improved generality. However, in leave-one-dataset-out experiments, performance of the MD network was significantly lower on populations unseen during training than on populations involved in training (p < 0.0001), indicating imperfect generalizability. Nature Publishing Group UK 2023-07-17 /pmc/articles/PMC10352300/ /pubmed/37460629 http://dx.doi.org/10.1038/s41598-023-38610-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Matta, Sarah
Lamard, Mathieu
Conze, Pierre-Henri
Le Guilcher, Alexandre
Lecat, Clément
Carette, Romuald
Basset, Fabien
Massin, Pascale
Rottier, Jean-Bernard
Cochener, Béatrice
Quellec, Gwenolé
Towards population-independent, multi-disease detection in fundus photographs
title Towards population-independent, multi-disease detection in fundus photographs
title_full Towards population-independent, multi-disease detection in fundus photographs
title_fullStr Towards population-independent, multi-disease detection in fundus photographs
title_full_unstemmed Towards population-independent, multi-disease detection in fundus photographs
title_short Towards population-independent, multi-disease detection in fundus photographs
title_sort towards population-independent, multi-disease detection in fundus photographs
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10352300/
https://www.ncbi.nlm.nih.gov/pubmed/37460629
http://dx.doi.org/10.1038/s41598-023-38610-y
work_keys_str_mv AT mattasarah towardspopulationindependentmultidiseasedetectioninfundusphotographs
AT lamardmathieu towardspopulationindependentmultidiseasedetectioninfundusphotographs
AT conzepierrehenri towardspopulationindependentmultidiseasedetectioninfundusphotographs
AT leguilcheralexandre towardspopulationindependentmultidiseasedetectioninfundusphotographs
AT lecatclement towardspopulationindependentmultidiseasedetectioninfundusphotographs
AT caretteromuald towardspopulationindependentmultidiseasedetectioninfundusphotographs
AT bassetfabien towardspopulationindependentmultidiseasedetectioninfundusphotographs
AT massinpascale towardspopulationindependentmultidiseasedetectioninfundusphotographs
AT rottierjeanbernard towardspopulationindependentmultidiseasedetectioninfundusphotographs
AT cochenerbeatrice towardspopulationindependentmultidiseasedetectioninfundusphotographs
AT quellecgwenole towardspopulationindependentmultidiseasedetectioninfundusphotographs