Cargando…
Towards population-independent, multi-disease detection in fundus photographs
Independent validation studies of automatic diabetic retinopathy screening systems have recently shown a drop of screening performance on external data. Beyond diabetic retinopathy, this study investigates the generalizability of deep learning (DL) algorithms for screening various ocular anomalies i...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10352300/ https://www.ncbi.nlm.nih.gov/pubmed/37460629 http://dx.doi.org/10.1038/s41598-023-38610-y |
_version_ | 1785074482907447296 |
---|---|
author | Matta, Sarah Lamard, Mathieu Conze, Pierre-Henri Le Guilcher, Alexandre Lecat, Clément Carette, Romuald Basset, Fabien Massin, Pascale Rottier, Jean-Bernard Cochener, Béatrice Quellec, Gwenolé |
author_facet | Matta, Sarah Lamard, Mathieu Conze, Pierre-Henri Le Guilcher, Alexandre Lecat, Clément Carette, Romuald Basset, Fabien Massin, Pascale Rottier, Jean-Bernard Cochener, Béatrice Quellec, Gwenolé |
author_sort | Matta, Sarah |
collection | PubMed |
description | Independent validation studies of automatic diabetic retinopathy screening systems have recently shown a drop of screening performance on external data. Beyond diabetic retinopathy, this study investigates the generalizability of deep learning (DL) algorithms for screening various ocular anomalies in fundus photographs, across heterogeneous populations and imaging protocols. The following datasets are considered: OPHDIAT (France, diabetic population), OphtaMaine (France, general population), RIADD (India, general population) and ODIR (China, general population). Two multi-disease DL algorithms were developed: a Single-Dataset (SD) network, trained on the largest dataset (OPHDIAT), and a Multiple-Dataset (MD) network, trained on multiple datasets simultaneously. To assess their generalizability, both algorithms were evaluated whenever training and test data originate from overlapping datasets or from disjoint datasets. The SD network achieved a mean per-disease area under the receiver operating characteristic curve (mAUC) of 0.9571 on OPHDIAT. However, it generalized poorly to the other three datasets (mAUC < 0.9). When all four datasets were involved in training, the MD network significantly outperformed the SD network (p = 0.0058), indicating improved generality. However, in leave-one-dataset-out experiments, performance of the MD network was significantly lower on populations unseen during training than on populations involved in training (p < 0.0001), indicating imperfect generalizability. |
format | Online Article Text |
id | pubmed-10352300 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-103523002023-07-19 Towards population-independent, multi-disease detection in fundus photographs Matta, Sarah Lamard, Mathieu Conze, Pierre-Henri Le Guilcher, Alexandre Lecat, Clément Carette, Romuald Basset, Fabien Massin, Pascale Rottier, Jean-Bernard Cochener, Béatrice Quellec, Gwenolé Sci Rep Article Independent validation studies of automatic diabetic retinopathy screening systems have recently shown a drop of screening performance on external data. Beyond diabetic retinopathy, this study investigates the generalizability of deep learning (DL) algorithms for screening various ocular anomalies in fundus photographs, across heterogeneous populations and imaging protocols. The following datasets are considered: OPHDIAT (France, diabetic population), OphtaMaine (France, general population), RIADD (India, general population) and ODIR (China, general population). Two multi-disease DL algorithms were developed: a Single-Dataset (SD) network, trained on the largest dataset (OPHDIAT), and a Multiple-Dataset (MD) network, trained on multiple datasets simultaneously. To assess their generalizability, both algorithms were evaluated whenever training and test data originate from overlapping datasets or from disjoint datasets. The SD network achieved a mean per-disease area under the receiver operating characteristic curve (mAUC) of 0.9571 on OPHDIAT. However, it generalized poorly to the other three datasets (mAUC < 0.9). When all four datasets were involved in training, the MD network significantly outperformed the SD network (p = 0.0058), indicating improved generality. However, in leave-one-dataset-out experiments, performance of the MD network was significantly lower on populations unseen during training than on populations involved in training (p < 0.0001), indicating imperfect generalizability. Nature Publishing Group UK 2023-07-17 /pmc/articles/PMC10352300/ /pubmed/37460629 http://dx.doi.org/10.1038/s41598-023-38610-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Matta, Sarah Lamard, Mathieu Conze, Pierre-Henri Le Guilcher, Alexandre Lecat, Clément Carette, Romuald Basset, Fabien Massin, Pascale Rottier, Jean-Bernard Cochener, Béatrice Quellec, Gwenolé Towards population-independent, multi-disease detection in fundus photographs |
title | Towards population-independent, multi-disease detection in fundus photographs |
title_full | Towards population-independent, multi-disease detection in fundus photographs |
title_fullStr | Towards population-independent, multi-disease detection in fundus photographs |
title_full_unstemmed | Towards population-independent, multi-disease detection in fundus photographs |
title_short | Towards population-independent, multi-disease detection in fundus photographs |
title_sort | towards population-independent, multi-disease detection in fundus photographs |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10352300/ https://www.ncbi.nlm.nih.gov/pubmed/37460629 http://dx.doi.org/10.1038/s41598-023-38610-y |
work_keys_str_mv | AT mattasarah towardspopulationindependentmultidiseasedetectioninfundusphotographs AT lamardmathieu towardspopulationindependentmultidiseasedetectioninfundusphotographs AT conzepierrehenri towardspopulationindependentmultidiseasedetectioninfundusphotographs AT leguilcheralexandre towardspopulationindependentmultidiseasedetectioninfundusphotographs AT lecatclement towardspopulationindependentmultidiseasedetectioninfundusphotographs AT caretteromuald towardspopulationindependentmultidiseasedetectioninfundusphotographs AT bassetfabien towardspopulationindependentmultidiseasedetectioninfundusphotographs AT massinpascale towardspopulationindependentmultidiseasedetectioninfundusphotographs AT rottierjeanbernard towardspopulationindependentmultidiseasedetectioninfundusphotographs AT cochenerbeatrice towardspopulationindependentmultidiseasedetectioninfundusphotographs AT quellecgwenole towardspopulationindependentmultidiseasedetectioninfundusphotographs |