Cargando…
An interpretable transformer network for the retinal disease classification using optical coherence tomography
Retinal illnesses such as age-related macular degeneration and diabetic macular edema will lead to irreversible blindness. With optical coherence tomography (OCT), doctors are able to see cross-sections of the retinal layers and provide patients with a diagnosis. Manual reading of OCT images is time...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9984386/ https://www.ncbi.nlm.nih.gov/pubmed/36869160 http://dx.doi.org/10.1038/s41598-023-30853-z |
_version_ | 1784900735450742784 |
---|---|
author | He, Jingzhen Wang, Junxia Han, Zeyu Ma, Jun Wang, Chongjing Qi, Meng |
author_facet | He, Jingzhen Wang, Junxia Han, Zeyu Ma, Jun Wang, Chongjing Qi, Meng |
author_sort | He, Jingzhen |
collection | PubMed |
description | Retinal illnesses such as age-related macular degeneration and diabetic macular edema will lead to irreversible blindness. With optical coherence tomography (OCT), doctors are able to see cross-sections of the retinal layers and provide patients with a diagnosis. Manual reading of OCT images is time-consuming, labor-intensive and even error-prone. Computer-aided diagnosis algorithms improve efficiency by automatically analyzing and diagnosing retinal OCT images. However, the accuracy and interpretability of these algorithms can be further improved through effective feature extraction, loss optimization and visualization analysis. In this paper, we propose an interpretable Swin-Poly Transformer network for performing automatically retinal OCT image classification. By shifting the window partition, the Swin-Poly Transformer constructs connections between neighboring non-overlapping windows in the previous layer and thus has the flexibility to model multi-scale features. Besides, the Swin-Poly Transformer modifies the importance of polynomial bases to refine cross entropy for better retinal OCT image classification. In addition, the proposed method also provides confidence score maps, assisting medical practitioners to understand the models’ decision-making process. Experiments in OCT2017 and OCT-C8 reveal that the proposed method outperforms both the convolutional neural network approach and ViT, with an accuracy of 99.80% and an AUC of 99.99%. |
format | Online Article Text |
id | pubmed-9984386 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-99843862023-03-05 An interpretable transformer network for the retinal disease classification using optical coherence tomography He, Jingzhen Wang, Junxia Han, Zeyu Ma, Jun Wang, Chongjing Qi, Meng Sci Rep Article Retinal illnesses such as age-related macular degeneration and diabetic macular edema will lead to irreversible blindness. With optical coherence tomography (OCT), doctors are able to see cross-sections of the retinal layers and provide patients with a diagnosis. Manual reading of OCT images is time-consuming, labor-intensive and even error-prone. Computer-aided diagnosis algorithms improve efficiency by automatically analyzing and diagnosing retinal OCT images. However, the accuracy and interpretability of these algorithms can be further improved through effective feature extraction, loss optimization and visualization analysis. In this paper, we propose an interpretable Swin-Poly Transformer network for performing automatically retinal OCT image classification. By shifting the window partition, the Swin-Poly Transformer constructs connections between neighboring non-overlapping windows in the previous layer and thus has the flexibility to model multi-scale features. Besides, the Swin-Poly Transformer modifies the importance of polynomial bases to refine cross entropy for better retinal OCT image classification. In addition, the proposed method also provides confidence score maps, assisting medical practitioners to understand the models’ decision-making process. Experiments in OCT2017 and OCT-C8 reveal that the proposed method outperforms both the convolutional neural network approach and ViT, with an accuracy of 99.80% and an AUC of 99.99%. Nature Publishing Group UK 2023-03-03 /pmc/articles/PMC9984386/ /pubmed/36869160 http://dx.doi.org/10.1038/s41598-023-30853-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article He, Jingzhen Wang, Junxia Han, Zeyu Ma, Jun Wang, Chongjing Qi, Meng An interpretable transformer network for the retinal disease classification using optical coherence tomography |
title | An interpretable transformer network for the retinal disease classification using optical coherence tomography |
title_full | An interpretable transformer network for the retinal disease classification using optical coherence tomography |
title_fullStr | An interpretable transformer network for the retinal disease classification using optical coherence tomography |
title_full_unstemmed | An interpretable transformer network for the retinal disease classification using optical coherence tomography |
title_short | An interpretable transformer network for the retinal disease classification using optical coherence tomography |
title_sort | interpretable transformer network for the retinal disease classification using optical coherence tomography |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9984386/ https://www.ncbi.nlm.nih.gov/pubmed/36869160 http://dx.doi.org/10.1038/s41598-023-30853-z |
work_keys_str_mv | AT hejingzhen aninterpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography AT wangjunxia aninterpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography AT hanzeyu aninterpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography AT majun aninterpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography AT wangchongjing aninterpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography AT qimeng aninterpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography AT hejingzhen interpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography AT wangjunxia interpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography AT hanzeyu interpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography AT majun interpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography AT wangchongjing interpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography AT qimeng interpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography |