Cargando…

An interpretable transformer network for the retinal disease classification using optical coherence tomography

Retinal illnesses such as age-related macular degeneration and diabetic macular edema will lead to irreversible blindness. With optical coherence tomography (OCT), doctors are able to see cross-sections of the retinal layers and provide patients with a diagnosis. Manual reading of OCT images is time...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Jingzhen, Wang, Junxia, Han, Zeyu, Ma, Jun, Wang, Chongjing, Qi, Meng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9984386/
https://www.ncbi.nlm.nih.gov/pubmed/36869160
http://dx.doi.org/10.1038/s41598-023-30853-z
_version_ 1784900735450742784
author He, Jingzhen
Wang, Junxia
Han, Zeyu
Ma, Jun
Wang, Chongjing
Qi, Meng
author_facet He, Jingzhen
Wang, Junxia
Han, Zeyu
Ma, Jun
Wang, Chongjing
Qi, Meng
author_sort He, Jingzhen
collection PubMed
description Retinal illnesses such as age-related macular degeneration and diabetic macular edema will lead to irreversible blindness. With optical coherence tomography (OCT), doctors are able to see cross-sections of the retinal layers and provide patients with a diagnosis. Manual reading of OCT images is time-consuming, labor-intensive and even error-prone. Computer-aided diagnosis algorithms improve efficiency by automatically analyzing and diagnosing retinal OCT images. However, the accuracy and interpretability of these algorithms can be further improved through effective feature extraction, loss optimization and visualization analysis. In this paper, we propose an interpretable Swin-Poly Transformer network for performing automatically retinal OCT image classification. By shifting the window partition, the Swin-Poly Transformer constructs connections between neighboring non-overlapping windows in the previous layer and thus has the flexibility to model multi-scale features. Besides, the Swin-Poly Transformer modifies the importance of polynomial bases to refine cross entropy for better retinal OCT image classification. In addition, the proposed method also provides confidence score maps, assisting medical practitioners to understand the models’ decision-making process. Experiments in OCT2017 and OCT-C8 reveal that the proposed method outperforms both the convolutional neural network approach and ViT, with an accuracy of 99.80% and an AUC of 99.99%.
format Online
Article
Text
id pubmed-9984386
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-99843862023-03-05 An interpretable transformer network for the retinal disease classification using optical coherence tomography He, Jingzhen Wang, Junxia Han, Zeyu Ma, Jun Wang, Chongjing Qi, Meng Sci Rep Article Retinal illnesses such as age-related macular degeneration and diabetic macular edema will lead to irreversible blindness. With optical coherence tomography (OCT), doctors are able to see cross-sections of the retinal layers and provide patients with a diagnosis. Manual reading of OCT images is time-consuming, labor-intensive and even error-prone. Computer-aided diagnosis algorithms improve efficiency by automatically analyzing and diagnosing retinal OCT images. However, the accuracy and interpretability of these algorithms can be further improved through effective feature extraction, loss optimization and visualization analysis. In this paper, we propose an interpretable Swin-Poly Transformer network for performing automatically retinal OCT image classification. By shifting the window partition, the Swin-Poly Transformer constructs connections between neighboring non-overlapping windows in the previous layer and thus has the flexibility to model multi-scale features. Besides, the Swin-Poly Transformer modifies the importance of polynomial bases to refine cross entropy for better retinal OCT image classification. In addition, the proposed method also provides confidence score maps, assisting medical practitioners to understand the models’ decision-making process. Experiments in OCT2017 and OCT-C8 reveal that the proposed method outperforms both the convolutional neural network approach and ViT, with an accuracy of 99.80% and an AUC of 99.99%. Nature Publishing Group UK 2023-03-03 /pmc/articles/PMC9984386/ /pubmed/36869160 http://dx.doi.org/10.1038/s41598-023-30853-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
He, Jingzhen
Wang, Junxia
Han, Zeyu
Ma, Jun
Wang, Chongjing
Qi, Meng
An interpretable transformer network for the retinal disease classification using optical coherence tomography
title An interpretable transformer network for the retinal disease classification using optical coherence tomography
title_full An interpretable transformer network for the retinal disease classification using optical coherence tomography
title_fullStr An interpretable transformer network for the retinal disease classification using optical coherence tomography
title_full_unstemmed An interpretable transformer network for the retinal disease classification using optical coherence tomography
title_short An interpretable transformer network for the retinal disease classification using optical coherence tomography
title_sort interpretable transformer network for the retinal disease classification using optical coherence tomography
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9984386/
https://www.ncbi.nlm.nih.gov/pubmed/36869160
http://dx.doi.org/10.1038/s41598-023-30853-z
work_keys_str_mv AT hejingzhen aninterpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography
AT wangjunxia aninterpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography
AT hanzeyu aninterpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography
AT majun aninterpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography
AT wangchongjing aninterpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography
AT qimeng aninterpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography
AT hejingzhen interpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography
AT wangjunxia interpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography
AT hanzeyu interpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography
AT majun interpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography
AT wangchongjing interpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography
AT qimeng interpretabletransformernetworkfortheretinaldiseaseclassificationusingopticalcoherencetomography