Cargando…

Vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy

Diabetic Retinopathy (DR) is a major cause of blindness worldwide. Early detection and treatment are crucial to prevent vision loss, making accurate and timely diagnosis critical. Deep learning technology has shown promise in the automated diagnosis of DR, and in particular, multi-lesion segmentatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Zijian, Lu, Haimei, Yan, Haixin, Kan, Hongxing, Jin, Li
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10333307/
https://www.ncbi.nlm.nih.gov/pubmed/37429966
http://dx.doi.org/10.1038/s41598-023-38320-5
_version_ 1785070619396669440
author Wang, Zijian
Lu, Haimei
Yan, Haixin
Kan, Hongxing
Jin, Li
author_facet Wang, Zijian
Lu, Haimei
Yan, Haixin
Kan, Hongxing
Jin, Li
author_sort Wang, Zijian
collection PubMed
description Diabetic Retinopathy (DR) is a major cause of blindness worldwide. Early detection and treatment are crucial to prevent vision loss, making accurate and timely diagnosis critical. Deep learning technology has shown promise in the automated diagnosis of DR, and in particular, multi-lesion segmentation tasks. In this paper, we propose a novel Transformer-based model for DR segmentation that incorporates hyperbolic embeddings and a spatial prior module. The proposed model is primarily built on a traditional Vision Transformer encoder and further enhanced by incorporating a spatial prior module for image convolution and feature continuity, followed by feature interaction processing using the spatial feature injector and extractor. Hyperbolic embeddings are used to classify feature matrices from the model at the pixel level. We evaluated the proposed model’s performance on the publicly available datasets and compared it with other widely used DR segmentation models. The results show that our model outperforms these widely used DR segmentation models. The incorporation of hyperbolic embeddings and a spatial prior module into the Vision Transformer-based model significantly improves the accuracy of DR segmentation. The hyperbolic embeddings enable us to better capture the underlying geometric structure of the feature matrices, which is important for accurate segmentation. The spatial prior module improves the continuity of the features and helps to better distinguish between lesions and normal tissues. Overall, our proposed model has potential for clinical use in automated DR diagnosis, improving accuracy and speed of diagnosis. Our study shows that the integration of hyperbolic embeddings and a spatial prior module with a Vision Transformer-based model improves the performance of DR segmentation models. Future research can explore the application of our model to other medical imaging tasks, as well as further optimization and validation in real-world clinical settings.
format Online
Article
Text
id pubmed-10333307
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-103333072023-07-12 Vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy Wang, Zijian Lu, Haimei Yan, Haixin Kan, Hongxing Jin, Li Sci Rep Article Diabetic Retinopathy (DR) is a major cause of blindness worldwide. Early detection and treatment are crucial to prevent vision loss, making accurate and timely diagnosis critical. Deep learning technology has shown promise in the automated diagnosis of DR, and in particular, multi-lesion segmentation tasks. In this paper, we propose a novel Transformer-based model for DR segmentation that incorporates hyperbolic embeddings and a spatial prior module. The proposed model is primarily built on a traditional Vision Transformer encoder and further enhanced by incorporating a spatial prior module for image convolution and feature continuity, followed by feature interaction processing using the spatial feature injector and extractor. Hyperbolic embeddings are used to classify feature matrices from the model at the pixel level. We evaluated the proposed model’s performance on the publicly available datasets and compared it with other widely used DR segmentation models. The results show that our model outperforms these widely used DR segmentation models. The incorporation of hyperbolic embeddings and a spatial prior module into the Vision Transformer-based model significantly improves the accuracy of DR segmentation. The hyperbolic embeddings enable us to better capture the underlying geometric structure of the feature matrices, which is important for accurate segmentation. The spatial prior module improves the continuity of the features and helps to better distinguish between lesions and normal tissues. Overall, our proposed model has potential for clinical use in automated DR diagnosis, improving accuracy and speed of diagnosis. Our study shows that the integration of hyperbolic embeddings and a spatial prior module with a Vision Transformer-based model improves the performance of DR segmentation models. Future research can explore the application of our model to other medical imaging tasks, as well as further optimization and validation in real-world clinical settings. Nature Publishing Group UK 2023-07-10 /pmc/articles/PMC10333307/ /pubmed/37429966 http://dx.doi.org/10.1038/s41598-023-38320-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Wang, Zijian
Lu, Haimei
Yan, Haixin
Kan, Hongxing
Jin, Li
Vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy
title Vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy
title_full Vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy
title_fullStr Vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy
title_full_unstemmed Vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy
title_short Vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy
title_sort vison transformer adapter-based hyperbolic embeddings for multi-lesion segmentation in diabetic retinopathy
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10333307/
https://www.ncbi.nlm.nih.gov/pubmed/37429966
http://dx.doi.org/10.1038/s41598-023-38320-5
work_keys_str_mv AT wangzijian visontransformeradapterbasedhyperbolicembeddingsformultilesionsegmentationindiabeticretinopathy
AT luhaimei visontransformeradapterbasedhyperbolicembeddingsformultilesionsegmentationindiabeticretinopathy
AT yanhaixin visontransformeradapterbasedhyperbolicembeddingsformultilesionsegmentationindiabeticretinopathy
AT kanhongxing visontransformeradapterbasedhyperbolicembeddingsformultilesionsegmentationindiabeticretinopathy
AT jinli visontransformeradapterbasedhyperbolicembeddingsformultilesionsegmentationindiabeticretinopathy