Cargando…

PCAT-UNet: UNet-like network fused convolution and transformer for retinal vessel segmentation

The accurate segmentation of retinal vessels images can not only be used to evaluate and monitor various ophthalmic diseases, but also timely reflect systemic diseases such as diabetes and blood diseases. Therefore, the study on segmentation of retinal vessels images is of great significance for the...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Danny, Yang, Wenzhong, Wang, Liejun, Tan, Sixiang, Lin, Jiangzhaung, Bu, Wenxiu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8786152/
https://www.ncbi.nlm.nih.gov/pubmed/35073371
http://dx.doi.org/10.1371/journal.pone.0262689
_version_ 1784639055489662976
author Chen, Danny
Yang, Wenzhong
Wang, Liejun
Tan, Sixiang
Lin, Jiangzhaung
Bu, Wenxiu
author_facet Chen, Danny
Yang, Wenzhong
Wang, Liejun
Tan, Sixiang
Lin, Jiangzhaung
Bu, Wenxiu
author_sort Chen, Danny
collection PubMed
description The accurate segmentation of retinal vessels images can not only be used to evaluate and monitor various ophthalmic diseases, but also timely reflect systemic diseases such as diabetes and blood diseases. Therefore, the study on segmentation of retinal vessels images is of great significance for the diagnosis of visually threatening diseases. In recent years, especially the convolutional neural networks (CNN) based on UNet and its variant have been widely used in various medical image tasks. However, although CNN has achieved excellent performance, it cannot learn global and long-distance semantic information interaction well due to the local computing characteristics of convolution operation, which limits the development of medical image segmentation tasks. Transformer, currently popular in computer vision, has global computing features, but due to the lack of low-level details, local feature information extraction is insufficient. In this paper, we propose Patches Convolution Attention based Transformer UNet (PCAT-UNet), which is a U-shaped network based on Transformer with a Convolution branch. We use skip connection to fuse the deep and shallow features of both sides. By taking advantage of the complementary advantages of both sides, we can effectively capture the global dependence relationship and the details of the underlying feature space, thus improving the current problems of insufficient extraction of retinal micro vessels feature information and low sensitivity caused by easily predicting of pixels as background. In addition, our method enables end-to-end training and rapid inference. Finally, three publicly available retinal vessels datasets (DRIVE, STARE and CHASE_DB1) were used to evaluate PCAT-UNet. The experimental results show that the proposed PCAT-UNET method achieves good retinal vessel segmentation performance on these three datasets, and is superior to other architectures in terms of AUC, Accuracy and Sensitivity performance indicators. AUC reached 0.9872, 0.9953 and 0.9925, Accuracy reached 0.9622, 0.9796 and 0.9812, Sensitivity reached 0.8576, 0.8703 and 0.8493, respectively. In addition, PCAT-UNET also achieved good results in two other F1-Score and Specificity indicators.
format Online
Article
Text
id pubmed-8786152
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-87861522022-01-25 PCAT-UNet: UNet-like network fused convolution and transformer for retinal vessel segmentation Chen, Danny Yang, Wenzhong Wang, Liejun Tan, Sixiang Lin, Jiangzhaung Bu, Wenxiu PLoS One Research Article The accurate segmentation of retinal vessels images can not only be used to evaluate and monitor various ophthalmic diseases, but also timely reflect systemic diseases such as diabetes and blood diseases. Therefore, the study on segmentation of retinal vessels images is of great significance for the diagnosis of visually threatening diseases. In recent years, especially the convolutional neural networks (CNN) based on UNet and its variant have been widely used in various medical image tasks. However, although CNN has achieved excellent performance, it cannot learn global and long-distance semantic information interaction well due to the local computing characteristics of convolution operation, which limits the development of medical image segmentation tasks. Transformer, currently popular in computer vision, has global computing features, but due to the lack of low-level details, local feature information extraction is insufficient. In this paper, we propose Patches Convolution Attention based Transformer UNet (PCAT-UNet), which is a U-shaped network based on Transformer with a Convolution branch. We use skip connection to fuse the deep and shallow features of both sides. By taking advantage of the complementary advantages of both sides, we can effectively capture the global dependence relationship and the details of the underlying feature space, thus improving the current problems of insufficient extraction of retinal micro vessels feature information and low sensitivity caused by easily predicting of pixels as background. In addition, our method enables end-to-end training and rapid inference. Finally, three publicly available retinal vessels datasets (DRIVE, STARE and CHASE_DB1) were used to evaluate PCAT-UNet. The experimental results show that the proposed PCAT-UNET method achieves good retinal vessel segmentation performance on these three datasets, and is superior to other architectures in terms of AUC, Accuracy and Sensitivity performance indicators. AUC reached 0.9872, 0.9953 and 0.9925, Accuracy reached 0.9622, 0.9796 and 0.9812, Sensitivity reached 0.8576, 0.8703 and 0.8493, respectively. In addition, PCAT-UNET also achieved good results in two other F1-Score and Specificity indicators. Public Library of Science 2022-01-24 /pmc/articles/PMC8786152/ /pubmed/35073371 http://dx.doi.org/10.1371/journal.pone.0262689 Text en © 2022 Chen et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Chen, Danny
Yang, Wenzhong
Wang, Liejun
Tan, Sixiang
Lin, Jiangzhaung
Bu, Wenxiu
PCAT-UNet: UNet-like network fused convolution and transformer for retinal vessel segmentation
title PCAT-UNet: UNet-like network fused convolution and transformer for retinal vessel segmentation
title_full PCAT-UNet: UNet-like network fused convolution and transformer for retinal vessel segmentation
title_fullStr PCAT-UNet: UNet-like network fused convolution and transformer for retinal vessel segmentation
title_full_unstemmed PCAT-UNet: UNet-like network fused convolution and transformer for retinal vessel segmentation
title_short PCAT-UNet: UNet-like network fused convolution and transformer for retinal vessel segmentation
title_sort pcat-unet: unet-like network fused convolution and transformer for retinal vessel segmentation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8786152/
https://www.ncbi.nlm.nih.gov/pubmed/35073371
http://dx.doi.org/10.1371/journal.pone.0262689
work_keys_str_mv AT chendanny pcatunetunetlikenetworkfusedconvolutionandtransformerforretinalvesselsegmentation
AT yangwenzhong pcatunetunetlikenetworkfusedconvolutionandtransformerforretinalvesselsegmentation
AT wangliejun pcatunetunetlikenetworkfusedconvolutionandtransformerforretinalvesselsegmentation
AT tansixiang pcatunetunetlikenetworkfusedconvolutionandtransformerforretinalvesselsegmentation
AT linjiangzhaung pcatunetunetlikenetworkfusedconvolutionandtransformerforretinalvesselsegmentation
AT buwenxiu pcatunetunetlikenetworkfusedconvolutionandtransformerforretinalvesselsegmentation