Cargando…

UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network

BACKGROUND: Methods based on the combination of transformer and convolutional neural networks (CNNs) have achieved impressive results in the field of medical image segmentation. However, most of the recently proposed combination segmentation approaches simply treat transformers as auxiliary modules...

Descripción completa

Detalles Bibliográficos
Autores principales:	Fang, Kun, He, Baochun, Liu, Libo, Hu, Haoyu, Fang, Chihua, Huang, Xuguang, Jia, Fucang
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	AME Publishing Company 2023
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10006157/ https://www.ncbi.nlm.nih.gov/pubmed/36915332 http://dx.doi.org/10.21037/qims-22-544

_version_	1784905251955933184
author	Fang, Kun He, Baochun Liu, Libo Hu, Haoyu Fang, Chihua Huang, Xuguang Jia, Fucang
author_facet	Fang, Kun He, Baochun Liu, Libo Hu, Haoyu Fang, Chihua Huang, Xuguang Jia, Fucang
author_sort	Fang, Kun
collection	PubMed
description	BACKGROUND: Methods based on the combination of transformer and convolutional neural networks (CNNs) have achieved impressive results in the field of medical image segmentation. However, most of the recently proposed combination segmentation approaches simply treat transformers as auxiliary modules which help to extract long-range information and encode global context into convolutional representations, and there is a lack of investigation on how to optimally combine self-attention with convolution. METHODS: We designed a novel transformer block (MRFormer) that combines a multi-head self-attention layer and a residual depthwise convolutional block as the basic unit to deeply integrate both long-range and local spatial information. The MRFormer block was embedded between the encoder and decoder in U-Net at the last two layers. This framework (UMRFormer-Net) was applied to the segmentation of three-dimensional (3D) pancreas, and its ability to effectively capture the characteristic contextual information of the pancreas and surrounding tissues was investigated. RESULTS: Experimental results show that the proposed UMRFormer-Net achieved accuracy in pancreas segmentation that was comparable or superior to that of existing state-of-the-art 3D methods in both the Clinical Proteomic Tumor Analysis Consortium Pancreatic Ductal Adenocarcinoma (CPTAC-PDA) dataset and the public Medical Segmentation Decathlon dataset (self-division). UMRFormer-Net statistically significantly outperformed existing transformer-related methods and state-of-the-art 3D methods (P<0.05, P<0.01, or P<0.001), with a higher Dice coefficient (85.54% and 77.36%, respectively) or a lower 95% Hausdorff distance (4.05 and 8.34 mm, respectively). CONCLUSIONS: UMRFormer-Net can obtain more matched and accurate segmentation boundary and region information in pancreas segmentation, thus improving the accuracy of pancreas segmentation. The code is available at https://github.com/supersunshinefk/UMRFormer-Net.
format	Online Article Text
id	pubmed-10006157
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	AME Publishing Company
record_format	MEDLINE/PubMed
spelling	pubmed-100061572023-03-12 UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network Fang, Kun He, Baochun Liu, Libo Hu, Haoyu Fang, Chihua Huang, Xuguang Jia, Fucang Quant Imaging Med Surg Original Article BACKGROUND: Methods based on the combination of transformer and convolutional neural networks (CNNs) have achieved impressive results in the field of medical image segmentation. However, most of the recently proposed combination segmentation approaches simply treat transformers as auxiliary modules which help to extract long-range information and encode global context into convolutional representations, and there is a lack of investigation on how to optimally combine self-attention with convolution. METHODS: We designed a novel transformer block (MRFormer) that combines a multi-head self-attention layer and a residual depthwise convolutional block as the basic unit to deeply integrate both long-range and local spatial information. The MRFormer block was embedded between the encoder and decoder in U-Net at the last two layers. This framework (UMRFormer-Net) was applied to the segmentation of three-dimensional (3D) pancreas, and its ability to effectively capture the characteristic contextual information of the pancreas and surrounding tissues was investigated. RESULTS: Experimental results show that the proposed UMRFormer-Net achieved accuracy in pancreas segmentation that was comparable or superior to that of existing state-of-the-art 3D methods in both the Clinical Proteomic Tumor Analysis Consortium Pancreatic Ductal Adenocarcinoma (CPTAC-PDA) dataset and the public Medical Segmentation Decathlon dataset (self-division). UMRFormer-Net statistically significantly outperformed existing transformer-related methods and state-of-the-art 3D methods (P<0.05, P<0.01, or P<0.001), with a higher Dice coefficient (85.54% and 77.36%, respectively) or a lower 95% Hausdorff distance (4.05 and 8.34 mm, respectively). CONCLUSIONS: UMRFormer-Net can obtain more matched and accurate segmentation boundary and region information in pancreas segmentation, thus improving the accuracy of pancreas segmentation. The code is available at https://github.com/supersunshinefk/UMRFormer-Net. AME Publishing Company 2023-02-10 2023-03-01 /pmc/articles/PMC10006157/ /pubmed/36915332 http://dx.doi.org/10.21037/qims-22-544 Text en 2023 Quantitative Imaging in Medicine and Surgery. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle	Original Article Fang, Kun He, Baochun Liu, Libo Hu, Haoyu Fang, Chihua Huang, Xuguang Jia, Fucang UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
title	UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
title_full	UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
title_fullStr	UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
title_full_unstemmed	UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
title_short	UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
title_sort	umrformer-net: a three-dimensional u-shaped pancreas segmentation method based on a double-layer bridged transformer network
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10006157/ https://www.ncbi.nlm.nih.gov/pubmed/36915332 http://dx.doi.org/10.21037/qims-22-544
work_keys_str_mv	AT fangkun umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork AT hebaochun umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork AT liulibo umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork AT huhaoyu umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork AT fangchihua umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork AT huangxuguang umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork AT jiafucang umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork

UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network

Ejemplares similares