Cargando…

UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network

BACKGROUND: Methods based on the combination of transformer and convolutional neural networks (CNNs) have achieved impressive results in the field of medical image segmentation. However, most of the recently proposed combination segmentation approaches simply treat transformers as auxiliary modules...

Descripción completa

Detalles Bibliográficos
Autores principales: Fang, Kun, He, Baochun, Liu, Libo, Hu, Haoyu, Fang, Chihua, Huang, Xuguang, Jia, Fucang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AME Publishing Company 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10006157/
https://www.ncbi.nlm.nih.gov/pubmed/36915332
http://dx.doi.org/10.21037/qims-22-544
_version_ 1784905251955933184
author Fang, Kun
He, Baochun
Liu, Libo
Hu, Haoyu
Fang, Chihua
Huang, Xuguang
Jia, Fucang
author_facet Fang, Kun
He, Baochun
Liu, Libo
Hu, Haoyu
Fang, Chihua
Huang, Xuguang
Jia, Fucang
author_sort Fang, Kun
collection PubMed
description BACKGROUND: Methods based on the combination of transformer and convolutional neural networks (CNNs) have achieved impressive results in the field of medical image segmentation. However, most of the recently proposed combination segmentation approaches simply treat transformers as auxiliary modules which help to extract long-range information and encode global context into convolutional representations, and there is a lack of investigation on how to optimally combine self-attention with convolution. METHODS: We designed a novel transformer block (MRFormer) that combines a multi-head self-attention layer and a residual depthwise convolutional block as the basic unit to deeply integrate both long-range and local spatial information. The MRFormer block was embedded between the encoder and decoder in U-Net at the last two layers. This framework (UMRFormer-Net) was applied to the segmentation of three-dimensional (3D) pancreas, and its ability to effectively capture the characteristic contextual information of the pancreas and surrounding tissues was investigated. RESULTS: Experimental results show that the proposed UMRFormer-Net achieved accuracy in pancreas segmentation that was comparable or superior to that of existing state-of-the-art 3D methods in both the Clinical Proteomic Tumor Analysis Consortium Pancreatic Ductal Adenocarcinoma (CPTAC-PDA) dataset and the public Medical Segmentation Decathlon dataset (self-division). UMRFormer-Net statistically significantly outperformed existing transformer-related methods and state-of-the-art 3D methods (P<0.05, P<0.01, or P<0.001), with a higher Dice coefficient (85.54% and 77.36%, respectively) or a lower 95% Hausdorff distance (4.05 and 8.34 mm, respectively). CONCLUSIONS: UMRFormer-Net can obtain more matched and accurate segmentation boundary and region information in pancreas segmentation, thus improving the accuracy of pancreas segmentation. The code is available at https://github.com/supersunshinefk/UMRFormer-Net.
format Online
Article
Text
id pubmed-10006157
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher AME Publishing Company
record_format MEDLINE/PubMed
spelling pubmed-100061572023-03-12 UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network Fang, Kun He, Baochun Liu, Libo Hu, Haoyu Fang, Chihua Huang, Xuguang Jia, Fucang Quant Imaging Med Surg Original Article BACKGROUND: Methods based on the combination of transformer and convolutional neural networks (CNNs) have achieved impressive results in the field of medical image segmentation. However, most of the recently proposed combination segmentation approaches simply treat transformers as auxiliary modules which help to extract long-range information and encode global context into convolutional representations, and there is a lack of investigation on how to optimally combine self-attention with convolution. METHODS: We designed a novel transformer block (MRFormer) that combines a multi-head self-attention layer and a residual depthwise convolutional block as the basic unit to deeply integrate both long-range and local spatial information. The MRFormer block was embedded between the encoder and decoder in U-Net at the last two layers. This framework (UMRFormer-Net) was applied to the segmentation of three-dimensional (3D) pancreas, and its ability to effectively capture the characteristic contextual information of the pancreas and surrounding tissues was investigated. RESULTS: Experimental results show that the proposed UMRFormer-Net achieved accuracy in pancreas segmentation that was comparable or superior to that of existing state-of-the-art 3D methods in both the Clinical Proteomic Tumor Analysis Consortium Pancreatic Ductal Adenocarcinoma (CPTAC-PDA) dataset and the public Medical Segmentation Decathlon dataset (self-division). UMRFormer-Net statistically significantly outperformed existing transformer-related methods and state-of-the-art 3D methods (P<0.05, P<0.01, or P<0.001), with a higher Dice coefficient (85.54% and 77.36%, respectively) or a lower 95% Hausdorff distance (4.05 and 8.34 mm, respectively). CONCLUSIONS: UMRFormer-Net can obtain more matched and accurate segmentation boundary and region information in pancreas segmentation, thus improving the accuracy of pancreas segmentation. The code is available at https://github.com/supersunshinefk/UMRFormer-Net. AME Publishing Company 2023-02-10 2023-03-01 /pmc/articles/PMC10006157/ /pubmed/36915332 http://dx.doi.org/10.21037/qims-22-544 Text en 2023 Quantitative Imaging in Medicine and Surgery. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Original Article
Fang, Kun
He, Baochun
Liu, Libo
Hu, Haoyu
Fang, Chihua
Huang, Xuguang
Jia, Fucang
UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
title UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
title_full UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
title_fullStr UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
title_full_unstemmed UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
title_short UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
title_sort umrformer-net: a three-dimensional u-shaped pancreas segmentation method based on a double-layer bridged transformer network
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10006157/
https://www.ncbi.nlm.nih.gov/pubmed/36915332
http://dx.doi.org/10.21037/qims-22-544
work_keys_str_mv AT fangkun umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork
AT hebaochun umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork
AT liulibo umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork
AT huhaoyu umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork
AT fangchihua umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork
AT huangxuguang umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork
AT jiafucang umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork