Cargando…
UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network
BACKGROUND: Methods based on the combination of transformer and convolutional neural networks (CNNs) have achieved impressive results in the field of medical image segmentation. However, most of the recently proposed combination segmentation approaches simply treat transformers as auxiliary modules...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AME Publishing Company
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10006157/ https://www.ncbi.nlm.nih.gov/pubmed/36915332 http://dx.doi.org/10.21037/qims-22-544 |
_version_ | 1784905251955933184 |
---|---|
author | Fang, Kun He, Baochun Liu, Libo Hu, Haoyu Fang, Chihua Huang, Xuguang Jia, Fucang |
author_facet | Fang, Kun He, Baochun Liu, Libo Hu, Haoyu Fang, Chihua Huang, Xuguang Jia, Fucang |
author_sort | Fang, Kun |
collection | PubMed |
description | BACKGROUND: Methods based on the combination of transformer and convolutional neural networks (CNNs) have achieved impressive results in the field of medical image segmentation. However, most of the recently proposed combination segmentation approaches simply treat transformers as auxiliary modules which help to extract long-range information and encode global context into convolutional representations, and there is a lack of investigation on how to optimally combine self-attention with convolution. METHODS: We designed a novel transformer block (MRFormer) that combines a multi-head self-attention layer and a residual depthwise convolutional block as the basic unit to deeply integrate both long-range and local spatial information. The MRFormer block was embedded between the encoder and decoder in U-Net at the last two layers. This framework (UMRFormer-Net) was applied to the segmentation of three-dimensional (3D) pancreas, and its ability to effectively capture the characteristic contextual information of the pancreas and surrounding tissues was investigated. RESULTS: Experimental results show that the proposed UMRFormer-Net achieved accuracy in pancreas segmentation that was comparable or superior to that of existing state-of-the-art 3D methods in both the Clinical Proteomic Tumor Analysis Consortium Pancreatic Ductal Adenocarcinoma (CPTAC-PDA) dataset and the public Medical Segmentation Decathlon dataset (self-division). UMRFormer-Net statistically significantly outperformed existing transformer-related methods and state-of-the-art 3D methods (P<0.05, P<0.01, or P<0.001), with a higher Dice coefficient (85.54% and 77.36%, respectively) or a lower 95% Hausdorff distance (4.05 and 8.34 mm, respectively). CONCLUSIONS: UMRFormer-Net can obtain more matched and accurate segmentation boundary and region information in pancreas segmentation, thus improving the accuracy of pancreas segmentation. The code is available at https://github.com/supersunshinefk/UMRFormer-Net. |
format | Online Article Text |
id | pubmed-10006157 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | AME Publishing Company |
record_format | MEDLINE/PubMed |
spelling | pubmed-100061572023-03-12 UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network Fang, Kun He, Baochun Liu, Libo Hu, Haoyu Fang, Chihua Huang, Xuguang Jia, Fucang Quant Imaging Med Surg Original Article BACKGROUND: Methods based on the combination of transformer and convolutional neural networks (CNNs) have achieved impressive results in the field of medical image segmentation. However, most of the recently proposed combination segmentation approaches simply treat transformers as auxiliary modules which help to extract long-range information and encode global context into convolutional representations, and there is a lack of investigation on how to optimally combine self-attention with convolution. METHODS: We designed a novel transformer block (MRFormer) that combines a multi-head self-attention layer and a residual depthwise convolutional block as the basic unit to deeply integrate both long-range and local spatial information. The MRFormer block was embedded between the encoder and decoder in U-Net at the last two layers. This framework (UMRFormer-Net) was applied to the segmentation of three-dimensional (3D) pancreas, and its ability to effectively capture the characteristic contextual information of the pancreas and surrounding tissues was investigated. RESULTS: Experimental results show that the proposed UMRFormer-Net achieved accuracy in pancreas segmentation that was comparable or superior to that of existing state-of-the-art 3D methods in both the Clinical Proteomic Tumor Analysis Consortium Pancreatic Ductal Adenocarcinoma (CPTAC-PDA) dataset and the public Medical Segmentation Decathlon dataset (self-division). UMRFormer-Net statistically significantly outperformed existing transformer-related methods and state-of-the-art 3D methods (P<0.05, P<0.01, or P<0.001), with a higher Dice coefficient (85.54% and 77.36%, respectively) or a lower 95% Hausdorff distance (4.05 and 8.34 mm, respectively). CONCLUSIONS: UMRFormer-Net can obtain more matched and accurate segmentation boundary and region information in pancreas segmentation, thus improving the accuracy of pancreas segmentation. The code is available at https://github.com/supersunshinefk/UMRFormer-Net. AME Publishing Company 2023-02-10 2023-03-01 /pmc/articles/PMC10006157/ /pubmed/36915332 http://dx.doi.org/10.21037/qims-22-544 Text en 2023 Quantitative Imaging in Medicine and Surgery. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Original Article Fang, Kun He, Baochun Liu, Libo Hu, Haoyu Fang, Chihua Huang, Xuguang Jia, Fucang UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network |
title | UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network |
title_full | UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network |
title_fullStr | UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network |
title_full_unstemmed | UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network |
title_short | UMRFormer-net: a three-dimensional U-shaped pancreas segmentation method based on a double-layer bridged transformer network |
title_sort | umrformer-net: a three-dimensional u-shaped pancreas segmentation method based on a double-layer bridged transformer network |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10006157/ https://www.ncbi.nlm.nih.gov/pubmed/36915332 http://dx.doi.org/10.21037/qims-22-544 |
work_keys_str_mv | AT fangkun umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork AT hebaochun umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork AT liulibo umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork AT huhaoyu umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork AT fangchihua umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork AT huangxuguang umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork AT jiafucang umrformernetathreedimensionalushapedpancreassegmentationmethodbasedonadoublelayerbridgedtransformernetwork |