Cargando…

HybridCTrm: Bridging CNN and Transformer for Multimodal Brain Image Segmentation

Multimodal medical image segmentation is always a critical problem in medical image segmentation. Traditional deep learning methods utilize fully CNNs for encoding given images, thus leading to deficiency of long-range dependencies and bad generalization performance. Recently, a sequence of Transfor...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Qixuan, Fang, Nianhua, Liu, Zhuo, Zhao, Liang, Wen, Youpeng, Lin, Hongxiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8500745/
https://www.ncbi.nlm.nih.gov/pubmed/34630994
http://dx.doi.org/10.1155/2021/7467261
_version_ 1784580510002970624
author Sun, Qixuan
Fang, Nianhua
Liu, Zhuo
Zhao, Liang
Wen, Youpeng
Lin, Hongxiang
author_facet Sun, Qixuan
Fang, Nianhua
Liu, Zhuo
Zhao, Liang
Wen, Youpeng
Lin, Hongxiang
author_sort Sun, Qixuan
collection PubMed
description Multimodal medical image segmentation is always a critical problem in medical image segmentation. Traditional deep learning methods utilize fully CNNs for encoding given images, thus leading to deficiency of long-range dependencies and bad generalization performance. Recently, a sequence of Transformer-based methodologies emerges in the field of image processing, which brings great generalization and performance in various tasks. On the other hand, traditional CNNs have their own advantages, such as rapid convergence and local representations. Therefore, we analyze a hybrid multimodal segmentation method based on Transformers and CNNs and propose a novel architecture, HybridCTrm network. We conduct experiments using HybridCTrm on two benchmark datasets and compare with HyperDenseNet, a network based on fully CNNs. Results show that our HybridCTrm outperforms HyperDenseNet on most of the evaluation metrics. Furthermore, we analyze the influence of the depth of Transformer on the performance. Besides, we visualize the results and carefully explore how our hybrid methods improve on segmentations.
format Online
Article
Text
id pubmed-8500745
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-85007452021-10-09 HybridCTrm: Bridging CNN and Transformer for Multimodal Brain Image Segmentation Sun, Qixuan Fang, Nianhua Liu, Zhuo Zhao, Liang Wen, Youpeng Lin, Hongxiang J Healthc Eng Research Article Multimodal medical image segmentation is always a critical problem in medical image segmentation. Traditional deep learning methods utilize fully CNNs for encoding given images, thus leading to deficiency of long-range dependencies and bad generalization performance. Recently, a sequence of Transformer-based methodologies emerges in the field of image processing, which brings great generalization and performance in various tasks. On the other hand, traditional CNNs have their own advantages, such as rapid convergence and local representations. Therefore, we analyze a hybrid multimodal segmentation method based on Transformers and CNNs and propose a novel architecture, HybridCTrm network. We conduct experiments using HybridCTrm on two benchmark datasets and compare with HyperDenseNet, a network based on fully CNNs. Results show that our HybridCTrm outperforms HyperDenseNet on most of the evaluation metrics. Furthermore, we analyze the influence of the depth of Transformer on the performance. Besides, we visualize the results and carefully explore how our hybrid methods improve on segmentations. Hindawi 2021-10-01 /pmc/articles/PMC8500745/ /pubmed/34630994 http://dx.doi.org/10.1155/2021/7467261 Text en Copyright © 2021 Qixuan Sun et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Sun, Qixuan
Fang, Nianhua
Liu, Zhuo
Zhao, Liang
Wen, Youpeng
Lin, Hongxiang
HybridCTrm: Bridging CNN and Transformer for Multimodal Brain Image Segmentation
title HybridCTrm: Bridging CNN and Transformer for Multimodal Brain Image Segmentation
title_full HybridCTrm: Bridging CNN and Transformer for Multimodal Brain Image Segmentation
title_fullStr HybridCTrm: Bridging CNN and Transformer for Multimodal Brain Image Segmentation
title_full_unstemmed HybridCTrm: Bridging CNN and Transformer for Multimodal Brain Image Segmentation
title_short HybridCTrm: Bridging CNN and Transformer for Multimodal Brain Image Segmentation
title_sort hybridctrm: bridging cnn and transformer for multimodal brain image segmentation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8500745/
https://www.ncbi.nlm.nih.gov/pubmed/34630994
http://dx.doi.org/10.1155/2021/7467261
work_keys_str_mv AT sunqixuan hybridctrmbridgingcnnandtransformerformultimodalbrainimagesegmentation
AT fangnianhua hybridctrmbridgingcnnandtransformerformultimodalbrainimagesegmentation
AT liuzhuo hybridctrmbridgingcnnandtransformerformultimodalbrainimagesegmentation
AT zhaoliang hybridctrmbridgingcnnandtransformerformultimodalbrainimagesegmentation
AT wenyoupeng hybridctrmbridgingcnnandtransformerformultimodalbrainimagesegmentation
AT linhongxiang hybridctrmbridgingcnnandtransformerformultimodalbrainimagesegmentation