Cargando…

O-Net: A Novel Framework With Deep Fusion of CNN and Transformer for Simultaneous Segmentation and Classification

The application of deep learning in the medical field has continuously made huge breakthroughs in recent years. Based on convolutional neural network (CNN), the U-Net framework has become the benchmark of the medical image segmentation task. However, this framework cannot fully learn global informat...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Tao, Lan, Junlin, Han, Zixin, Hu, Ziwei, Huang, Yuxiu, Deng, Yanglin, Zhang, Hejun, Wang, Jianchao, Chen, Musheng, Jiang, Haiyan, Lee, Ren-Guey, Gao, Qinquan, Du, Ming, Tong, Tong, Chen, Gang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9201625/
https://www.ncbi.nlm.nih.gov/pubmed/35720715
http://dx.doi.org/10.3389/fnins.2022.876065
_version_ 1784728355473457152
author Wang, Tao
Lan, Junlin
Han, Zixin
Hu, Ziwei
Huang, Yuxiu
Deng, Yanglin
Zhang, Hejun
Wang, Jianchao
Chen, Musheng
Jiang, Haiyan
Lee, Ren-Guey
Gao, Qinquan
Du, Ming
Tong, Tong
Chen, Gang
author_facet Wang, Tao
Lan, Junlin
Han, Zixin
Hu, Ziwei
Huang, Yuxiu
Deng, Yanglin
Zhang, Hejun
Wang, Jianchao
Chen, Musheng
Jiang, Haiyan
Lee, Ren-Guey
Gao, Qinquan
Du, Ming
Tong, Tong
Chen, Gang
author_sort Wang, Tao
collection PubMed
description The application of deep learning in the medical field has continuously made huge breakthroughs in recent years. Based on convolutional neural network (CNN), the U-Net framework has become the benchmark of the medical image segmentation task. However, this framework cannot fully learn global information and remote semantic information. The transformer structure has been demonstrated to capture global information relatively better than the U-Net, but the ability to learn local information is not as good as CNN. Therefore, we propose a novel network referred to as the O-Net, which combines the advantages of CNN and transformer to fully use both the global and the local information for improving medical image segmentation and classification. In the encoder part of our proposed O-Net framework, we combine the CNN and the Swin Transformer to acquire both global and local contextual features. In the decoder part, the results of the Swin Transformer and the CNN blocks are fused to get the final results. We have evaluated the proposed network on the synapse multi-organ CT dataset and the ISIC 2017 challenge dataset for the segmentation task. The classification network is simultaneously trained by using the encoder weights of the segmentation network. The experimental results show that our proposed O-Net achieves superior segmentation performance than state-of-the-art approaches, and the segmentation results are beneficial for improving the accuracy of the classification task. The codes and models of this study are available at https://github.com/ortonwang/O-Net.
format Online
Article
Text
id pubmed-9201625
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-92016252022-06-17 O-Net: A Novel Framework With Deep Fusion of CNN and Transformer for Simultaneous Segmentation and Classification Wang, Tao Lan, Junlin Han, Zixin Hu, Ziwei Huang, Yuxiu Deng, Yanglin Zhang, Hejun Wang, Jianchao Chen, Musheng Jiang, Haiyan Lee, Ren-Guey Gao, Qinquan Du, Ming Tong, Tong Chen, Gang Front Neurosci Neuroscience The application of deep learning in the medical field has continuously made huge breakthroughs in recent years. Based on convolutional neural network (CNN), the U-Net framework has become the benchmark of the medical image segmentation task. However, this framework cannot fully learn global information and remote semantic information. The transformer structure has been demonstrated to capture global information relatively better than the U-Net, but the ability to learn local information is not as good as CNN. Therefore, we propose a novel network referred to as the O-Net, which combines the advantages of CNN and transformer to fully use both the global and the local information for improving medical image segmentation and classification. In the encoder part of our proposed O-Net framework, we combine the CNN and the Swin Transformer to acquire both global and local contextual features. In the decoder part, the results of the Swin Transformer and the CNN blocks are fused to get the final results. We have evaluated the proposed network on the synapse multi-organ CT dataset and the ISIC 2017 challenge dataset for the segmentation task. The classification network is simultaneously trained by using the encoder weights of the segmentation network. The experimental results show that our proposed O-Net achieves superior segmentation performance than state-of-the-art approaches, and the segmentation results are beneficial for improving the accuracy of the classification task. The codes and models of this study are available at https://github.com/ortonwang/O-Net. Frontiers Media S.A. 2022-06-02 /pmc/articles/PMC9201625/ /pubmed/35720715 http://dx.doi.org/10.3389/fnins.2022.876065 Text en Copyright © 2022 Wang, Lan, Han, Hu, Huang, Deng, Zhang, Wang, Chen, Jiang, Lee, Gao, Du, Tong and Chen. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Wang, Tao
Lan, Junlin
Han, Zixin
Hu, Ziwei
Huang, Yuxiu
Deng, Yanglin
Zhang, Hejun
Wang, Jianchao
Chen, Musheng
Jiang, Haiyan
Lee, Ren-Guey
Gao, Qinquan
Du, Ming
Tong, Tong
Chen, Gang
O-Net: A Novel Framework With Deep Fusion of CNN and Transformer for Simultaneous Segmentation and Classification
title O-Net: A Novel Framework With Deep Fusion of CNN and Transformer for Simultaneous Segmentation and Classification
title_full O-Net: A Novel Framework With Deep Fusion of CNN and Transformer for Simultaneous Segmentation and Classification
title_fullStr O-Net: A Novel Framework With Deep Fusion of CNN and Transformer for Simultaneous Segmentation and Classification
title_full_unstemmed O-Net: A Novel Framework With Deep Fusion of CNN and Transformer for Simultaneous Segmentation and Classification
title_short O-Net: A Novel Framework With Deep Fusion of CNN and Transformer for Simultaneous Segmentation and Classification
title_sort o-net: a novel framework with deep fusion of cnn and transformer for simultaneous segmentation and classification
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9201625/
https://www.ncbi.nlm.nih.gov/pubmed/35720715
http://dx.doi.org/10.3389/fnins.2022.876065
work_keys_str_mv AT wangtao onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT lanjunlin onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT hanzixin onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT huziwei onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT huangyuxiu onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT dengyanglin onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT zhanghejun onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT wangjianchao onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT chenmusheng onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT jianghaiyan onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT leerenguey onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT gaoqinquan onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT duming onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT tongtong onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification
AT chengang onetanovelframeworkwithdeepfusionofcnnandtransformerforsimultaneoussegmentationandclassification