Cargando…

Inter-Level Feature Balanced Fusion Network for Street Scene Segmentation

Semantic segmentation, as a pixel-level recognition task, has been widely used in a variety of practical scenes. Most of the existing methods try to improve the performance of the network by fusing the information of high and low layers. This kind of simple concatenation or element-wise addition wil...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Dongqian, Fan, Cien, Zou, Lian, Zuo, Qi, Jiang, Hao, Liu, Yifeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659532/
https://www.ncbi.nlm.nih.gov/pubmed/34883845
http://dx.doi.org/10.3390/s21237844
_version_ 1784612985226919936
author Li, Dongqian
Fan, Cien
Zou, Lian
Zuo, Qi
Jiang, Hao
Liu, Yifeng
author_facet Li, Dongqian
Fan, Cien
Zou, Lian
Zuo, Qi
Jiang, Hao
Liu, Yifeng
author_sort Li, Dongqian
collection PubMed
description Semantic segmentation, as a pixel-level recognition task, has been widely used in a variety of practical scenes. Most of the existing methods try to improve the performance of the network by fusing the information of high and low layers. This kind of simple concatenation or element-wise addition will lead to the problem of unbalanced fusion and low utilization of inter-level features. To solve this problem, we propose the Inter-Level Feature Balanced Fusion Network (IFBFNet) to guide the inter-level feature fusion towards a more balanced and effective direction. Our overall network architecture is based on the encoder–decoder architecture. In the encoder, we use a relatively deep convolution network to extract rich semantic information. In the decoder, skip-connections are added to connect and fuse low-level spatial features to restore a clearer boundary expression gradually. We add an inter-level feature balanced fusion module to each skip connection. Additionally, to better capture the boundary information, we added a shallower spatial information stream to supplement more spatial information details. Experiments have proved the effectiveness of our module. Our IFBFNet achieved a competitive performance on the Cityscapes dataset with only finely annotated data used for training and has been greatly improved on the baseline network.
format Online
Article
Text
id pubmed-8659532
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86595322021-12-10 Inter-Level Feature Balanced Fusion Network for Street Scene Segmentation Li, Dongqian Fan, Cien Zou, Lian Zuo, Qi Jiang, Hao Liu, Yifeng Sensors (Basel) Article Semantic segmentation, as a pixel-level recognition task, has been widely used in a variety of practical scenes. Most of the existing methods try to improve the performance of the network by fusing the information of high and low layers. This kind of simple concatenation or element-wise addition will lead to the problem of unbalanced fusion and low utilization of inter-level features. To solve this problem, we propose the Inter-Level Feature Balanced Fusion Network (IFBFNet) to guide the inter-level feature fusion towards a more balanced and effective direction. Our overall network architecture is based on the encoder–decoder architecture. In the encoder, we use a relatively deep convolution network to extract rich semantic information. In the decoder, skip-connections are added to connect and fuse low-level spatial features to restore a clearer boundary expression gradually. We add an inter-level feature balanced fusion module to each skip connection. Additionally, to better capture the boundary information, we added a shallower spatial information stream to supplement more spatial information details. Experiments have proved the effectiveness of our module. Our IFBFNet achieved a competitive performance on the Cityscapes dataset with only finely annotated data used for training and has been greatly improved on the baseline network. MDPI 2021-11-25 /pmc/articles/PMC8659532/ /pubmed/34883845 http://dx.doi.org/10.3390/s21237844 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Dongqian
Fan, Cien
Zou, Lian
Zuo, Qi
Jiang, Hao
Liu, Yifeng
Inter-Level Feature Balanced Fusion Network for Street Scene Segmentation
title Inter-Level Feature Balanced Fusion Network for Street Scene Segmentation
title_full Inter-Level Feature Balanced Fusion Network for Street Scene Segmentation
title_fullStr Inter-Level Feature Balanced Fusion Network for Street Scene Segmentation
title_full_unstemmed Inter-Level Feature Balanced Fusion Network for Street Scene Segmentation
title_short Inter-Level Feature Balanced Fusion Network for Street Scene Segmentation
title_sort inter-level feature balanced fusion network for street scene segmentation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659532/
https://www.ncbi.nlm.nih.gov/pubmed/34883845
http://dx.doi.org/10.3390/s21237844
work_keys_str_mv AT lidongqian interlevelfeaturebalancedfusionnetworkforstreetscenesegmentation
AT fancien interlevelfeaturebalancedfusionnetworkforstreetscenesegmentation
AT zoulian interlevelfeaturebalancedfusionnetworkforstreetscenesegmentation
AT zuoqi interlevelfeaturebalancedfusionnetworkforstreetscenesegmentation
AT jianghao interlevelfeaturebalancedfusionnetworkforstreetscenesegmentation
AT liuyifeng interlevelfeaturebalancedfusionnetworkforstreetscenesegmentation