Cargando…

Semantic segmentation method of underwater images based on encoder-decoder architecture

With the exploration and development of marine resources, deep learning is more and more widely used in underwater image processing. However, the quality of the original underwater images is so low that traditional semantic segmentation methods obtain poor segmentation results, such as blurred targe...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jinkang, He, Xiaohui, Shao, Faming, Lu, Guanlin, Hu, Ruizhe, Jiang, Qunyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9409518/
https://www.ncbi.nlm.nih.gov/pubmed/36006956
http://dx.doi.org/10.1371/journal.pone.0272666
_version_ 1784774870691741696
author Wang, Jinkang
He, Xiaohui
Shao, Faming
Lu, Guanlin
Hu, Ruizhe
Jiang, Qunyan
author_facet Wang, Jinkang
He, Xiaohui
Shao, Faming
Lu, Guanlin
Hu, Ruizhe
Jiang, Qunyan
author_sort Wang, Jinkang
collection PubMed
description With the exploration and development of marine resources, deep learning is more and more widely used in underwater image processing. However, the quality of the original underwater images is so low that traditional semantic segmentation methods obtain poor segmentation results, such as blurred target edges, insufficient segmentation accuracy, and poor regional boundary segmentation effects. To solve these problems, this paper proposes a semantic segmentation method for underwater images. Firstly, the image enhancement based on multi-spatial transformation is performed to improve the quality of the original images, which is not common in other advanced semantic segmentation methods. Then, the densely connected hybrid atrous convolution effectively expands the receptive field and slows down the speed of resolution reduction. Next, the cascaded atrous convolutional spatial pyramid pooling module integrates boundary features of different scales to enrich target details. Finally, the context information aggregation decoder fuses the features of the shallow network and the deep network to extract rich contextual information, which greatly reduces information loss. The proposed method was evaluated on RUIE, HabCam UID, and UIEBD. Compared with the state-of-the-art semantic segmentation algorithms, the proposed method has advantages in segmentation integrity, location accuracy, boundary clarity, and detail in subjective perception. On the objective data, the proposed method achieves the highest MIOU of 68.3 and OA of 79.4, and it has a low resource consumption. Besides, the ablation experiment also verifies the effectiveness of our method.
format Online
Article
Text
id pubmed-9409518
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-94095182022-08-26 Semantic segmentation method of underwater images based on encoder-decoder architecture Wang, Jinkang He, Xiaohui Shao, Faming Lu, Guanlin Hu, Ruizhe Jiang, Qunyan PLoS One Research Article With the exploration and development of marine resources, deep learning is more and more widely used in underwater image processing. However, the quality of the original underwater images is so low that traditional semantic segmentation methods obtain poor segmentation results, such as blurred target edges, insufficient segmentation accuracy, and poor regional boundary segmentation effects. To solve these problems, this paper proposes a semantic segmentation method for underwater images. Firstly, the image enhancement based on multi-spatial transformation is performed to improve the quality of the original images, which is not common in other advanced semantic segmentation methods. Then, the densely connected hybrid atrous convolution effectively expands the receptive field and slows down the speed of resolution reduction. Next, the cascaded atrous convolutional spatial pyramid pooling module integrates boundary features of different scales to enrich target details. Finally, the context information aggregation decoder fuses the features of the shallow network and the deep network to extract rich contextual information, which greatly reduces information loss. The proposed method was evaluated on RUIE, HabCam UID, and UIEBD. Compared with the state-of-the-art semantic segmentation algorithms, the proposed method has advantages in segmentation integrity, location accuracy, boundary clarity, and detail in subjective perception. On the objective data, the proposed method achieves the highest MIOU of 68.3 and OA of 79.4, and it has a low resource consumption. Besides, the ablation experiment also verifies the effectiveness of our method. Public Library of Science 2022-08-25 /pmc/articles/PMC9409518/ /pubmed/36006956 http://dx.doi.org/10.1371/journal.pone.0272666 Text en © 2022 Wang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, Jinkang
He, Xiaohui
Shao, Faming
Lu, Guanlin
Hu, Ruizhe
Jiang, Qunyan
Semantic segmentation method of underwater images based on encoder-decoder architecture
title Semantic segmentation method of underwater images based on encoder-decoder architecture
title_full Semantic segmentation method of underwater images based on encoder-decoder architecture
title_fullStr Semantic segmentation method of underwater images based on encoder-decoder architecture
title_full_unstemmed Semantic segmentation method of underwater images based on encoder-decoder architecture
title_short Semantic segmentation method of underwater images based on encoder-decoder architecture
title_sort semantic segmentation method of underwater images based on encoder-decoder architecture
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9409518/
https://www.ncbi.nlm.nih.gov/pubmed/36006956
http://dx.doi.org/10.1371/journal.pone.0272666
work_keys_str_mv AT wangjinkang semanticsegmentationmethodofunderwaterimagesbasedonencoderdecoderarchitecture
AT hexiaohui semanticsegmentationmethodofunderwaterimagesbasedonencoderdecoderarchitecture
AT shaofaming semanticsegmentationmethodofunderwaterimagesbasedonencoderdecoderarchitecture
AT luguanlin semanticsegmentationmethodofunderwaterimagesbasedonencoderdecoderarchitecture
AT huruizhe semanticsegmentationmethodofunderwaterimagesbasedonencoderdecoderarchitecture
AT jiangqunyan semanticsegmentationmethodofunderwaterimagesbasedonencoderdecoderarchitecture