Cargando…
Towards More Accurate and Complete Heterogeneous Iris Segmentation Using a Hybrid Deep Learning Approach
Accurate iris segmentation is a crucial preprocessing stage for computer-aided ophthalmic disease diagnosis. The quality of iris images taken under different camera sensors varies greatly, and thus accurate segmentation of heterogeneous iris databases is a huge challenge. At present, network archite...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9501181/ https://www.ncbi.nlm.nih.gov/pubmed/36135411 http://dx.doi.org/10.3390/jimaging8090246 |
_version_ | 1784795410760466432 |
---|---|
author | Meng, Yuan Bao, Tie |
author_facet | Meng, Yuan Bao, Tie |
author_sort | Meng, Yuan |
collection | PubMed |
description | Accurate iris segmentation is a crucial preprocessing stage for computer-aided ophthalmic disease diagnosis. The quality of iris images taken under different camera sensors varies greatly, and thus accurate segmentation of heterogeneous iris databases is a huge challenge. At present, network architectures based on convolutional neural networks (CNNs) have been widely applied in iris segmentation tasks. However, due to the limited kernel size of convolution layers, iris segmentation networks based on CNNs cannot learn global and long-term semantic information interactions well, and this will bring challenges to accurately segmenting the iris region. Inspired by the success of vision transformer (VIT) and swin transformer (Swin T), a hybrid deep learning approach is proposed to segment heterogeneous iris images. Specifically, we first proposed a bilateral segmentation backbone network that combines the benefits of Swin T with CNNs. Then, a multiscale feature information extraction module (MFIEM) is proposed to extract multiscale spatial information at a more granular level. Finally, a channel attention mechanism module (CAMM) is used in this paper to enhance the discriminability of the iris region. Experimental results on a multisource heterogeneous iris database show that our network has a significant performance advantage compared with some state-of-the-art (SOTA) iris segmentation networks. |
format | Online Article Text |
id | pubmed-9501181 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-95011812022-09-24 Towards More Accurate and Complete Heterogeneous Iris Segmentation Using a Hybrid Deep Learning Approach Meng, Yuan Bao, Tie J Imaging Article Accurate iris segmentation is a crucial preprocessing stage for computer-aided ophthalmic disease diagnosis. The quality of iris images taken under different camera sensors varies greatly, and thus accurate segmentation of heterogeneous iris databases is a huge challenge. At present, network architectures based on convolutional neural networks (CNNs) have been widely applied in iris segmentation tasks. However, due to the limited kernel size of convolution layers, iris segmentation networks based on CNNs cannot learn global and long-term semantic information interactions well, and this will bring challenges to accurately segmenting the iris region. Inspired by the success of vision transformer (VIT) and swin transformer (Swin T), a hybrid deep learning approach is proposed to segment heterogeneous iris images. Specifically, we first proposed a bilateral segmentation backbone network that combines the benefits of Swin T with CNNs. Then, a multiscale feature information extraction module (MFIEM) is proposed to extract multiscale spatial information at a more granular level. Finally, a channel attention mechanism module (CAMM) is used in this paper to enhance the discriminability of the iris region. Experimental results on a multisource heterogeneous iris database show that our network has a significant performance advantage compared with some state-of-the-art (SOTA) iris segmentation networks. MDPI 2022-09-10 /pmc/articles/PMC9501181/ /pubmed/36135411 http://dx.doi.org/10.3390/jimaging8090246 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Meng, Yuan Bao, Tie Towards More Accurate and Complete Heterogeneous Iris Segmentation Using a Hybrid Deep Learning Approach |
title | Towards More Accurate and Complete Heterogeneous Iris Segmentation Using a Hybrid Deep Learning Approach |
title_full | Towards More Accurate and Complete Heterogeneous Iris Segmentation Using a Hybrid Deep Learning Approach |
title_fullStr | Towards More Accurate and Complete Heterogeneous Iris Segmentation Using a Hybrid Deep Learning Approach |
title_full_unstemmed | Towards More Accurate and Complete Heterogeneous Iris Segmentation Using a Hybrid Deep Learning Approach |
title_short | Towards More Accurate and Complete Heterogeneous Iris Segmentation Using a Hybrid Deep Learning Approach |
title_sort | towards more accurate and complete heterogeneous iris segmentation using a hybrid deep learning approach |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9501181/ https://www.ncbi.nlm.nih.gov/pubmed/36135411 http://dx.doi.org/10.3390/jimaging8090246 |
work_keys_str_mv | AT mengyuan towardsmoreaccurateandcompleteheterogeneousirissegmentationusingahybriddeeplearningapproach AT baotie towardsmoreaccurateandcompleteheterogeneousirissegmentationusingahybriddeeplearningapproach |