Cargando…
RAFF-Net: An improved tongue segmentation algorithm based on residual attention network and multiscale feature fusion
OBJECTIVE: Due to the complexity of face images, tongue segmentation is susceptible to interference from uneven tongue texture, lips and face, resulting in traditional methods failing to segment the tongue accurately. To address this problem, RAFF-Net, an automatic tongue region segmentation network...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9634193/ https://www.ncbi.nlm.nih.gov/pubmed/36339902 http://dx.doi.org/10.1177/20552076221136362 |
_version_ | 1784824417258307584 |
---|---|
author | Song, Haibei Huang, Zonghai Feng, Li Zhong, Yanmei Wen, Chuanbiao Guo, Jinhong |
author_facet | Song, Haibei Huang, Zonghai Feng, Li Zhong, Yanmei Wen, Chuanbiao Guo, Jinhong |
author_sort | Song, Haibei |
collection | PubMed |
description | OBJECTIVE: Due to the complexity of face images, tongue segmentation is susceptible to interference from uneven tongue texture, lips and face, resulting in traditional methods failing to segment the tongue accurately. To address this problem, RAFF-Net, an automatic tongue region segmentation network based on residual attention network and multiscale feature fusion, was proposed. It aims to improve tongue segmentation accuracy and achieve end-to-end automated segmentation. METHODS: Based on the UNet backbone network, different numbers of ResBlocks combined with the Squeeze-and-Excitation (SE) block was used as an encoder to extract image layered features. The decoder structure of UNet was simplified and the number of parameters of the network model was reduced. Meanwhile, the multiscale feature fusion module was designed to optimize the network parameters by combining a custom loss function instead of the common cross-entropy loss function to further improve the detection accuracy. RESULTS: The RAFF-Net network structure achieved Mean Intersection over Union (MIoU) and F1-score of 97.85% and 97.73%, respectively, which improved 0.56% and 0.46%, respectively, compared with the original UNet; ablation experiments demonstrated that the improved algorithm could contribute to the enhancement of tongue segmentation effect. CONCLUSION: This study combined the residual attention network with multiscale feature fusion to effectively improve the segmentation accuracy of the tongue region, and optimized the input and output of the UNet network using different numbers of ResBlocks, SE block, multiscale feature fusion and weighted loss function, increased the stability of the network and improved the overall effect of the network. |
format | Online Article Text |
id | pubmed-9634193 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-96341932022-11-05 RAFF-Net: An improved tongue segmentation algorithm based on residual attention network and multiscale feature fusion Song, Haibei Huang, Zonghai Feng, Li Zhong, Yanmei Wen, Chuanbiao Guo, Jinhong Digit Health Original Research OBJECTIVE: Due to the complexity of face images, tongue segmentation is susceptible to interference from uneven tongue texture, lips and face, resulting in traditional methods failing to segment the tongue accurately. To address this problem, RAFF-Net, an automatic tongue region segmentation network based on residual attention network and multiscale feature fusion, was proposed. It aims to improve tongue segmentation accuracy and achieve end-to-end automated segmentation. METHODS: Based on the UNet backbone network, different numbers of ResBlocks combined with the Squeeze-and-Excitation (SE) block was used as an encoder to extract image layered features. The decoder structure of UNet was simplified and the number of parameters of the network model was reduced. Meanwhile, the multiscale feature fusion module was designed to optimize the network parameters by combining a custom loss function instead of the common cross-entropy loss function to further improve the detection accuracy. RESULTS: The RAFF-Net network structure achieved Mean Intersection over Union (MIoU) and F1-score of 97.85% and 97.73%, respectively, which improved 0.56% and 0.46%, respectively, compared with the original UNet; ablation experiments demonstrated that the improved algorithm could contribute to the enhancement of tongue segmentation effect. CONCLUSION: This study combined the residual attention network with multiscale feature fusion to effectively improve the segmentation accuracy of the tongue region, and optimized the input and output of the UNet network using different numbers of ResBlocks, SE block, multiscale feature fusion and weighted loss function, increased the stability of the network and improved the overall effect of the network. SAGE Publications 2022-11-03 /pmc/articles/PMC9634193/ /pubmed/36339902 http://dx.doi.org/10.1177/20552076221136362 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 License (https://creativecommons.org/licenses/by-nc-nd/4.0/) which permits non-commercial use, reproduction and distribution of the work as published without adaptation or alteration, without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Research Song, Haibei Huang, Zonghai Feng, Li Zhong, Yanmei Wen, Chuanbiao Guo, Jinhong RAFF-Net: An improved tongue segmentation algorithm based on residual attention network and multiscale feature fusion |
title | RAFF-Net: An improved tongue segmentation algorithm based on residual
attention network and multiscale feature fusion |
title_full | RAFF-Net: An improved tongue segmentation algorithm based on residual
attention network and multiscale feature fusion |
title_fullStr | RAFF-Net: An improved tongue segmentation algorithm based on residual
attention network and multiscale feature fusion |
title_full_unstemmed | RAFF-Net: An improved tongue segmentation algorithm based on residual
attention network and multiscale feature fusion |
title_short | RAFF-Net: An improved tongue segmentation algorithm based on residual
attention network and multiscale feature fusion |
title_sort | raff-net: an improved tongue segmentation algorithm based on residual
attention network and multiscale feature fusion |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9634193/ https://www.ncbi.nlm.nih.gov/pubmed/36339902 http://dx.doi.org/10.1177/20552076221136362 |
work_keys_str_mv | AT songhaibei raffnetanimprovedtonguesegmentationalgorithmbasedonresidualattentionnetworkandmultiscalefeaturefusion AT huangzonghai raffnetanimprovedtonguesegmentationalgorithmbasedonresidualattentionnetworkandmultiscalefeaturefusion AT fengli raffnetanimprovedtonguesegmentationalgorithmbasedonresidualattentionnetworkandmultiscalefeaturefusion AT zhongyanmei raffnetanimprovedtonguesegmentationalgorithmbasedonresidualattentionnetworkandmultiscalefeaturefusion AT wenchuanbiao raffnetanimprovedtonguesegmentationalgorithmbasedonresidualattentionnetworkandmultiscalefeaturefusion AT guojinhong raffnetanimprovedtonguesegmentationalgorithmbasedonresidualattentionnetworkandmultiscalefeaturefusion |