Cargando…
Facial image inpainting for big data using an effective attention mechanism and a convolutional neural network
Big data facial image is an important identity information for people. However, facial image inpainting using existing deep learning methods has some problems such as insufficient feature mining and incomplete semantic expression, leading to output image artifacts or fuzzy textures. Therefore, it is...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9877444/ https://www.ncbi.nlm.nih.gov/pubmed/36714154 http://dx.doi.org/10.3389/fnbot.2022.1111621 |
_version_ | 1784878367249530880 |
---|---|
author | Lu, Xiaoman Lu, Ran Zhao, Wenhao Ma, Erbin |
author_facet | Lu, Xiaoman Lu, Ran Zhao, Wenhao Ma, Erbin |
author_sort | Lu, Xiaoman |
collection | PubMed |
description | Big data facial image is an important identity information for people. However, facial image inpainting using existing deep learning methods has some problems such as insufficient feature mining and incomplete semantic expression, leading to output image artifacts or fuzzy textures. Therefore, it is of practical significance to study how to effectively restore an incomplete facial image. In this study, we proposed a facial image inpainting method using a multistage generative adversarial network (GAN) and the global attention mechanism (GAM). For the overall network structure, we used the GAN as the main body, then we established skip connections to optimize the network structure, and used the encoder–decoder structure to better capture the semantic information of the missing part of a facial image. A local refinement network has been proposed to enhance the local restoration effect and to weaken the influence of unsatisfactory results. Moreover, GAM is added to the network to magnify the interactive features of the global dimension while reducing information dispersion, which is more suitable for restoring human facial information. Comparative experiments on CelebA and CelebA-HQ big datasets show that the proposed method generates realistic inpainting results in both regular and irregular masks and achieves peak signal-to-noise ratio (PSNR) and structural similarity (SSIM), as well as other evaluation indicators that illustrate the performance and efficiency of the proposed model. |
format | Online Article Text |
id | pubmed-9877444 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-98774442023-01-27 Facial image inpainting for big data using an effective attention mechanism and a convolutional neural network Lu, Xiaoman Lu, Ran Zhao, Wenhao Ma, Erbin Front Neurorobot Neuroscience Big data facial image is an important identity information for people. However, facial image inpainting using existing deep learning methods has some problems such as insufficient feature mining and incomplete semantic expression, leading to output image artifacts or fuzzy textures. Therefore, it is of practical significance to study how to effectively restore an incomplete facial image. In this study, we proposed a facial image inpainting method using a multistage generative adversarial network (GAN) and the global attention mechanism (GAM). For the overall network structure, we used the GAN as the main body, then we established skip connections to optimize the network structure, and used the encoder–decoder structure to better capture the semantic information of the missing part of a facial image. A local refinement network has been proposed to enhance the local restoration effect and to weaken the influence of unsatisfactory results. Moreover, GAM is added to the network to magnify the interactive features of the global dimension while reducing information dispersion, which is more suitable for restoring human facial information. Comparative experiments on CelebA and CelebA-HQ big datasets show that the proposed method generates realistic inpainting results in both regular and irregular masks and achieves peak signal-to-noise ratio (PSNR) and structural similarity (SSIM), as well as other evaluation indicators that illustrate the performance and efficiency of the proposed model. Frontiers Media S.A. 2023-01-12 /pmc/articles/PMC9877444/ /pubmed/36714154 http://dx.doi.org/10.3389/fnbot.2022.1111621 Text en Copyright © 2023 Lu, Lu, Zhao and Ma. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Lu, Xiaoman Lu, Ran Zhao, Wenhao Ma, Erbin Facial image inpainting for big data using an effective attention mechanism and a convolutional neural network |
title | Facial image inpainting for big data using an effective attention mechanism and a convolutional neural network |
title_full | Facial image inpainting for big data using an effective attention mechanism and a convolutional neural network |
title_fullStr | Facial image inpainting for big data using an effective attention mechanism and a convolutional neural network |
title_full_unstemmed | Facial image inpainting for big data using an effective attention mechanism and a convolutional neural network |
title_short | Facial image inpainting for big data using an effective attention mechanism and a convolutional neural network |
title_sort | facial image inpainting for big data using an effective attention mechanism and a convolutional neural network |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9877444/ https://www.ncbi.nlm.nih.gov/pubmed/36714154 http://dx.doi.org/10.3389/fnbot.2022.1111621 |
work_keys_str_mv | AT luxiaoman facialimageinpaintingforbigdatausinganeffectiveattentionmechanismandaconvolutionalneuralnetwork AT luran facialimageinpaintingforbigdatausinganeffectiveattentionmechanismandaconvolutionalneuralnetwork AT zhaowenhao facialimageinpaintingforbigdatausinganeffectiveattentionmechanismandaconvolutionalneuralnetwork AT maerbin facialimageinpaintingforbigdatausinganeffectiveattentionmechanismandaconvolutionalneuralnetwork |