Improved Deep Neural Network for Cross-Media Visual Communication
Cross-media visual communication is an extremely complex task. In order to solve the problem of segmenting the visual foreground and background, improve the accuracy of visual communication scene reconstruction, and complete the task of real-time visual communication, we propose an improved generative adversarial network.
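The description above gives the architecture only at a high level: a GAN whose generator gains a combined encoder-decoder ("codec") block, skip connections that pass each downsampling feature map to its corresponding upsampling layer, and a CNN-based auxiliary classifier for features from different scene layers. The record contains no implementation details, so the sketch below is only a minimal illustration of that kind of structure, assuming PyTorch; the class names `CodecGenerator` and `AuxiliaryClassifier`, all layer sizes, and the class count are placeholders, not the paper's actual configuration.

```python
# Minimal sketch, assuming PyTorch; layer widths and depths are illustrative only.
import torch
import torch.nn as nn


class CodecGenerator(nn.Module):
    """Encoder-decoder ("codec") generator: each downsampling feature map is
    preserved and concatenated into the matching upsampling layer."""

    def __init__(self, in_ch: int = 3, base: int = 64):
        super().__init__()
        # Downsampling (encoder) convolutions.
        self.down1 = nn.Sequential(nn.Conv2d(in_ch, base, 4, 2, 1), nn.LeakyReLU(0.2))
        self.down2 = nn.Sequential(nn.Conv2d(base, base * 2, 4, 2, 1),
                                   nn.BatchNorm2d(base * 2), nn.LeakyReLU(0.2))
        self.down3 = nn.Sequential(nn.Conv2d(base * 2, base * 4, 4, 2, 1),
                                   nn.BatchNorm2d(base * 4), nn.LeakyReLU(0.2))
        # Upsampling (decoder) convolutions; inputs are concatenated with the
        # corresponding encoder features (the "preserved" layers).
        self.up1 = nn.Sequential(nn.ConvTranspose2d(base * 4, base * 2, 4, 2, 1),
                                 nn.BatchNorm2d(base * 2), nn.ReLU())
        self.up2 = nn.Sequential(nn.ConvTranspose2d(base * 4, base, 4, 2, 1),
                                 nn.BatchNorm2d(base), nn.ReLU())
        self.up3 = nn.Sequential(nn.ConvTranspose2d(base * 2, in_ch, 4, 2, 1), nn.Tanh())

    def forward(self, x):
        d1 = self.down1(x)                        # 1/2 resolution
        d2 = self.down2(d1)                       # 1/4 resolution
        d3 = self.down3(d2)                       # 1/8 resolution (deepest features)
        u1 = self.up1(d3)
        u2 = self.up2(torch.cat([u1, d2], dim=1))
        out = self.up3(torch.cat([u2, d1], dim=1))
        return out, d3                            # deepest features feed the auxiliary classifier


class AuxiliaryClassifier(nn.Module):
    """Small CNN head that classifies scene-layer features, e.g. foreground
    vs. background; num_classes=2 is an assumption for illustration."""

    def __init__(self, in_ch: int = 256, num_classes: int = 2):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, 128, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(128, num_classes))

    def forward(self, feats):
        return self.net(feats)


# Usage example with dummy data.
gen = CodecGenerator()
aux = AuxiliaryClassifier(in_ch=256, num_classes=2)
image = torch.randn(1, 3, 256, 256)
reconstruction, deep_feats = gen(image)
scene_logits = aux(deep_feats)                    # shape: (1, 2)
```

In such a setup the auxiliary classifier would typically be trained jointly with the adversarial loss on the encoder's deepest feature map, which is one plausible reading of the record's claim that the classifier improves recognition of similar scenes whose features sit at different layers.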
Main author: | Miao, Yubo |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Hindawi, 2022 |
Subjects: | Research Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9078775/ https://www.ncbi.nlm.nih.gov/pubmed/35535179 http://dx.doi.org/10.1155/2022/1556352 |
_version_ | 1784702411006279680 |
---|---|
author | Miao, Yubo |
author_facet | Miao, Yubo |
author_sort | Miao, Yubo |
collection | PubMed |
description | Cross-media visual communication is an extremely complex task. In order to solve the problem of segmenting the visual foreground and background, improve the accuracy of visual communication scene reconstruction, and complete the task of real-time visual communication, we propose an improved generative adversarial network. We take the generative adversarial network as the basis and add a combined codec package to the generator, while configuring the generator and discriminator as a cascade structure, preserving the feature upsampling and downsampling convolutional layers of visual scenes at different layers through their correspondence. To classify features from different visual scene layers, we add a new auxiliary classifier based on convolutional neural networks. With the help of the auxiliary classifier, similar visual scenes with different feature layers are recognized more accurately. In the experimental part, to better distinguish foreground and background in visual communication, we test foreground and background performance using separate datasets. The experimental results show that our method achieves good accuracy for both foreground and background in cross-media, real-time visual communication. In addition, we validate the efficiency of our method on the Cityscapes, NoW, and Replica datasets and experimentally demonstrate that it performs better than traditional machine learning methods and outperforms deep learning methods of the same type. |
format | Online Article Text |
id | pubmed-9078775 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-9078775 2022-05-08 Improved Deep Neural Network for Cross-Media Visual Communication Miao, Yubo Comput Intell Neurosci Research Article Cross-media visual communication is an extremely complex task. In order to solve the problem of segmenting the visual foreground and background, improve the accuracy of visual communication scene reconstruction, and complete the task of real-time visual communication, we propose an improved generative adversarial network. We take the generative adversarial network as the basis and add a combined codec package to the generator, while configuring the generator and discriminator as a cascade structure, preserving the feature upsampling and downsampling convolutional layers of visual scenes at different layers through their correspondence. To classify features from different visual scene layers, we add a new auxiliary classifier based on convolutional neural networks. With the help of the auxiliary classifier, similar visual scenes with different feature layers are recognized more accurately. In the experimental part, to better distinguish foreground and background in visual communication, we test foreground and background performance using separate datasets. The experimental results show that our method achieves good accuracy for both foreground and background in cross-media, real-time visual communication. In addition, we validate the efficiency of our method on the Cityscapes, NoW, and Replica datasets and experimentally demonstrate that it performs better than traditional machine learning methods and outperforms deep learning methods of the same type. Hindawi 2022-04-30 /pmc/articles/PMC9078775/ /pubmed/35535179 http://dx.doi.org/10.1155/2022/1556352 Text en Copyright © 2022 Yubo Miao. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Miao, Yubo Improved Deep Neural Network for Cross-Media Visual Communication |
title | Improved Deep Neural Network for Cross-Media Visual Communication |
title_full | Improved Deep Neural Network for Cross-Media Visual Communication |
title_fullStr | Improved Deep Neural Network for Cross-Media Visual Communication |
title_full_unstemmed | Improved Deep Neural Network for Cross-Media Visual Communication |
title_short | Improved Deep Neural Network for Cross-Media Visual Communication |
title_sort | improved deep neural network for cross-media visual communication |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9078775/ https://www.ncbi.nlm.nih.gov/pubmed/35535179 http://dx.doi.org/10.1155/2022/1556352 |
work_keys_str_mv | AT miaoyubo improveddeepneuralnetworkforcrossmediavisualcommunication |