E2VIDX: improved bridge between conventional vision and bionic vision

Common RGBD, CMOS, and CCD-based cameras produce motion blur and incorrect exposure under high-speed and improper lighting conditions. Developed according to the bionic principle, the event camera has the advantages of low delay, high dynamic range, and no motion blur. However, due to its unique dat...

Bibliographic Details
Main authors:	Hou, Xujia; Zhang, Feihu; Gulati, Dhiraj; Tan, Tingfeng; Zhang, Wei
Format:	Online Article Text
Language:	English
Published:	Frontiers Media S.A. 2023
Subjects:	Neuroscience
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10639115/ https://www.ncbi.nlm.nih.gov/pubmed/37954492 http://dx.doi.org/10.3389/fnbot.2023.1277160

_version_	1785133728260947968
author	Hou, Xujia Zhang, Feihu Gulati, Dhiraj Tan, Tingfeng Zhang, Wei
author_facet	Hou, Xujia Zhang, Feihu Gulati, Dhiraj Tan, Tingfeng Zhang, Wei
author_sort	Hou, Xujia
collection	PubMed
description	Common RGBD, CMOS, and CCD-based cameras produce motion blur and incorrect exposure under high-speed and improper lighting conditions. Developed according to the bionic principle, the event camera has the advantages of low delay, high dynamic range, and no motion blur. However, due to its unique data representation, it encounters significant obstacles in practical applications. Image reconstruction algorithms based on an event camera solve this problem by converting a series of “events” into common frames so that existing vision algorithms can be applied. Due to the rapid development of neural networks, this field has made significant breakthroughs in the past few years. Based on the most popular Events-to-Video (E2VID) method, this study designs a new network called E2VIDX. The proposed network includes group convolution and sub-pixel convolution, which not only achieve better feature fusion but also reduce the network model size by 25%. Furthermore, we propose a new loss function. The loss function is divided into two parts: the first part calculates the high-level features and the second part calculates the low-level features of the reconstructed image. The experimental results clearly outperform the state-of-the-art method. Compared with the original method, Structural Similarity (SSIM) increases by 1.3%, Learned Perceptual Image Patch Similarity (LPIPS) decreases by 1.7%, Mean Squared Error (MSE) decreases by 2.5%, and it runs faster on GPU and CPU. Additionally, we evaluate the results of E2VIDX with application to image classification, object detection, and instance segmentation. The experiments show that conversions using our method can help event cameras directly apply existing vision algorithms in most scenarios.
format	Online Article Text
id	pubmed-10639115
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-10639115 2023-11-11 E2VIDX: improved bridge between conventional vision and bionic vision Hou, Xujia Zhang, Feihu Gulati, Dhiraj Tan, Tingfeng Zhang, Wei Front Neurorobot Neuroscience Common RGBD, CMOS, and CCD-based cameras produce motion blur and incorrect exposure under high-speed and improper lighting conditions. Developed according to the bionic principle, the event camera has the advantages of low delay, high dynamic range, and no motion blur. However, due to its unique data representation, it encounters significant obstacles in practical applications. Image reconstruction algorithms based on an event camera solve this problem by converting a series of “events” into common frames so that existing vision algorithms can be applied. Due to the rapid development of neural networks, this field has made significant breakthroughs in the past few years. Based on the most popular Events-to-Video (E2VID) method, this study designs a new network called E2VIDX. The proposed network includes group convolution and sub-pixel convolution, which not only achieve better feature fusion but also reduce the network model size by 25%. Furthermore, we propose a new loss function. The loss function is divided into two parts: the first part calculates the high-level features and the second part calculates the low-level features of the reconstructed image. The experimental results clearly outperform the state-of-the-art method. Compared with the original method, Structural Similarity (SSIM) increases by 1.3%, Learned Perceptual Image Patch Similarity (LPIPS) decreases by 1.7%, Mean Squared Error (MSE) decreases by 2.5%, and it runs faster on GPU and CPU. Additionally, we evaluate the results of E2VIDX with application to image classification, object detection, and instance segmentation. The experiments show that conversions using our method can help event cameras directly apply existing vision algorithms in most scenarios. Frontiers Media S.A.
2023-10-26 /pmc/articles/PMC10639115/ /pubmed/37954492 http://dx.doi.org/10.3389/fnbot.2023.1277160 Text en Copyright © 2023 Hou, Zhang, Gulati, Tan and Zhang. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Hou, Xujia Zhang, Feihu Gulati, Dhiraj Tan, Tingfeng Zhang, Wei E2VIDX: improved bridge between conventional vision and bionic vision
title	E2VIDX: improved bridge between conventional vision and bionic vision
title_full	E2VIDX: improved bridge between conventional vision and bionic vision
title_fullStr	E2VIDX: improved bridge between conventional vision and bionic vision
title_full_unstemmed	E2VIDX: improved bridge between conventional vision and bionic vision
title_short	E2VIDX: improved bridge between conventional vision and bionic vision
title_sort	e2vidx: improved bridge between conventional vision and bionic vision
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10639115/ https://www.ncbi.nlm.nih.gov/pubmed/37954492 http://dx.doi.org/10.3389/fnbot.2023.1277160
work_keys_str_mv	AT houxujia e2vidximprovedbridgebetweenconventionalvisionandbionicvision AT zhangfeihu e2vidximprovedbridgebetweenconventionalvisionandbionicvision AT gulatidhiraj e2vidximprovedbridgebetweenconventionalvisionandbionicvision AT tantingfeng e2vidximprovedbridgebetweenconventionalvisionandbionicvision AT zhangwei e2vidximprovedbridgebetweenconventionalvisionandbionicvision

Similar items