DBGC: Dimension-Based Generic Convolution Block for Object Recognition

The object recognition concept is being widely used as a result of increasing CCTV surveillance and the need for automatic object or activity detection from images or video. Increases in the use of various sensor networks have also raised the need for lightweight process frameworks. Much research has be...

Bibliographic Details
Main Authors: Patel, Chirag, Bhatt, Dulari, Sharma, Urvashi, Patel, Radhika, Pandya, Sharnil, Modi, Kirit, Cholli, Nagaraj, Patel, Akash, Bhatt, Urvi, Khan, Muhammad Ahmed, Majumdar, Shubhankar, Zuhair, Mohd, Patel, Khushi, Shah, Syed Aziz, Ghayvat, Hemant
Format: Online Article Text
Language: English
Published: MDPI 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8914730/
https://www.ncbi.nlm.nih.gov/pubmed/35270929
http://dx.doi.org/10.3390/s22051780
author Patel, Chirag
Bhatt, Dulari
Sharma, Urvashi
Patel, Radhika
Pandya, Sharnil
Modi, Kirit
Cholli, Nagaraj
Patel, Akash
Bhatt, Urvi
Khan, Muhammad Ahmed
Majumdar, Shubhankar
Zuhair, Mohd
Patel, Khushi
Shah, Syed Aziz
Ghayvat, Hemant
author_sort Patel, Chirag
collection PubMed
description The object recognition concept is being widely used as a result of increasing CCTV surveillance and the need for automatic object or activity detection from images or video. Increases in the use of various sensor networks have also raised the need for lightweight process frameworks. Much research has been carried out in this area, but the research scope remains vast because it deals with open-ended problems, such as achieving high accuracy in little time using lightweight process frameworks. Convolutional Neural Networks (CNNs) and their variants are widely used in various computer vision activities, but most CNN architectures are application-specific. There is always a need for generic architectures with better performance. This paper introduces the Dimension-Based Generic Convolution Block (DBGC), which can be used with any CNN to make the architecture generic and provide a dimension-wise selection of various height, width, and depth kernels. This single unit, which uses the separable convolution concept, provides multiple combinations using various dimension-based kernels. It can be used for a single dimension (height, width, or depth), for any pair of dimensions (height and width, width and depth, or depth and height), or for all three dimensions together. The main novelty of DBGC lies in the dimension selector block included in the proposed architecture. The proposed unoptimized kernel dimensions reduce FLOPs by around one third but also reduce accuracy by around one half; semi-optimized kernel dimensions yield almost the same or higher accuracy with half the FLOPs of the original architecture; and optimized kernel dimensions provide 5 to 6% higher accuracy with a reduction of around 10 M FLOPs.
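As a rough illustration of why selecting separable, dimension-wise kernels can lower the operation count, the sketch below compares multiply-accumulate (MAC) counts for a standard k x k convolution with a generic factorisation into a height kernel (k x 1 per channel), a width kernel (1 x k per channel), and a depth kernel (1 x 1 across channels). This is not the DBGC block from the paper and does not reproduce its reported figures. The names ConvShape, standard_macs, and dimension_wise_macs are hypothetical, and the counts assume stride 1 with "same" padding so that every layer produces height x width output positions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConvShape:
    """Hypothetical layer description (not taken from the paper)."""
    height: int        # output feature-map height
    width: int         # output feature-map width
    in_channels: int
    out_channels: int
    kernel: int        # kernel size k along a spatial dimension


def standard_macs(s: ConvShape) -> int:
    """MACs of an ordinary k x k convolution over all input channels."""
    return (s.height * s.width * s.in_channels
            * s.out_channels * s.kernel * s.kernel)


def dimension_wise_macs(s: ConvShape, dims: set) -> int:
    """MACs when only the selected dimension-wise kernels are applied.

    dims is any non-empty subset of {"h", "w", "d"}:
      "h": k x 1 kernel applied to each input channel separately
      "w": 1 x k kernel applied to each input channel separately
      "d": 1 x 1 kernel mixing input channels into output channels
    """
    unknown = set(dims) - {"h", "w", "d"}
    if unknown or not dims:
        raise ValueError(f"dims must be a non-empty subset of h/w/d, got {dims}")
    positions = s.height * s.width
    total = 0
    if "h" in dims:
        total += positions * s.in_channels * s.kernel
    if "w" in dims:
        total += positions * s.in_channels * s.kernel
    if "d" in dims:
        total += positions * s.in_channels * s.out_channels
    return total


if __name__ == "__main__":
    shape = ConvShape(height=32, width=32, in_channels=64,
                      out_channels=128, kernel=3)
    full = standard_macs(shape)
    for dims in ({"h"}, {"h", "w"}, {"h", "w", "d"}):
        macs = dimension_wise_macs(shape, dims)
        print(sorted(dims), macs, f"{macs / full:.3f} of standard")
```

Under these assumptions the cost of each selected dimension simply adds up, so the choice of dimensions trades operation count against the amount of spatial and cross-channel mixing the layer performs; accuracy effects cannot be inferred from the count alone.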
format Online
Article
Text
id pubmed-8914730
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-8914730 2022-03-12 DBGC: Dimension-Based Generic Convolution Block for Object Recognition Sensors (Basel) Article MDPI 2022-02-24 /pmc/articles/PMC8914730/ /pubmed/35270929 http://dx.doi.org/10.3390/s22051780 Text en © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title DBGC: Dimension-Based Generic Convolution Block for Object Recognition
topic Article