Cargando…

EnsemV3X: a novel ensembled deep learning architecture for multi-label scene classification

Convolutional neural network is widely used to perform the task of image classification, including pretraining, followed by fine-tuning whereby features are adapted to perform the target task, on ImageNet. ImageNet is a large database consisting of 15 million images belonging to 22,000 categories. I...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sobti, Priyal, Nayyar, Anand, Niharika, Nagrath, Preeti
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	PeerJ Inc. 2021
Materias:	Artificial Intelligence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8176534/ https://www.ncbi.nlm.nih.gov/pubmed/34141887 http://dx.doi.org/10.7717/peerj-cs.557

_version_	1783703274679959552
author	Sobti, Priyal Nayyar, Anand Niharika, Nagrath, Preeti
author_facet	Sobti, Priyal Nayyar, Anand Niharika, Nagrath, Preeti
author_sort	Sobti, Priyal
collection	PubMed
description	Convolutional neural network is widely used to perform the task of image classification, including pretraining, followed by fine-tuning whereby features are adapted to perform the target task, on ImageNet. ImageNet is a large database consisting of 15 million images belonging to 22,000 categories. Images collected from the Web are labeled using Amazon Mechanical Turk crowd-sourcing tool by human labelers. ImageNet is useful for transfer learning because of the sheer volume of its dataset and the number of object classes available. Transfer learning using pretrained models is useful because it helps to build computer vision models in an accurate and inexpensive manner. Models that have been pretrained on substantial datasets are used and repurposed for our requirements. Scene recognition is a widely used application of computer vision in many communities and industries, such as tourism. This study aims to show multilabel scene classification using five architectures, namely, VGG16, VGG19, ResNet50, InceptionV3, and Xception using ImageNet weights available in the Keras library. The performance of different architectures is comprehensively compared in the study. Finally, EnsemV3X is presented in this study. The proposed model with reduced number of parameters is superior to state-of-of-the-art models Inception and Xception because it demonstrates an accuracy of 91%.
format	Online Article Text
id	pubmed-8176534
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	PeerJ Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-81765342021-06-16 EnsemV3X: a novel ensembled deep learning architecture for multi-label scene classification Sobti, Priyal Nayyar, Anand Niharika, Nagrath, Preeti PeerJ Comput Sci Artificial Intelligence Convolutional neural network is widely used to perform the task of image classification, including pretraining, followed by fine-tuning whereby features are adapted to perform the target task, on ImageNet. ImageNet is a large database consisting of 15 million images belonging to 22,000 categories. Images collected from the Web are labeled using Amazon Mechanical Turk crowd-sourcing tool by human labelers. ImageNet is useful for transfer learning because of the sheer volume of its dataset and the number of object classes available. Transfer learning using pretrained models is useful because it helps to build computer vision models in an accurate and inexpensive manner. Models that have been pretrained on substantial datasets are used and repurposed for our requirements. Scene recognition is a widely used application of computer vision in many communities and industries, such as tourism. This study aims to show multilabel scene classification using five architectures, namely, VGG16, VGG19, ResNet50, InceptionV3, and Xception using ImageNet weights available in the Keras library. The performance of different architectures is comprehensively compared in the study. Finally, EnsemV3X is presented in this study. The proposed model with reduced number of parameters is superior to state-of-of-the-art models Inception and Xception because it demonstrates an accuracy of 91%. PeerJ Inc. 2021-05-25 /pmc/articles/PMC8176534/ /pubmed/34141887 http://dx.doi.org/10.7717/peerj-cs.557 Text en © 2021 Sobti et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle	Artificial Intelligence Sobti, Priyal Nayyar, Anand Niharika, Nagrath, Preeti EnsemV3X: a novel ensembled deep learning architecture for multi-label scene classification
title	EnsemV3X: a novel ensembled deep learning architecture for multi-label scene classification
title_full	EnsemV3X: a novel ensembled deep learning architecture for multi-label scene classification
title_fullStr	EnsemV3X: a novel ensembled deep learning architecture for multi-label scene classification
title_full_unstemmed	EnsemV3X: a novel ensembled deep learning architecture for multi-label scene classification
title_short	EnsemV3X: a novel ensembled deep learning architecture for multi-label scene classification
title_sort	ensemv3x: a novel ensembled deep learning architecture for multi-label scene classification
topic	Artificial Intelligence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8176534/ https://www.ncbi.nlm.nih.gov/pubmed/34141887 http://dx.doi.org/10.7717/peerj-cs.557
work_keys_str_mv	AT sobtipriyal ensemv3xanovelensembleddeeplearningarchitectureformultilabelsceneclassification AT nayyaranand ensemv3xanovelensembleddeeplearningarchitectureformultilabelsceneclassification AT niharika ensemv3xanovelensembleddeeplearningarchitectureformultilabelsceneclassification AT nagrathpreeti ensemv3xanovelensembleddeeplearningarchitectureformultilabelsceneclassification

EnsemV3X: a novel ensembled deep learning architecture for multi-label scene classification

Ejemplares similares