Cargando…

Hybrid Bag-of-Visual-Words and FeatureWiz Selection for Content-Based Visual Information Retrieval

Recently, content-based image retrieval (CBIR) based on bag-of-visual-words (BoVW) model has been one of the most promising and increasingly active research areas. In this paper, we propose a new CBIR framework based on the visual words fusion of multiple feature descriptors to achieve an improved r...

Descripción completa

Detalles Bibliográficos
Autores principales: Bakheet, Samy, Al-Hamadi, Ayoub, Soliman, Emadeldeen, Heshmat, Mohamed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9919877/
https://www.ncbi.nlm.nih.gov/pubmed/36772705
http://dx.doi.org/10.3390/s23031653
_version_ 1784886932614938624
author Bakheet, Samy
Al-Hamadi, Ayoub
Soliman, Emadeldeen
Heshmat, Mohamed
author_facet Bakheet, Samy
Al-Hamadi, Ayoub
Soliman, Emadeldeen
Heshmat, Mohamed
author_sort Bakheet, Samy
collection PubMed
description Recently, content-based image retrieval (CBIR) based on bag-of-visual-words (BoVW) model has been one of the most promising and increasingly active research areas. In this paper, we propose a new CBIR framework based on the visual words fusion of multiple feature descriptors to achieve an improved retrieval performance, where interest points are separately extracted from an image using features from accelerated segment test (FAST) and speeded-up robust features (SURF). The extracted keypoints are then fused together in a single keypoint feature vector and the improved RootSIFT algorithm is applied to describe the region surrounding each keypoint. Afterward, the FeatureWiz algorithm is employed to reduce features and select the best features for the BoVW learning model. To create the codebook, K-means clustering is applied to quantize visual features into a smaller set of visual words. Finally, the feature vectors extracted from the BoVW model are fed into a support vector machines (SVMs) classifier for image retrieval. An inverted index technique based on cosine distance metric is applied to sort the retrieved images to the similarity of the query image. Experiments on three benchmark datasets (Corel-1000, Caltech-10 and Oxford Flower-17) show that the presented CBIR technique can deliver comparable results to other state-of-the-art techniques, by achieving average accuracies of 92.94%, 98.40% and 84.94% on these datasets, respectively.
format Online
Article
Text
id pubmed-9919877
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99198772023-02-12 Hybrid Bag-of-Visual-Words and FeatureWiz Selection for Content-Based Visual Information Retrieval Bakheet, Samy Al-Hamadi, Ayoub Soliman, Emadeldeen Heshmat, Mohamed Sensors (Basel) Article Recently, content-based image retrieval (CBIR) based on bag-of-visual-words (BoVW) model has been one of the most promising and increasingly active research areas. In this paper, we propose a new CBIR framework based on the visual words fusion of multiple feature descriptors to achieve an improved retrieval performance, where interest points are separately extracted from an image using features from accelerated segment test (FAST) and speeded-up robust features (SURF). The extracted keypoints are then fused together in a single keypoint feature vector and the improved RootSIFT algorithm is applied to describe the region surrounding each keypoint. Afterward, the FeatureWiz algorithm is employed to reduce features and select the best features for the BoVW learning model. To create the codebook, K-means clustering is applied to quantize visual features into a smaller set of visual words. Finally, the feature vectors extracted from the BoVW model are fed into a support vector machines (SVMs) classifier for image retrieval. An inverted index technique based on cosine distance metric is applied to sort the retrieved images to the similarity of the query image. Experiments on three benchmark datasets (Corel-1000, Caltech-10 and Oxford Flower-17) show that the presented CBIR technique can deliver comparable results to other state-of-the-art techniques, by achieving average accuracies of 92.94%, 98.40% and 84.94% on these datasets, respectively. MDPI 2023-02-02 /pmc/articles/PMC9919877/ /pubmed/36772705 http://dx.doi.org/10.3390/s23031653 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Bakheet, Samy
Al-Hamadi, Ayoub
Soliman, Emadeldeen
Heshmat, Mohamed
Hybrid Bag-of-Visual-Words and FeatureWiz Selection for Content-Based Visual Information Retrieval
title Hybrid Bag-of-Visual-Words and FeatureWiz Selection for Content-Based Visual Information Retrieval
title_full Hybrid Bag-of-Visual-Words and FeatureWiz Selection for Content-Based Visual Information Retrieval
title_fullStr Hybrid Bag-of-Visual-Words and FeatureWiz Selection for Content-Based Visual Information Retrieval
title_full_unstemmed Hybrid Bag-of-Visual-Words and FeatureWiz Selection for Content-Based Visual Information Retrieval
title_short Hybrid Bag-of-Visual-Words and FeatureWiz Selection for Content-Based Visual Information Retrieval
title_sort hybrid bag-of-visual-words and featurewiz selection for content-based visual information retrieval
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9919877/
https://www.ncbi.nlm.nih.gov/pubmed/36772705
http://dx.doi.org/10.3390/s23031653
work_keys_str_mv AT bakheetsamy hybridbagofvisualwordsandfeaturewizselectionforcontentbasedvisualinformationretrieval
AT alhamadiayoub hybridbagofvisualwordsandfeaturewizselectionforcontentbasedvisualinformationretrieval
AT solimanemadeldeen hybridbagofvisualwordsandfeaturewizselectionforcontentbasedvisualinformationretrieval
AT heshmatmohamed hybridbagofvisualwordsandfeaturewizselectionforcontentbasedvisualinformationretrieval