Cargando…

Improved support vector machine classification algorithm based on adaptive feature weight updating in the Hadoop cluster environment

An image classification algorithm based on adaptive feature weight updating is proposed to address the low classification accuracy of the current single-feature classification algorithms and simple multifeature fusion algorithms. The MapReduce parallel programming model on the Hadoop platform is use...

Descripción completa

Detalles Bibliográficos
Autores principales: Cao, Jianfang, Wang, Min, Li, Yanfei, Zhang, Qi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6457544/
https://www.ncbi.nlm.nih.gov/pubmed/30970014
http://dx.doi.org/10.1371/journal.pone.0215136
_version_ 1783409916942221312
author Cao, Jianfang
Wang, Min
Li, Yanfei
Zhang, Qi
author_facet Cao, Jianfang
Wang, Min
Li, Yanfei
Zhang, Qi
author_sort Cao, Jianfang
collection PubMed
description An image classification algorithm based on adaptive feature weight updating is proposed to address the low classification accuracy of the current single-feature classification algorithms and simple multifeature fusion algorithms. The MapReduce parallel programming model on the Hadoop platform is used to perform an adaptive fusion of hue, local binary pattern (LBP) and scale-invariant feature transform (SIFT) features extracted from images to derive optimal combinations of weights. The support vector machine (SVM) classifier is then used to perform parallel training to obtain the optimal SVM classification model, which is then tested. The Pascal VOC 2012, Caltech 256 and SUN databases are adopted to build a massive image library. The speedup, classification accuracy and training time are tested in the experiment, and the results show that a linear growth tendency is present in the speedup of the system in a cluster environment. In consideration of the hardware costs, time, performance and accuracy, the algorithm is superior to mainstream classification algorithms, such as the power mean SVM and convolutional neural network (CNN). As the number and types of images both increase, the classification accuracy rate exceeds 95%. When the number of images reaches 80,000, the training time of the proposed algorithm is only 1/5 that of traditional single-node architecture algorithms. This result reflects the effectiveness of the algorithm, which provides a basis for the effective analysis and processing of image big data.
format Online
Article
Text
id pubmed-6457544
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-64575442019-05-03 Improved support vector machine classification algorithm based on adaptive feature weight updating in the Hadoop cluster environment Cao, Jianfang Wang, Min Li, Yanfei Zhang, Qi PLoS One Research Article An image classification algorithm based on adaptive feature weight updating is proposed to address the low classification accuracy of the current single-feature classification algorithms and simple multifeature fusion algorithms. The MapReduce parallel programming model on the Hadoop platform is used to perform an adaptive fusion of hue, local binary pattern (LBP) and scale-invariant feature transform (SIFT) features extracted from images to derive optimal combinations of weights. The support vector machine (SVM) classifier is then used to perform parallel training to obtain the optimal SVM classification model, which is then tested. The Pascal VOC 2012, Caltech 256 and SUN databases are adopted to build a massive image library. The speedup, classification accuracy and training time are tested in the experiment, and the results show that a linear growth tendency is present in the speedup of the system in a cluster environment. In consideration of the hardware costs, time, performance and accuracy, the algorithm is superior to mainstream classification algorithms, such as the power mean SVM and convolutional neural network (CNN). As the number and types of images both increase, the classification accuracy rate exceeds 95%. When the number of images reaches 80,000, the training time of the proposed algorithm is only 1/5 that of traditional single-node architecture algorithms. This result reflects the effectiveness of the algorithm, which provides a basis for the effective analysis and processing of image big data. Public Library of Science 2019-04-10 /pmc/articles/PMC6457544/ /pubmed/30970014 http://dx.doi.org/10.1371/journal.pone.0215136 Text en © 2019 Cao et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Cao, Jianfang
Wang, Min
Li, Yanfei
Zhang, Qi
Improved support vector machine classification algorithm based on adaptive feature weight updating in the Hadoop cluster environment
title Improved support vector machine classification algorithm based on adaptive feature weight updating in the Hadoop cluster environment
title_full Improved support vector machine classification algorithm based on adaptive feature weight updating in the Hadoop cluster environment
title_fullStr Improved support vector machine classification algorithm based on adaptive feature weight updating in the Hadoop cluster environment
title_full_unstemmed Improved support vector machine classification algorithm based on adaptive feature weight updating in the Hadoop cluster environment
title_short Improved support vector machine classification algorithm based on adaptive feature weight updating in the Hadoop cluster environment
title_sort improved support vector machine classification algorithm based on adaptive feature weight updating in the hadoop cluster environment
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6457544/
https://www.ncbi.nlm.nih.gov/pubmed/30970014
http://dx.doi.org/10.1371/journal.pone.0215136
work_keys_str_mv AT caojianfang improvedsupportvectormachineclassificationalgorithmbasedonadaptivefeatureweightupdatinginthehadoopclusterenvironment
AT wangmin improvedsupportvectormachineclassificationalgorithmbasedonadaptivefeatureweightupdatinginthehadoopclusterenvironment
AT liyanfei improvedsupportvectormachineclassificationalgorithmbasedonadaptivefeatureweightupdatinginthehadoopclusterenvironment
AT zhangqi improvedsupportvectormachineclassificationalgorithmbasedonadaptivefeatureweightupdatinginthehadoopclusterenvironment