Cargando…

Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce

A back-propagation (BP) neural network can solve complicated random nonlinear mapping problems; therefore, it can be applied to a wide range of problems. However, as the sample size increases, the time required to train BP neural networks becomes lengthy. Moreover, the classification accuracy decrea...

Descripción completa

Detalles Bibliográficos
Autores principales: Cao, Jianfang, Cui, Hongyan, Shi, Hao, Jiao, Lijuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4909218/
https://www.ncbi.nlm.nih.gov/pubmed/27304987
http://dx.doi.org/10.1371/journal.pone.0157551
_version_ 1782437801156935680
author Cao, Jianfang
Cui, Hongyan
Shi, Hao
Jiao, Lijuan
author_facet Cao, Jianfang
Cui, Hongyan
Shi, Hao
Jiao, Lijuan
author_sort Cao, Jianfang
collection PubMed
description A back-propagation (BP) neural network can solve complicated random nonlinear mapping problems; therefore, it can be applied to a wide range of problems. However, as the sample size increases, the time required to train BP neural networks becomes lengthy. Moreover, the classification accuracy decreases as well. To improve the classification accuracy and runtime efficiency of the BP neural network algorithm, we proposed a parallel design and realization method for a particle swarm optimization (PSO)-optimized BP neural network based on MapReduce on the Hadoop platform using both the PSO algorithm and a parallel design. The PSO algorithm was used to optimize the BP neural network’s initial weights and thresholds and improve the accuracy of the classification algorithm. The MapReduce parallel programming model was utilized to achieve parallel processing of the BP algorithm, thereby solving the problems of hardware and communication overhead when the BP neural network addresses big data. Datasets on 5 different scales were constructed using the scene image library from the SUN Database. The classification accuracy of the parallel PSO-BP neural network algorithm is approximately 92%, and the system efficiency is approximately 0.85, which presents obvious advantages when processing big data. The algorithm proposed in this study demonstrated both higher classification accuracy and improved time efficiency, which represents a significant improvement obtained from applying parallel processing to an intelligent algorithm on big data.
format Online
Article
Text
id pubmed-4909218
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-49092182016-07-06 Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce Cao, Jianfang Cui, Hongyan Shi, Hao Jiao, Lijuan PLoS One Research Article A back-propagation (BP) neural network can solve complicated random nonlinear mapping problems; therefore, it can be applied to a wide range of problems. However, as the sample size increases, the time required to train BP neural networks becomes lengthy. Moreover, the classification accuracy decreases as well. To improve the classification accuracy and runtime efficiency of the BP neural network algorithm, we proposed a parallel design and realization method for a particle swarm optimization (PSO)-optimized BP neural network based on MapReduce on the Hadoop platform using both the PSO algorithm and a parallel design. The PSO algorithm was used to optimize the BP neural network’s initial weights and thresholds and improve the accuracy of the classification algorithm. The MapReduce parallel programming model was utilized to achieve parallel processing of the BP algorithm, thereby solving the problems of hardware and communication overhead when the BP neural network addresses big data. Datasets on 5 different scales were constructed using the scene image library from the SUN Database. The classification accuracy of the parallel PSO-BP neural network algorithm is approximately 92%, and the system efficiency is approximately 0.85, which presents obvious advantages when processing big data. The algorithm proposed in this study demonstrated both higher classification accuracy and improved time efficiency, which represents a significant improvement obtained from applying parallel processing to an intelligent algorithm on big data. Public Library of Science 2016-06-15 /pmc/articles/PMC4909218/ /pubmed/27304987 http://dx.doi.org/10.1371/journal.pone.0157551 Text en © 2016 Cao et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Cao, Jianfang
Cui, Hongyan
Shi, Hao
Jiao, Lijuan
Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce
title Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce
title_full Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce
title_fullStr Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce
title_full_unstemmed Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce
title_short Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce
title_sort big data: a parallel particle swarm optimization-back-propagation neural network algorithm based on mapreduce
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4909218/
https://www.ncbi.nlm.nih.gov/pubmed/27304987
http://dx.doi.org/10.1371/journal.pone.0157551
work_keys_str_mv AT caojianfang bigdataaparallelparticleswarmoptimizationbackpropagationneuralnetworkalgorithmbasedonmapreduce
AT cuihongyan bigdataaparallelparticleswarmoptimizationbackpropagationneuralnetworkalgorithmbasedonmapreduce
AT shihao bigdataaparallelparticleswarmoptimizationbackpropagationneuralnetworkalgorithmbasedonmapreduce
AT jiaolijuan bigdataaparallelparticleswarmoptimizationbackpropagationneuralnetworkalgorithmbasedonmapreduce