Cargando…
Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce
A back-propagation (BP) neural network can solve complicated random nonlinear mapping problems; therefore, it can be applied to a wide range of problems. However, as the sample size increases, the time required to train BP neural networks becomes lengthy. Moreover, the classification accuracy decrea...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4909218/ https://www.ncbi.nlm.nih.gov/pubmed/27304987 http://dx.doi.org/10.1371/journal.pone.0157551 |
_version_ | 1782437801156935680 |
---|---|
author | Cao, Jianfang Cui, Hongyan Shi, Hao Jiao, Lijuan |
author_facet | Cao, Jianfang Cui, Hongyan Shi, Hao Jiao, Lijuan |
author_sort | Cao, Jianfang |
collection | PubMed |
description | A back-propagation (BP) neural network can solve complicated random nonlinear mapping problems; therefore, it can be applied to a wide range of problems. However, as the sample size increases, the time required to train BP neural networks becomes lengthy. Moreover, the classification accuracy decreases as well. To improve the classification accuracy and runtime efficiency of the BP neural network algorithm, we proposed a parallel design and realization method for a particle swarm optimization (PSO)-optimized BP neural network based on MapReduce on the Hadoop platform using both the PSO algorithm and a parallel design. The PSO algorithm was used to optimize the BP neural network’s initial weights and thresholds and improve the accuracy of the classification algorithm. The MapReduce parallel programming model was utilized to achieve parallel processing of the BP algorithm, thereby solving the problems of hardware and communication overhead when the BP neural network addresses big data. Datasets on 5 different scales were constructed using the scene image library from the SUN Database. The classification accuracy of the parallel PSO-BP neural network algorithm is approximately 92%, and the system efficiency is approximately 0.85, which presents obvious advantages when processing big data. The algorithm proposed in this study demonstrated both higher classification accuracy and improved time efficiency, which represents a significant improvement obtained from applying parallel processing to an intelligent algorithm on big data. |
format | Online Article Text |
id | pubmed-4909218 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-49092182016-07-06 Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce Cao, Jianfang Cui, Hongyan Shi, Hao Jiao, Lijuan PLoS One Research Article A back-propagation (BP) neural network can solve complicated random nonlinear mapping problems; therefore, it can be applied to a wide range of problems. However, as the sample size increases, the time required to train BP neural networks becomes lengthy. Moreover, the classification accuracy decreases as well. To improve the classification accuracy and runtime efficiency of the BP neural network algorithm, we proposed a parallel design and realization method for a particle swarm optimization (PSO)-optimized BP neural network based on MapReduce on the Hadoop platform using both the PSO algorithm and a parallel design. The PSO algorithm was used to optimize the BP neural network’s initial weights and thresholds and improve the accuracy of the classification algorithm. The MapReduce parallel programming model was utilized to achieve parallel processing of the BP algorithm, thereby solving the problems of hardware and communication overhead when the BP neural network addresses big data. Datasets on 5 different scales were constructed using the scene image library from the SUN Database. The classification accuracy of the parallel PSO-BP neural network algorithm is approximately 92%, and the system efficiency is approximately 0.85, which presents obvious advantages when processing big data. The algorithm proposed in this study demonstrated both higher classification accuracy and improved time efficiency, which represents a significant improvement obtained from applying parallel processing to an intelligent algorithm on big data. Public Library of Science 2016-06-15 /pmc/articles/PMC4909218/ /pubmed/27304987 http://dx.doi.org/10.1371/journal.pone.0157551 Text en © 2016 Cao et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Cao, Jianfang Cui, Hongyan Shi, Hao Jiao, Lijuan Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce |
title | Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce |
title_full | Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce |
title_fullStr | Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce |
title_full_unstemmed | Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce |
title_short | Big Data: A Parallel Particle Swarm Optimization-Back-Propagation Neural Network Algorithm Based on MapReduce |
title_sort | big data: a parallel particle swarm optimization-back-propagation neural network algorithm based on mapreduce |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4909218/ https://www.ncbi.nlm.nih.gov/pubmed/27304987 http://dx.doi.org/10.1371/journal.pone.0157551 |
work_keys_str_mv | AT caojianfang bigdataaparallelparticleswarmoptimizationbackpropagationneuralnetworkalgorithmbasedonmapreduce AT cuihongyan bigdataaparallelparticleswarmoptimizationbackpropagationneuralnetworkalgorithmbasedonmapreduce AT shihao bigdataaparallelparticleswarmoptimizationbackpropagationneuralnetworkalgorithmbasedonmapreduce AT jiaolijuan bigdataaparallelparticleswarmoptimizationbackpropagationneuralnetworkalgorithmbasedonmapreduce |