Cargando…

Load Balancing Algorithms for Hadoop Cluster in Unbalanced Environment

Considering that in the process of job scheduling, the cluster load should be prebalanced rather than remedied when the load is seriously unbalanced; therefore, in this paper, the task scheduling flow of the Hadoop cluster is analyzed deeply. On the Hadoop platform, a self-dividing algorithm is prop...

Descripción completa

Detalles Bibliográficos
Autores principales: Fu, Weiyu, Wang, Lixia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9568311/
https://www.ncbi.nlm.nih.gov/pubmed/36248928
http://dx.doi.org/10.1155/2022/1545024
_version_ 1784809617525571584
author Fu, Weiyu
Wang, Lixia
author_facet Fu, Weiyu
Wang, Lixia
author_sort Fu, Weiyu
collection PubMed
description Considering that in the process of job scheduling, the cluster load should be prebalanced rather than remedied when the load is seriously unbalanced; therefore, in this paper, the task scheduling flow of the Hadoop cluster is analyzed deeply. On the Hadoop platform, a self-dividing algorithm is proposed for load balancing. An intelligent optimization algorithm is used to solve load balance. A dynamic feedback load balancing scheduling method is proposed from the point of view of task scheduling. In order to solve the shortcoming of the fair scheduling algorithm, this paper proposes two ways to improve the resource utilization and overall performance of Hadoop. When the mapping task is completed and the tasks to be reduced are assigned, the task assignment is based on the performance of the nodes to be reduced. It gives full play to the advantages of the ant colony algorithm and the hive colony algorithm so that the fusion algorithm can better deal with load balance. Then, three existing scheduling algorithms are introduced in detail: single queue scheduling, capacity scheduling, and fair scheduling. On this basis, an improved task scheduling strategy based on genetic algorithm is proposed to allocate and execute application tasks to reduce task completion time. The experiment verifies the effectiveness of the algorithm. The LBNP algorithm greatly improves the efficiency of reducing task execution and job execution. The delay capacity scheduling algorithm can ensure that most tasks can achieve localization scheduling, improve resource utilization, improve load balance, and speed up job completion time.
format Online
Article
Text
id pubmed-9568311
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-95683112022-10-15 Load Balancing Algorithms for Hadoop Cluster in Unbalanced Environment Fu, Weiyu Wang, Lixia Comput Intell Neurosci Research Article Considering that in the process of job scheduling, the cluster load should be prebalanced rather than remedied when the load is seriously unbalanced; therefore, in this paper, the task scheduling flow of the Hadoop cluster is analyzed deeply. On the Hadoop platform, a self-dividing algorithm is proposed for load balancing. An intelligent optimization algorithm is used to solve load balance. A dynamic feedback load balancing scheduling method is proposed from the point of view of task scheduling. In order to solve the shortcoming of the fair scheduling algorithm, this paper proposes two ways to improve the resource utilization and overall performance of Hadoop. When the mapping task is completed and the tasks to be reduced are assigned, the task assignment is based on the performance of the nodes to be reduced. It gives full play to the advantages of the ant colony algorithm and the hive colony algorithm so that the fusion algorithm can better deal with load balance. Then, three existing scheduling algorithms are introduced in detail: single queue scheduling, capacity scheduling, and fair scheduling. On this basis, an improved task scheduling strategy based on genetic algorithm is proposed to allocate and execute application tasks to reduce task completion time. The experiment verifies the effectiveness of the algorithm. The LBNP algorithm greatly improves the efficiency of reducing task execution and job execution. The delay capacity scheduling algorithm can ensure that most tasks can achieve localization scheduling, improve resource utilization, improve load balance, and speed up job completion time. Hindawi 2022-10-07 /pmc/articles/PMC9568311/ /pubmed/36248928 http://dx.doi.org/10.1155/2022/1545024 Text en Copyright © 2022 Weiyu Fu and Lixia Wang. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Fu, Weiyu
Wang, Lixia
Load Balancing Algorithms for Hadoop Cluster in Unbalanced Environment
title Load Balancing Algorithms for Hadoop Cluster in Unbalanced Environment
title_full Load Balancing Algorithms for Hadoop Cluster in Unbalanced Environment
title_fullStr Load Balancing Algorithms for Hadoop Cluster in Unbalanced Environment
title_full_unstemmed Load Balancing Algorithms for Hadoop Cluster in Unbalanced Environment
title_short Load Balancing Algorithms for Hadoop Cluster in Unbalanced Environment
title_sort load balancing algorithms for hadoop cluster in unbalanced environment
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9568311/
https://www.ncbi.nlm.nih.gov/pubmed/36248928
http://dx.doi.org/10.1155/2022/1545024
work_keys_str_mv AT fuweiyu loadbalancingalgorithmsforhadoopclusterinunbalancedenvironment
AT wanglixia loadbalancingalgorithmsforhadoopclusterinunbalancedenvironment