Cargando…

An improved ant colony optimization algorithm with fault tolerance for job scheduling in grid computing systems

The Grid scheduler, schedules user jobs on the best available resource in terms of resource characteristics by optimizing job execution time. Resource failure in Grid is no longer an exception but a regular occurring event as resources are increasingly being used by the scientific community to solve...

Descripción completa

Detalles Bibliográficos
Autores principales:	Idris, Hajara, Ezugwu, Absalom E., Junaidu, Sahalu B., Adewumi, Aderemi O.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2017
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5435234/ https://www.ncbi.nlm.nih.gov/pubmed/28545075 http://dx.doi.org/10.1371/journal.pone.0177567

_version_	1783237199157788672
author	Idris, Hajara Ezugwu, Absalom E. Junaidu, Sahalu B. Adewumi, Aderemi O.
author_facet	Idris, Hajara Ezugwu, Absalom E. Junaidu, Sahalu B. Adewumi, Aderemi O.
author_sort	Idris, Hajara
collection	PubMed
description	The Grid scheduler, schedules user jobs on the best available resource in terms of resource characteristics by optimizing job execution time. Resource failure in Grid is no longer an exception but a regular occurring event as resources are increasingly being used by the scientific community to solve computationally intensive problems which typically run for days or even months. It is therefore absolutely essential that these long-running applications are able to tolerate failures and avoid re-computations from scratch after resource failure has occurred, to satisfy the user’s Quality of Service (QoS) requirement. Job Scheduling with Fault Tolerance in Grid Computing using Ant Colony Optimization is proposed to ensure that jobs are executed successfully even when resource failure has occurred. The technique employed in this paper, is the use of resource failure rate, as well as checkpoint-based roll back recovery strategy. Check-pointing aims at reducing the amount of work that is lost upon failure of the system by immediately saving the state of the system. A comparison of the proposed approach with an existing Ant Colony Optimization (ACO) algorithm is discussed. The experimental results of the implemented Fault Tolerance scheduling algorithm show that there is an improvement in the user’s QoS requirement over the existing ACO algorithm, which has no fault tolerance integrated in it. The performance evaluation of the two algorithms was measured in terms of the three main scheduling performance metrics: makespan, throughput and average turnaround time.
format	Online Article Text
id	pubmed-5435234
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-54352342017-05-26 An improved ant colony optimization algorithm with fault tolerance for job scheduling in grid computing systems Idris, Hajara Ezugwu, Absalom E. Junaidu, Sahalu B. Adewumi, Aderemi O. PLoS One Research Article The Grid scheduler, schedules user jobs on the best available resource in terms of resource characteristics by optimizing job execution time. Resource failure in Grid is no longer an exception but a regular occurring event as resources are increasingly being used by the scientific community to solve computationally intensive problems which typically run for days or even months. It is therefore absolutely essential that these long-running applications are able to tolerate failures and avoid re-computations from scratch after resource failure has occurred, to satisfy the user’s Quality of Service (QoS) requirement. Job Scheduling with Fault Tolerance in Grid Computing using Ant Colony Optimization is proposed to ensure that jobs are executed successfully even when resource failure has occurred. The technique employed in this paper, is the use of resource failure rate, as well as checkpoint-based roll back recovery strategy. Check-pointing aims at reducing the amount of work that is lost upon failure of the system by immediately saving the state of the system. A comparison of the proposed approach with an existing Ant Colony Optimization (ACO) algorithm is discussed. The experimental results of the implemented Fault Tolerance scheduling algorithm show that there is an improvement in the user’s QoS requirement over the existing ACO algorithm, which has no fault tolerance integrated in it. The performance evaluation of the two algorithms was measured in terms of the three main scheduling performance metrics: makespan, throughput and average turnaround time. Public Library of Science 2017-05-17 /pmc/articles/PMC5435234/ /pubmed/28545075 http://dx.doi.org/10.1371/journal.pone.0177567 Text en © 2017 Idris et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Idris, Hajara Ezugwu, Absalom E. Junaidu, Sahalu B. Adewumi, Aderemi O. An improved ant colony optimization algorithm with fault tolerance for job scheduling in grid computing systems
title	An improved ant colony optimization algorithm with fault tolerance for job scheduling in grid computing systems
title_full	An improved ant colony optimization algorithm with fault tolerance for job scheduling in grid computing systems
title_fullStr	An improved ant colony optimization algorithm with fault tolerance for job scheduling in grid computing systems
title_full_unstemmed	An improved ant colony optimization algorithm with fault tolerance for job scheduling in grid computing systems
title_short	An improved ant colony optimization algorithm with fault tolerance for job scheduling in grid computing systems
title_sort	improved ant colony optimization algorithm with fault tolerance for job scheduling in grid computing systems
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5435234/ https://www.ncbi.nlm.nih.gov/pubmed/28545075 http://dx.doi.org/10.1371/journal.pone.0177567
work_keys_str_mv	AT idrishajara animprovedantcolonyoptimizationalgorithmwithfaulttoleranceforjobschedulingingridcomputingsystems AT ezugwuabsalome animprovedantcolonyoptimizationalgorithmwithfaulttoleranceforjobschedulingingridcomputingsystems AT junaidusahalub animprovedantcolonyoptimizationalgorithmwithfaulttoleranceforjobschedulingingridcomputingsystems AT adewumiaderemio animprovedantcolonyoptimizationalgorithmwithfaulttoleranceforjobschedulingingridcomputingsystems AT idrishajara improvedantcolonyoptimizationalgorithmwithfaulttoleranceforjobschedulingingridcomputingsystems AT ezugwuabsalome improvedantcolonyoptimizationalgorithmwithfaulttoleranceforjobschedulingingridcomputingsystems AT junaidusahalub improvedantcolonyoptimizationalgorithmwithfaulttoleranceforjobschedulingingridcomputingsystems AT adewumiaderemio improvedantcolonyoptimizationalgorithmwithfaulttoleranceforjobschedulingingridcomputingsystems

An improved ant colony optimization algorithm with fault tolerance for job scheduling in grid computing systems

Ejemplares similares