Cargando…

Fault-Tolerant and Data-Intensive Resource Scheduling and Management for Scientific Applications in Cloud Computing

Cloud computing is a fully fledged, matured and flexible computing paradigm that provides services to scientific and business applications in a subscription-based environment. Scientific applications such as Montage and CyberShake are organized scientific workflows with data and compute-intensive ta...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahmad, Zulfiqar, Jehangiri, Ali Imran, Ala’anzy, Mohammed Alaa, Othman, Mohamed, Umar, Arif Iqbal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8587021/
https://www.ncbi.nlm.nih.gov/pubmed/34770545
http://dx.doi.org/10.3390/s21217238
_version_ 1784598008220876800
author Ahmad, Zulfiqar
Jehangiri, Ali Imran
Ala’anzy, Mohammed Alaa
Othman, Mohamed
Umar, Arif Iqbal
author_facet Ahmad, Zulfiqar
Jehangiri, Ali Imran
Ala’anzy, Mohammed Alaa
Othman, Mohamed
Umar, Arif Iqbal
author_sort Ahmad, Zulfiqar
collection PubMed
description Cloud computing is a fully fledged, matured and flexible computing paradigm that provides services to scientific and business applications in a subscription-based environment. Scientific applications such as Montage and CyberShake are organized scientific workflows with data and compute-intensive tasks and also have some special characteristics. These characteristics include the tasks of scientific workflows that are executed in terms of integration, disintegration, pipeline, and parallelism, and thus require special attention to task management and data-oriented resource scheduling and management. The tasks executed during pipeline are considered as bottleneck executions, the failure of which result in the wholly futile execution, which requires a fault-tolerant-aware execution. The tasks executed during parallelism require similar instances of cloud resources, and thus, cluster-based execution may upgrade the system performance in terms of make-span and execution cost. Therefore, this research work presents a cluster-based, fault-tolerant and data-intensive (CFD) scheduling for scientific applications in cloud environments. The CFD strategy addresses the data intensiveness of tasks of scientific workflows with cluster-based, fault-tolerant mechanisms. The Montage scientific workflow is considered as a simulation and the results of the CFD strategy were compared with three well-known heuristic scheduling policies: (a) MCT, (b) Max-min, and (c) Min-min. The simulation results showed that the CFD strategy reduced the make-span by 14.28%, 20.37%, and 11.77%, respectively, as compared with the existing three policies. Similarly, the CFD reduces the execution cost by 1.27%, 5.3%, and 2.21%, respectively, as compared with the existing three policies. In case of the CFD strategy, the SLA is not violated with regard to time and cost constraints, whereas it is violated by the existing policies numerous times.
format Online
Article
Text
id pubmed-8587021
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85870212021-11-13 Fault-Tolerant and Data-Intensive Resource Scheduling and Management for Scientific Applications in Cloud Computing Ahmad, Zulfiqar Jehangiri, Ali Imran Ala’anzy, Mohammed Alaa Othman, Mohamed Umar, Arif Iqbal Sensors (Basel) Article Cloud computing is a fully fledged, matured and flexible computing paradigm that provides services to scientific and business applications in a subscription-based environment. Scientific applications such as Montage and CyberShake are organized scientific workflows with data and compute-intensive tasks and also have some special characteristics. These characteristics include the tasks of scientific workflows that are executed in terms of integration, disintegration, pipeline, and parallelism, and thus require special attention to task management and data-oriented resource scheduling and management. The tasks executed during pipeline are considered as bottleneck executions, the failure of which result in the wholly futile execution, which requires a fault-tolerant-aware execution. The tasks executed during parallelism require similar instances of cloud resources, and thus, cluster-based execution may upgrade the system performance in terms of make-span and execution cost. Therefore, this research work presents a cluster-based, fault-tolerant and data-intensive (CFD) scheduling for scientific applications in cloud environments. The CFD strategy addresses the data intensiveness of tasks of scientific workflows with cluster-based, fault-tolerant mechanisms. The Montage scientific workflow is considered as a simulation and the results of the CFD strategy were compared with three well-known heuristic scheduling policies: (a) MCT, (b) Max-min, and (c) Min-min. The simulation results showed that the CFD strategy reduced the make-span by 14.28%, 20.37%, and 11.77%, respectively, as compared with the existing three policies. Similarly, the CFD reduces the execution cost by 1.27%, 5.3%, and 2.21%, respectively, as compared with the existing three policies. In case of the CFD strategy, the SLA is not violated with regard to time and cost constraints, whereas it is violated by the existing policies numerous times. MDPI 2021-10-30 /pmc/articles/PMC8587021/ /pubmed/34770545 http://dx.doi.org/10.3390/s21217238 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ahmad, Zulfiqar
Jehangiri, Ali Imran
Ala’anzy, Mohammed Alaa
Othman, Mohamed
Umar, Arif Iqbal
Fault-Tolerant and Data-Intensive Resource Scheduling and Management for Scientific Applications in Cloud Computing
title Fault-Tolerant and Data-Intensive Resource Scheduling and Management for Scientific Applications in Cloud Computing
title_full Fault-Tolerant and Data-Intensive Resource Scheduling and Management for Scientific Applications in Cloud Computing
title_fullStr Fault-Tolerant and Data-Intensive Resource Scheduling and Management for Scientific Applications in Cloud Computing
title_full_unstemmed Fault-Tolerant and Data-Intensive Resource Scheduling and Management for Scientific Applications in Cloud Computing
title_short Fault-Tolerant and Data-Intensive Resource Scheduling and Management for Scientific Applications in Cloud Computing
title_sort fault-tolerant and data-intensive resource scheduling and management for scientific applications in cloud computing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8587021/
https://www.ncbi.nlm.nih.gov/pubmed/34770545
http://dx.doi.org/10.3390/s21217238
work_keys_str_mv AT ahmadzulfiqar faulttolerantanddataintensiveresourceschedulingandmanagementforscientificapplicationsincloudcomputing
AT jehangirialiimran faulttolerantanddataintensiveresourceschedulingandmanagementforscientificapplicationsincloudcomputing
AT alaanzymohammedalaa faulttolerantanddataintensiveresourceschedulingandmanagementforscientificapplicationsincloudcomputing
AT othmanmohamed faulttolerantanddataintensiveresourceschedulingandmanagementforscientificapplicationsincloudcomputing
AT umararifiqbal faulttolerantanddataintensiveresourceschedulingandmanagementforscientificapplicationsincloudcomputing