Cargando…

Processing moldable tasks on the grid: Late job binding with lightweight user-level overlay

Independent observations and everyday user experience indicate that performance and reliability of large grid infrastructures may suffer from large and unpredictable variations. In this paper we study the impact of the job queuing time on processing of moldable tasks which are commonly found in larg...

Descripción completa

Detalles Bibliográficos
Autores principales: Moscicki, J T, Bubak, M, Sloot, P M A, Lamanna, M
Lenguaje:eng
Publicado: 2011
Materias:
XX
Acceso en línea:https://dx.doi.org/10.1016/j.future.2011.02.002
http://cds.cern.ch/record/1399779
_version_ 1780923623107723264
author Moscicki, J T
Bubak, M
Sloot, P M A
Lamanna, M
author_facet Moscicki, J T
Bubak, M
Sloot, P M A
Lamanna, M
author_sort Moscicki, J T
collection CERN
description Independent observations and everyday user experience indicate that performance and reliability of large grid infrastructures may suffer from large and unpredictable variations. In this paper we study the impact of the job queuing time on processing of moldable tasks which are commonly found in large-scale production grids. We use the mean value and variance of makespan as the quality of service indicators. We develop a general task processing model to provide a quantitative comparison between two models: early and late job binding in a user-level overlay applied to the EGEE Grid infrastructure. We find that the late-binding model effectively defines a transformation of the distribution of makespan according to the Central Limit Theorem. As demonstrated by Monte Carlo simulations using real job traces, this transformation allows to substantially reduce the mean value and variance of makespan. For certain classes of applications task granularity may be adjusted such that a speedup of an order of magnitude or more may be achieved. We use this result to propose a general strategy for managing access to resources and optimization of workload based on Ganga and DIANE user-level overlay tools. Key features of this approach include: a late-binding scheduler, an ability to interface to a wide range of distributed systems, an ability to extend and customize the system to cover application-specific scheduling and processing patterns and finally, ease of use and lightweight deployment in the user space. We discuss the impact of this approach for some practical applications where efficient processing of many tasks is required to solve scientific problems. (C) 2011 Elsevier B.V. All rights reserved.
id cern-1399779
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2011
record_format invenio
spelling cern-13997792019-09-30T06:29:59Zdoi:10.1016/j.future.2011.02.002http://cds.cern.ch/record/1399779engMoscicki, J TBubak, MSloot, P M ALamanna, MProcessing moldable tasks on the grid: Late job binding with lightweight user-level overlayXXIndependent observations and everyday user experience indicate that performance and reliability of large grid infrastructures may suffer from large and unpredictable variations. In this paper we study the impact of the job queuing time on processing of moldable tasks which are commonly found in large-scale production grids. We use the mean value and variance of makespan as the quality of service indicators. We develop a general task processing model to provide a quantitative comparison between two models: early and late job binding in a user-level overlay applied to the EGEE Grid infrastructure. We find that the late-binding model effectively defines a transformation of the distribution of makespan according to the Central Limit Theorem. As demonstrated by Monte Carlo simulations using real job traces, this transformation allows to substantially reduce the mean value and variance of makespan. For certain classes of applications task granularity may be adjusted such that a speedup of an order of magnitude or more may be achieved. We use this result to propose a general strategy for managing access to resources and optimization of workload based on Ganga and DIANE user-level overlay tools. Key features of this approach include: a late-binding scheduler, an ability to interface to a wide range of distributed systems, an ability to extend and customize the system to cover application-specific scheduling and processing patterns and finally, ease of use and lightweight deployment in the user space. We discuss the impact of this approach for some practical applications where efficient processing of many tasks is required to solve scientific problems. (C) 2011 Elsevier B.V. All rights reserved.oai:cds.cern.ch:13997792011
spellingShingle XX
Moscicki, J T
Bubak, M
Sloot, P M A
Lamanna, M
Processing moldable tasks on the grid: Late job binding with lightweight user-level overlay
title Processing moldable tasks on the grid: Late job binding with lightweight user-level overlay
title_full Processing moldable tasks on the grid: Late job binding with lightweight user-level overlay
title_fullStr Processing moldable tasks on the grid: Late job binding with lightweight user-level overlay
title_full_unstemmed Processing moldable tasks on the grid: Late job binding with lightweight user-level overlay
title_short Processing moldable tasks on the grid: Late job binding with lightweight user-level overlay
title_sort processing moldable tasks on the grid: late job binding with lightweight user-level overlay
topic XX
url https://dx.doi.org/10.1016/j.future.2011.02.002
http://cds.cern.ch/record/1399779
work_keys_str_mv AT moscickijt processingmoldabletasksonthegridlatejobbindingwithlightweightuserleveloverlay
AT bubakm processingmoldabletasksonthegridlatejobbindingwithlightweightuserleveloverlay
AT slootpma processingmoldabletasksonthegridlatejobbindingwithlightweightuserleveloverlay
AT lamannam processingmoldabletasksonthegridlatejobbindingwithlightweightuserleveloverlay