Cargando…

CHEP2015: Dynamic Resource Allocation with arcControlTower

Distributed computing resources available for high-energy physics research are becoming less dedicated to one type of workflow and researchers’ workloads are increasingly exploiting modern computing technologies such as parallelism. The current pilot job management model used by many experiments rel...

Descripción completa

Detalles Bibliográficos
Autores principales: Filipcic, Andrej, Cameron, David, Nilsen, Jon Kerr
Lenguaje:eng
Publicado: 2015
Materias:
Acceso en línea:http://cds.cern.ch/record/2007500
_version_ 1780946359149395968
author Filipcic, Andrej
Cameron, David
Nilsen, Jon Kerr
author_facet Filipcic, Andrej
Cameron, David
Nilsen, Jon Kerr
author_sort Filipcic, Andrej
collection CERN
description Distributed computing resources available for high-energy physics research are becoming less dedicated to one type of workflow and researchers’ workloads are increasingly exploiting modern computing technologies such as parallelism. The current pilot job management model used by many experiments relies on static dedicated resources and cannot easily adapt to these changes. The model used for ATLAS in Nordic countries and some other places enables a flexible job management system based on dynamic resources allocation. Rather than a fixed set of resources managed centrally, the model allows resources to be requested on the fly. The ARC Computing Element (ARC-CE) and ARC Control Tower (aCT) are the key components of the model. The aCT requests jobs from the ATLAS job mangement system (Panda) and submits a fully-formed job description to ARC-CEs. ARC-CE can then dynamically request the required resources from the underlying batch system. In this paper we describe the architecture of the model and the experience of running many millions of ATLAS jobs on it.
id cern-2007500
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2015
record_format invenio
spelling cern-20075002019-09-30T06:29:59Zhttp://cds.cern.ch/record/2007500engFilipcic, AndrejCameron, DavidNilsen, Jon KerrCHEP2015: Dynamic Resource Allocation with arcControlTowerParticle Physics - ExperimentDistributed computing resources available for high-energy physics research are becoming less dedicated to one type of workflow and researchers’ workloads are increasingly exploiting modern computing technologies such as parallelism. The current pilot job management model used by many experiments relies on static dedicated resources and cannot easily adapt to these changes. The model used for ATLAS in Nordic countries and some other places enables a flexible job management system based on dynamic resources allocation. Rather than a fixed set of resources managed centrally, the model allows resources to be requested on the fly. The ARC Computing Element (ARC-CE) and ARC Control Tower (aCT) are the key components of the model. The aCT requests jobs from the ATLAS job mangement system (Panda) and submits a fully-formed job description to ARC-CEs. ARC-CE can then dynamically request the required resources from the underlying batch system. In this paper we describe the architecture of the model and the experience of running many millions of ATLAS jobs on it.ATL-SOFT-SLIDE-2015-156oai:cds.cern.ch:20075002015-04-08
spellingShingle Particle Physics - Experiment
Filipcic, Andrej
Cameron, David
Nilsen, Jon Kerr
CHEP2015: Dynamic Resource Allocation with arcControlTower
title CHEP2015: Dynamic Resource Allocation with arcControlTower
title_full CHEP2015: Dynamic Resource Allocation with arcControlTower
title_fullStr CHEP2015: Dynamic Resource Allocation with arcControlTower
title_full_unstemmed CHEP2015: Dynamic Resource Allocation with arcControlTower
title_short CHEP2015: Dynamic Resource Allocation with arcControlTower
title_sort chep2015: dynamic resource allocation with arccontroltower
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2007500
work_keys_str_mv AT filipcicandrej chep2015dynamicresourceallocationwitharccontroltower
AT camerondavid chep2015dynamicresourceallocationwitharccontroltower
AT nilsenjonkerr chep2015dynamicresourceallocationwitharccontroltower