Cargando…
DIRAC Site Director: Improving Pilot-Job provisioning on grid resources
To study the constituents of matter, CERN mainly relies on the Worldwide LHC Computing Grid (WLCG), which processes petabytes of data coming from the Large Hadron Collider (LHC). LHC experiments have adopted the Pilot-Job paradigm, and deliver tools to supply grid resources with Pilot-Jobs, to effic...
Autores principales: | , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2022
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1016/j.future.2022.03.002 http://cds.cern.ch/record/2806286 |
_version_ | 1780972986687291392 |
---|---|
author | Boyer, Alexandre F Haen, Christophe Stagni, Federico Hill, David R C |
author_facet | Boyer, Alexandre F Haen, Christophe Stagni, Federico Hill, David R C |
author_sort | Boyer, Alexandre F |
collection | CERN |
description | To study the constituents of matter, CERN mainly relies on the Worldwide LHC Computing Grid
(WLCG), which processes petabytes of data coming from the Large Hadron Collider (LHC). LHC
experiments have adopted the Pilot-Job paradigm, and deliver tools to supply grid resources with
Pilot-Jobs, to efficiently leverage the computing power offered by WLCG. This sole approach will
be insufficient and will need to be complemented to meet future computing needs – of the HighLuminosity LHC – and the rise of data generated over time: national science programs are consolidating
computing resources and encourage using cloud and High-Performance Computing systems. Yet, even
though they have started to integrate their workflows on such infrastructures, LHC experiments still
largely depend on WLCG resources. This paper lays out an approach to increase the throughput of the
jobs, on grid resources, by improving the performance of the Pilot-Job provisioning tools through a case
study: the LHCb-specific solution, known as ‘‘DIRAC Site Director’’. We propose: (i) a complete analysis
of the capabilities and limitations of the DIRAC Site Director; (ii) several methods to speed up its
execution, including parallel processing as well as bulk operations; (iii) a comprehensive analysis of a
group of Site Directors in the LHCb production environment during 12 months. With our approach, we
recorded an increase of 40.86% of the number of jobs processed simultaneously per second, enabling
the simultaneous management of 80,300 LHCb jobs, while only 57,000 of them could be managed
before our improvements. |
id | cern-2806286 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2022 |
record_format | invenio |
spelling | cern-28062862023-03-22T15:38:58Zdoi:10.1016/j.future.2022.03.002http://cds.cern.ch/record/2806286engBoyer, Alexandre FHaen, ChristopheStagni, FedericoHill, David R CDIRAC Site Director: Improving Pilot-Job provisioning on grid resourcesComputing and ComputersTo study the constituents of matter, CERN mainly relies on the Worldwide LHC Computing Grid (WLCG), which processes petabytes of data coming from the Large Hadron Collider (LHC). LHC experiments have adopted the Pilot-Job paradigm, and deliver tools to supply grid resources with Pilot-Jobs, to efficiently leverage the computing power offered by WLCG. This sole approach will be insufficient and will need to be complemented to meet future computing needs – of the HighLuminosity LHC – and the rise of data generated over time: national science programs are consolidating computing resources and encourage using cloud and High-Performance Computing systems. Yet, even though they have started to integrate their workflows on such infrastructures, LHC experiments still largely depend on WLCG resources. This paper lays out an approach to increase the throughput of the jobs, on grid resources, by improving the performance of the Pilot-Job provisioning tools through a case study: the LHCb-specific solution, known as ‘‘DIRAC Site Director’’. We propose: (i) a complete analysis of the capabilities and limitations of the DIRAC Site Director; (ii) several methods to speed up its execution, including parallel processing as well as bulk operations; (iii) a comprehensive analysis of a group of Site Directors in the LHCb production environment during 12 months. With our approach, we recorded an increase of 40.86% of the number of jobs processed simultaneously per second, enabling the simultaneous management of 80,300 LHCb jobs, while only 57,000 of them could be managed before our improvements.oai:cds.cern.ch:28062862022 |
spellingShingle | Computing and Computers Boyer, Alexandre F Haen, Christophe Stagni, Federico Hill, David R C DIRAC Site Director: Improving Pilot-Job provisioning on grid resources |
title | DIRAC Site Director: Improving Pilot-Job provisioning on grid resources |
title_full | DIRAC Site Director: Improving Pilot-Job provisioning on grid resources |
title_fullStr | DIRAC Site Director: Improving Pilot-Job provisioning on grid resources |
title_full_unstemmed | DIRAC Site Director: Improving Pilot-Job provisioning on grid resources |
title_short | DIRAC Site Director: Improving Pilot-Job provisioning on grid resources |
title_sort | dirac site director: improving pilot-job provisioning on grid resources |
topic | Computing and Computers |
url | https://dx.doi.org/10.1016/j.future.2022.03.002 http://cds.cern.ch/record/2806286 |
work_keys_str_mv | AT boyeralexandref diracsitedirectorimprovingpilotjobprovisioningongridresources AT haenchristophe diracsitedirectorimprovingpilotjobprovisioningongridresources AT stagnifederico diracsitedirectorimprovingpilotjobprovisioningongridresources AT hilldavidrc diracsitedirectorimprovingpilotjobprovisioningongridresources |