DIRAC Site Director: Improving Pilot-Job provisioning on grid resources

To study the constituents of matter, CERN mainly relies on the Worldwide LHC Computing Grid (WLCG), which processes petabytes of data coming from the Large Hadron Collider (LHC). LHC experiments have adopted the Pilot-Job paradigm, and deliver tools to supply grid resources with Pilot-Jobs, to effic...

Bibliographic Details
Main authors: Boyer, Alexandre F; Haen, Christophe; Stagni, Federico; Hill, David R C
Language: eng
Published: 2022
Subjects: Computing and Computers
Online access: https://dx.doi.org/10.1016/j.future.2022.03.002
http://cds.cern.ch/record/2806286
_version_ 1780972986687291392
author Boyer, Alexandre F
Haen, Christophe
Stagni, Federico
Hill, David R C
author_facet Boyer, Alexandre F
Haen, Christophe
Stagni, Federico
Hill, David R C
author_sort Boyer, Alexandre F
collection CERN
description To study the constituents of matter, CERN mainly relies on the Worldwide LHC Computing Grid (WLCG), which processes petabytes of data coming from the Large Hadron Collider (LHC). LHC experiments have adopted the Pilot-Job paradigm and deliver tools to supply grid resources with Pilot-Jobs, to efficiently leverage the computing power offered by WLCG. This approach alone will be insufficient and will need to be complemented to meet the future computing needs of the High-Luminosity LHC and the growth of data generated over time: national science programs are consolidating computing resources and encouraging the use of cloud and High-Performance Computing systems. Yet, even though they have started to integrate their workflows on such infrastructures, LHC experiments still largely depend on WLCG resources. This paper lays out an approach to increase job throughput on grid resources by improving the performance of the Pilot-Job provisioning tools through a case study: the LHCb-specific solution, known as "DIRAC Site Director". We propose: (i) a complete analysis of the capabilities and limitations of the DIRAC Site Director; (ii) several methods to speed up its execution, including parallel processing as well as bulk operations; (iii) a comprehensive analysis of a group of Site Directors in the LHCb production environment over 12 months. With our approach, we recorded a 40.86% increase in the number of jobs processed simultaneously per second, enabling the simultaneous management of 80,300 LHCb jobs, whereas only 57,000 could be managed before our improvements.
id cern-2806286
institution European Organization for Nuclear Research
language eng
publishDate 2022
record_format invenio
spelling cern-2806286 2023-03-22T15:38:58Z doi:10.1016/j.future.2022.03.002 http://cds.cern.ch/record/2806286 eng Boyer, Alexandre F; Haen, Christophe; Stagni, Federico; Hill, David R C DIRAC Site Director: Improving Pilot-Job provisioning on grid resources Computing and Computers To study the constituents of matter, CERN mainly relies on the Worldwide LHC Computing Grid (WLCG), which processes petabytes of data coming from the Large Hadron Collider (LHC). LHC experiments have adopted the Pilot-Job paradigm and deliver tools to supply grid resources with Pilot-Jobs, to efficiently leverage the computing power offered by WLCG. This approach alone will be insufficient and will need to be complemented to meet the future computing needs of the High-Luminosity LHC and the growth of data generated over time: national science programs are consolidating computing resources and encouraging the use of cloud and High-Performance Computing systems. Yet, even though they have started to integrate their workflows on such infrastructures, LHC experiments still largely depend on WLCG resources. This paper lays out an approach to increase job throughput on grid resources by improving the performance of the Pilot-Job provisioning tools through a case study: the LHCb-specific solution, known as "DIRAC Site Director". We propose: (i) a complete analysis of the capabilities and limitations of the DIRAC Site Director; (ii) several methods to speed up its execution, including parallel processing as well as bulk operations; (iii) a comprehensive analysis of a group of Site Directors in the LHCb production environment over 12 months. With our approach, we recorded a 40.86% increase in the number of jobs processed simultaneously per second, enabling the simultaneous management of 80,300 LHCb jobs, whereas only 57,000 could be managed before our improvements. oai:cds.cern.ch:2806286 2022
spellingShingle Computing and Computers
Boyer, Alexandre F
Haen, Christophe
Stagni, Federico
Hill, David R C
DIRAC Site Director: Improving Pilot-Job provisioning on grid resources
title DIRAC Site Director: Improving Pilot-Job provisioning on grid resources
title_full DIRAC Site Director: Improving Pilot-Job provisioning on grid resources
title_fullStr DIRAC Site Director: Improving Pilot-Job provisioning on grid resources
title_full_unstemmed DIRAC Site Director: Improving Pilot-Job provisioning on grid resources
title_short DIRAC Site Director: Improving Pilot-Job provisioning on grid resources
title_sort dirac site director: improving pilot-job provisioning on grid resources
topic Computing and Computers
url https://dx.doi.org/10.1016/j.future.2022.03.002
http://cds.cern.ch/record/2806286
work_keys_str_mv AT boyeralexandref diracsitedirectorimprovingpilotjobprovisioningongridresources
AT haenchristophe diracsitedirectorimprovingpilotjobprovisioningongridresources
AT stagnifederico diracsitedirectorimprovingpilotjobprovisioningongridresources
AT hilldavidrc diracsitedirectorimprovingpilotjobprovisioningongridresources