
Pooling the resources of the CMS Tier-1 sites

The CMS experiment at the LHC relies on 7 Tier-1 centres of the WLCG to perform the majority of its bulk processing activity, and to archive its data. During the first run of the LHC, these two functions were tightly coupled as each Tier-1 was constrained to process only the data archived on its hie...


Bibliographic Details
Main authors: Apyan, A, Badillo, J, Cruz, J Diaz, Gadrat, S, Gutsche, O, Holzman, B, Lahiff, A, Magini, N, Mason, D, Perez, A, Stober, F, Taneja, S, Taze, M, Wissing, C
Language: eng
Published: 2015
Subjects: Computing and Computers
Online access: https://dx.doi.org/10.1088/1742-6596/664/4/042056
http://cds.cern.ch/record/2134578
_version_ 1780949908714422272
author Apyan, A
Badillo, J
Cruz, J Diaz
Gadrat, S
Gutsche, O
Holzman, B
Lahiff, A
Magini, N
Mason, D
Perez, A
Stober, F
Taneja, S
Taze, M
Wissing, C
author_facet Apyan, A
Badillo, J
Cruz, J Diaz
Gadrat, S
Gutsche, O
Holzman, B
Lahiff, A
Magini, N
Mason, D
Perez, A
Stober, F
Taneja, S
Taze, M
Wissing, C
author_sort Apyan, A
collection CERN
description The CMS experiment at the LHC relies on 7 Tier-1 centres of the WLCG to perform the majority of its bulk processing activity, and to archive its data. During the first run of the LHC, these two functions were tightly coupled as each Tier-1 was constrained to process only the data archived on its hierarchical storage. This lack of flexibility in the assignment of processing workflows occasionally resulted in uneven resource utilisation and in an increased latency in the delivery of the results to the physics community. The long shutdown of the LHC in 2013-2014 was an opportunity to revisit this mode of operations, disentangling the processing and archive functionalities of the Tier-1 centres. The storage services at the Tier-1s were redeployed breaking the traditional hierarchical model: each site now provides a large disk storage to host input and output data for processing, and an independent tape storage used exclusively for archiving. Movement of data between the tape and disk endpoints is not automated, but triggered externally through the WLCG transfer management systems. With this new setup, CMS operations actively controls at any time which data is available on disk for processing and which data should be sent to archive. Thanks to the high-bandwidth connectivity guaranteed by the LHCOPN, input data can be freely transferred between disk endpoints as needed to take advantage of free CPU, turning the Tier-1s into a large pool of shared resources. The output data can be validated before archiving them permanently, and temporary data formats can be produced without wasting valuable tape resources. Finally, the data hosted on disk at Tier-1s can now be made available also for user analysis since there is no risk any longer of triggering chaotic staging from tape. In this contribution, we describe the technical solutions adopted for the new disk and tape endpoints at the sites, and we report on the commissioning and scale testing of the service. We detail the procedures implemented by CMS computing operations to actively manage data on disk at Tier-1 sites, and we give examples of the benefits brought to CMS workflows by the additional flexibility of the new system.
id oai-inspirehep.net-1413883
institution European Organization for Nuclear Research
language eng
publishDate 2015
record_format invenio
spelling oai-inspirehep.net-1413883 2022-08-10T13:00:56Z doi:10.1088/1742-6596/664/4/042056 http://cds.cern.ch/record/2134578 eng Apyan, A; Badillo, J; Cruz, J Diaz; Gadrat, S; Gutsche, O; Holzman, B; Lahiff, A; Magini, N; Mason, D; Perez, A; Stober, F; Taneja, S; Taze, M; Wissing, C Pooling the resources of the CMS Tier-1 sites Computing and Computers The CMS experiment at the LHC relies on 7 Tier-1 centres of the WLCG to perform the majority of its bulk processing activity, and to archive its data. During the first run of the LHC, these two functions were tightly coupled as each Tier-1 was constrained to process only the data archived on its hierarchical storage. This lack of flexibility in the assignment of processing workflows occasionally resulted in uneven resource utilisation and in an increased latency in the delivery of the results to the physics community. The long shutdown of the LHC in 2013-2014 was an opportunity to revisit this mode of operations, disentangling the processing and archive functionalities of the Tier-1 centres. The storage services at the Tier-1s were redeployed breaking the traditional hierarchical model: each site now provides a large disk storage to host input and output data for processing, and an independent tape storage used exclusively for archiving. Movement of data between the tape and disk endpoints is not automated, but triggered externally through the WLCG transfer management systems. With this new setup, CMS operations actively controls at any time which data is available on disk for processing and which data should be sent to archive. Thanks to the high-bandwidth connectivity guaranteed by the LHCOPN, input data can be freely transferred between disk endpoints as needed to take advantage of free CPU, turning the Tier-1s into a large pool of shared resources. The output data can be validated before archiving them permanently, and temporary data formats can be produced without wasting valuable tape resources. Finally, the data hosted on disk at Tier-1s can now be made available also for user analysis since there is no risk any longer of triggering chaotic staging from tape. In this contribution, we describe the technical solutions adopted for the new disk and tape endpoints at the sites, and we report on the commissioning and scale testing of the service. We detail the procedures implemented by CMS computing operations to actively manage data on disk at Tier-1 sites, and we give examples of the benefits brought to CMS workflows by the additional flexibility of the new system. FERMILAB-CONF-15-447-CD oai:inspirehep.net:1413883 2015
spellingShingle Computing and Computers
Apyan, A
Badillo, J
Cruz, J Diaz
Gadrat, S
Gutsche, O
Holzman, B
Lahiff, A
Magini, N
Mason, D
Perez, A
Stober, F
Taneja, S
Taze, M
Wissing, C
Pooling the resources of the CMS Tier-1 sites
title Pooling the resources of the CMS Tier-1 sites
title_full Pooling the resources of the CMS Tier-1 sites
title_fullStr Pooling the resources of the CMS Tier-1 sites
title_full_unstemmed Pooling the resources of the CMS Tier-1 sites
title_short Pooling the resources of the CMS Tier-1 sites
title_sort pooling the resources of the cms tier-1 sites
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/664/4/042056
http://cds.cern.ch/record/2134578
work_keys_str_mv AT apyana poolingtheresourcesofthecmstier1sites
AT badilloj poolingtheresourcesofthecmstier1sites
AT cruzjdiaz poolingtheresourcesofthecmstier1sites
AT gadrats poolingtheresourcesofthecmstier1sites
AT gutscheo poolingtheresourcesofthecmstier1sites
AT holzmanb poolingtheresourcesofthecmstier1sites
AT lahiffa poolingtheresourcesofthecmstier1sites
AT maginin poolingtheresourcesofthecmstier1sites
AT masond poolingtheresourcesofthecmstier1sites
AT pereza poolingtheresourcesofthecmstier1sites
AT stoberf poolingtheresourcesofthecmstier1sites
AT tanejas poolingtheresourcesofthecmstier1sites
AT tazem poolingtheresourcesofthecmstier1sites
AT wissingc poolingtheresourcesofthecmstier1sites