Opportunistic usage of the CMS online cluster using a cloud overlay

After two years of maintenance and upgrades, the Large Hadron Collider (LHC), the largest and most powerful particle accelerator in the world, has started its second three-year run. Around 1500 computers make up the CMS (Compact Muon Solenoid) Online cluster. This cluster is used for Data Acquisition...

Bibliographic Details
Main authors: Chaze, Olivier, Jean-Marc, Andre, Andronidis, Anastasios, Behrens, Ulf, Branson, James, Brummer, Philipp, Contescu, Alexandru-Cristian, Cittolin, Sergio, Craigs, Benjamin, Darlea, Georgiana-Lavinia, Deldicque, Christian, Demiragli, Zeynep, Dobson, M, Doualot, Nicolas, Erhan, Samim, Fulcher, Jonathan Richard, Gigi, Dominique, Glege, Frank, Gomez-Ceballos, Guillelmo, Hegeman, Jeroen, Holzner, Andre Georg, Jimenez-Estupiñán, Raul, Masetti, Lorenzo, Meijers, Frans, Meschi, Emilio, Mommsen, Remigius, Morovic, Srecko, O'Dell, Vivian, Orsini, Luciano, Paus, Christoph, Pieri, Marco, Racz, Attila, Sakulin, Hannes, Schwick, Christoph, Reis, Thomas, Simelevicius, Dainius, Zejdl, Petr
Language: eng
Published: SISSA 2016
Subjects: Computing and Computers
Online access: https://dx.doi.org/10.22323/1.270.0022
http://cds.cern.ch/record/2264508
author Chaze, Olivier
Jean-Marc, Andre
Andronidis, Anastasios
Behrens, Ulf
Branson, James
Brummer, Philipp
Contescu, Alexandru-Cristian
Cittolin, Sergio
Craigs, Benjamin
Darlea, Georgiana-Lavinia
Deldicque, Christian
Demiragli, Zeynep
Dobson, M
Doualot, Nicolas
Erhan, Samim
Fulcher, Jonathan Richard
Gigi, Dominique
Glege, Frank
Gomez-Ceballos, Guillelmo
Hegeman, Jeroen
Holzner, Andre Georg
Jimenez-Estupiñán, Raul
Masetti, Lorenzo
Meijers, Frans
Meschi, Emilio
Mommsen, Remigius
Morovic, Srecko
O'Dell, Vivian
Orsini, Luciano
Paus, Christoph
Pieri, Marco
Racz, Attila
Sakulin, Hannes
Schwick, Christoph
Reis, Thomas
Simelevicius, Dainius
Zejdl, Petr
collection CERN
description After two years of maintenance and upgrades, the Large Hadron Collider (LHC), the largest and most powerful particle accelerator in the world, has started its second three-year run. The CMS (Compact Muon Solenoid) Online cluster consists of around 1500 computers. It is used for data acquisition of the CMS experiment at CERN, selecting and sending to storage around 20 TBytes of data per day, which are then analysed by the Worldwide LHC Computing Grid (WLCG), an infrastructure that links hundreds of data centres worldwide. The 3000 CMS physicists who can access and process the data are always seeking more computing power and data. The backbone of the CMS Online cluster comprises 16000 cores, which provide as much computing power as all CMS WLCG Tier1 sites combined (a HEP-SPEC-06 score of 352K for the CMS Online cluster versus 300K across the CMS Tier1 sites). The computing power available in the CMS Online cluster can significantly speed up the processing of data, so an effort has been made to allocate its resources to the grid when it is not used to its full capacity for data acquisition. This occurs during the maintenance periods when the LHC is not operational, which amounted to 117 days in 2015. During 2016, the aim is to increase the availability of the CMS Online cluster for data processing by also making it accessible in the time between two periods of physics collisions, while the LHC and its beams are being prepared. This is usually the case for a few hours every day, which would vastly increase the computing power available for data processing. Work towards this has already been undertaken: an OpenStack cloud layer has been deployed as a minimal overlay that leaves the primary role of the cluster untouched. This overlay also abstracts the different hardware and networks that make up the cluster.
Operating the cloud (starting and stopping the virtual machines) is another challenge that has been overcome, as the cluster has only a few hours to spare during the aforementioned beam preparation. By improving the virtual image deployment and integrating the OpenStack services with the core data-acquisition services of the CMS Online cluster, it is now possible to start a thousand virtual machines within 10 minutes and to turn them off within seconds. This document explains the architectural choices that were made to reach a fully redundant and scalable cloud with minimal impact on the running cluster configuration and maximal segregation between the services. It also presents how to cold start 1000 virtual machines 25 times faster, using tools commonly used in data centres.
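The figures quoted in the description can be put side by side with a little arithmetic. The Python sketch below is purely illustrative and is not taken from the paper. The constants are the values stated in the abstract. The function names are hypothetical, and so is the `hours_per_day` argument, since the abstract only says "a few hours every day".

```python
"""Back-of-the-envelope arithmetic on the figures quoted in the abstract."""

# Values stated in the abstract.
ONLINE_HS06 = 352_000        # HEP-SPEC-06 score of the CMS Online cluster
TIER1_HS06 = 300_000         # HEP-SPEC-06 score across the CMS Tier1 sites
MAINTENANCE_DAYS_2015 = 117  # days in 2015 on which the LHC was not operational
VMS_STARTED = 1000           # virtual machines started ...
START_WINDOW_MIN = 10        # ... within this many minutes


def capacity_ratio(online_hs06: float, tier1_hs06: float) -> float:
    """Computing power of the online cluster relative to the Tier1 sites."""
    return online_hs06 / tier1_hs06


def availability_fraction(days_available: float, days_in_year: int = 365) -> float:
    """Fraction of the year during which the cluster was free for grid work."""
    return days_available / days_in_year


def min_start_rate(vms: int, minutes: float) -> float:
    """Lower bound on the start rate, since the VMs start *within* the window."""
    return vms / minutes


def interfill_day_equivalents(hours_per_day: float, operational_days: int) -> float:
    """Full-day equivalents gained from daily beam-preparation gaps.

    hours_per_day is a HYPOTHETICAL input: the abstract gives no exact value.
    """
    return hours_per_day * operational_days / 24.0


if __name__ == "__main__":
    print(f"Online / Tier1 capacity: {capacity_ratio(ONLINE_HS06, TIER1_HS06):.2f}")
    print(f"2015 availability: {availability_fraction(MAINTENANCE_DAYS_2015):.1%}")
    print(f"Start rate: at least {min_start_rate(VMS_STARTED, START_WINDOW_MIN):.0f} VMs/min")
    # Assumed 3 hours per day over the remaining 365 - 117 days of the year.
    extra = interfill_day_equivalents(3, 365 - MAINTENANCE_DAYS_2015)
    print(f"Extra full-day equivalents (assuming 3 h/day): {extra:.0f}")
```

Under these assumptions the online cluster provides roughly 1.17 times the Tier1 computing power and was available to the grid on about 32% of the days in 2015. The inter-fill estimate depends entirely on the assumed number of hours per day.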
id oai-inspirehep.net-1508944
institution European Organization for Nuclear Research
language eng
publishDate 2016
publisher SISSA
record_format invenio
title Opportunistic usage of the CMS online cluster using a cloud overlay
topic Computing and Computers
url https://dx.doi.org/10.22323/1.270.0022
http://cds.cern.ch/record/2264508