Cargando…
Opportunistic usage of the CMS online cluster using a cloud overlay
After two years of maintenance and upgrade, the Large Hadron Collider (LHC), the largest and most powerful particle accelerator in the world, has started its second three year run. Around 1500 computers make up the CMS (Compact Muon Solenoid) Online cluster. This cluster is used for Data Acquisition...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
SISSA
2016
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.22323/1.270.0022 http://cds.cern.ch/record/2264508 |
_version_ | 1780954407682179072 |
---|---|
author | Chaze, Olivier Jean-Marc, Andre Andronidis, Anastasios Behrens, Ulf Branson, James Brummer, Philipp Contescu, Alexandru-Cristian Cittolin, Sergio Craigs, Benjamin Darlea, Georgiana-Lavinia Deldicque, Christian Demiragli, Zeynep Dobson, M Doualot, Nicolas Erhan, Samim Fulcher, Jonathan Richard Gigi, Dominique Glege, Frank Gomez-Ceballos, Guillelmo Hegeman, Jeroen Holzner, Andre Georg Jimenez-Estupiñán, Raul Masetti, Lorenzo Meijers, Frans Meschi, Emilio Mommsen, Remigius Morovic, Srecko O'Dell, Vivian Orsini, Luciano Paus, Christoph Pieri, Marco Racz, Attila Sakulin, Hannes Schwick, Christoph Reis, Thomas Simelevicius, Dainius Zejdl, Petr |
author_facet | Chaze, Olivier Jean-Marc, Andre Andronidis, Anastasios Behrens, Ulf Branson, James Brummer, Philipp Contescu, Alexandru-Cristian Cittolin, Sergio Craigs, Benjamin Darlea, Georgiana-Lavinia Deldicque, Christian Demiragli, Zeynep Dobson, M Doualot, Nicolas Erhan, Samim Fulcher, Jonathan Richard Gigi, Dominique Glege, Frank Gomez-Ceballos, Guillelmo Hegeman, Jeroen Holzner, Andre Georg Jimenez-Estupiñán, Raul Masetti, Lorenzo Meijers, Frans Meschi, Emilio Mommsen, Remigius Morovic, Srecko O'Dell, Vivian Orsini, Luciano Paus, Christoph Pieri, Marco Racz, Attila Sakulin, Hannes Schwick, Christoph Reis, Thomas Simelevicius, Dainius Zejdl, Petr |
author_sort | Chaze, Olivier |
collection | CERN |
description | After two years of maintenance and upgrade, the Large Hadron Collider (LHC), the largest and most powerful particle accelerator in the world, has started its second three year run. Around 1500 computers make up the CMS (Compact Muon Solenoid) Online cluster. This cluster is used for Data Acquisition of the CMS experiment at CERN, selecting and sending to storage around 20 TBytes of data per day that are then analysed by the Worldwide LHC Computing Grid (WLCG) infrastructure that links hundreds of data centres worldwide. 3000 CMS physicists can access and process data, and are always seeking more computing power and data. The backbone of the CMS Online cluster is composed of 16000 cores which provide as much computing power as all CMS WLCG Tier1 sites (352K HEP-SPEC-06 score in the CMS cluster versus 300K across CMS Tier1 sites). The computing power available in the CMS cluster can significantly speed up the processing of data, so an effort has been made to allocate the resources of the CMS Online cluster to the grid when it isn’t used to its full capacity for data acquisition. This occurs during the maintenance periods when the LHC is non-operational, which corresponded to 117 days in 2015. During 2016, the aim is to increase the availability of the CMS Online cluster for data processing by making the cluster accessible during the time between two physics collisions while the LHC and beams are being prepared. This is usually the case for a few hours every day, which would vastly increase the computing power available for data processing. Work has already been undertaken to provide this functionality, as an OpenStack cloud layer has been deployed as a minimal overlay that leaves the primary role of the cluster untouched. This overlay also abstracts the different hardware and networks that the cluster is composed of. The operation of the cloud (starting and stopping the virtual machines) is another challenge that has been overcome as the cluster has only a few hours spare during the aforementioned beam preparation. By improving the virtual image deployment and integrating the OpenStack services with the core services of the Data Acquisition on the CMS Online cluster it is now possible to start a thousand virtual machines within 10 minutes and to turn them off within seconds. This document will explain the architectural choices that were made to reach a fully redundant and scalable cloud, with a minimal impact on the running cluster configuration while giving a maximal segregation between the services. It will also present how to cold start 1000 virtual machines 25 times faster, using tools commonly utilised in all data centres. |
id | oai-inspirehep.net-1508944 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2016 |
publisher | SISSA |
record_format | invenio |
spelling | oai-inspirehep.net-15089442019-10-15T15:27:48Zdoi:10.22323/1.270.0022http://cds.cern.ch/record/2264508engChaze, OlivierJean-Marc, AndreAndronidis, AnastasiosBehrens, UlfBranson, JamesBrummer, PhilippContescu, Alexandru-CristianCittolin, SergioCraigs, BenjaminDarlea, Georgiana-LaviniaDeldicque, ChristianDemiragli, ZeynepDobson, MDoualot, NicolasErhan, SamimFulcher, Jonathan RichardGigi, DominiqueGlege, FrankGomez-Ceballos, GuillelmoHegeman, JeroenHolzner, Andre GeorgJimenez-Estupiñán, RaulMasetti, LorenzoMeijers, FransMeschi, EmilioMommsen, RemigiusMorovic, SreckoO'Dell, VivianOrsini, LucianoPaus, ChristophPieri, MarcoRacz, AttilaSakulin, HannesSchwick, ChristophReis, ThomasSimelevicius, DainiusZejdl, PetrOpportunistic usage of the CMS online cluster using a cloud overlayComputing and ComputersAfter two years of maintenance and upgrade, the Large Hadron Collider (LHC), the largest and most powerful particle accelerator in the world, has started its second three year run. Around 1500 computers make up the CMS (Compact Muon Solenoid) Online cluster. This cluster is used for Data Acquisition of the CMS experiment at CERN, selecting and sending to storage around 20 TBytes of data per day that are then analysed by the Worldwide LHC Computing Grid (WLCG) infrastructure that links hundreds of data centres worldwide. 3000 CMS physicists can access and process data, and are always seeking more computing power and data. The backbone of the CMS Online cluster is composed of 16000 cores which provide as much computing power as all CMS WLCG Tier1 sites (352K HEP-SPEC-06 score in the CMS cluster versus 300K across CMS Tier1 sites). The computing power available in the CMS cluster can significantly speed up the processing of data, so an effort has been made to allocate the resources of the CMS Online cluster to the grid when it isn’t used to its full capacity for data acquisition. This occurs during the maintenance periods when the LHC is non-operational, which corresponded to 117 days in 2015. During 2016, the aim is to increase the availability of the CMS Online cluster for data processing by making the cluster accessible during the time between two physics collisions while the LHC and beams are being prepared. This is usually the case for a few hours every day, which would vastly increase the computing power available for data processing. Work has already been undertaken to provide this functionality, as an OpenStack cloud layer has been deployed as a minimal overlay that leaves the primary role of the cluster untouched. This overlay also abstracts the different hardware and networks that the cluster is composed of. The operation of the cloud (starting and stopping the virtual machines) is another challenge that has been overcome as the cluster has only a few hours spare during the aforementioned beam preparation. By improving the virtual image deployment and integrating the OpenStack services with the core services of the Data Acquisition on the CMS Online cluster it is now possible to start a thousand virtual machines within 10 minutes and to turn them off within seconds. This document will explain the architectural choices that were made to reach a fully redundant and scalable cloud, with a minimal impact on the running cluster configuration while giving a maximal segregation between the services. It will also present how to cold start 1000 virtual machines 25 times faster, using tools commonly utilised in all data centres.SISSAoai:inspirehep.net:15089442016 |
spellingShingle | Computing and Computers Chaze, Olivier Jean-Marc, Andre Andronidis, Anastasios Behrens, Ulf Branson, James Brummer, Philipp Contescu, Alexandru-Cristian Cittolin, Sergio Craigs, Benjamin Darlea, Georgiana-Lavinia Deldicque, Christian Demiragli, Zeynep Dobson, M Doualot, Nicolas Erhan, Samim Fulcher, Jonathan Richard Gigi, Dominique Glege, Frank Gomez-Ceballos, Guillelmo Hegeman, Jeroen Holzner, Andre Georg Jimenez-Estupiñán, Raul Masetti, Lorenzo Meijers, Frans Meschi, Emilio Mommsen, Remigius Morovic, Srecko O'Dell, Vivian Orsini, Luciano Paus, Christoph Pieri, Marco Racz, Attila Sakulin, Hannes Schwick, Christoph Reis, Thomas Simelevicius, Dainius Zejdl, Petr Opportunistic usage of the CMS online cluster using a cloud overlay |
title | Opportunistic usage of the CMS online cluster using a cloud overlay |
title_full | Opportunistic usage of the CMS online cluster using a cloud overlay |
title_fullStr | Opportunistic usage of the CMS online cluster using a cloud overlay |
title_full_unstemmed | Opportunistic usage of the CMS online cluster using a cloud overlay |
title_short | Opportunistic usage of the CMS online cluster using a cloud overlay |
title_sort | opportunistic usage of the cms online cluster using a cloud overlay |
topic | Computing and Computers |
url | https://dx.doi.org/10.22323/1.270.0022 http://cds.cern.ch/record/2264508 |
work_keys_str_mv | AT chazeolivier opportunisticusageofthecmsonlineclusterusingacloudoverlay AT jeanmarcandre opportunisticusageofthecmsonlineclusterusingacloudoverlay AT andronidisanastasios opportunisticusageofthecmsonlineclusterusingacloudoverlay AT behrensulf opportunisticusageofthecmsonlineclusterusingacloudoverlay AT bransonjames opportunisticusageofthecmsonlineclusterusingacloudoverlay AT brummerphilipp opportunisticusageofthecmsonlineclusterusingacloudoverlay AT contescualexandrucristian opportunisticusageofthecmsonlineclusterusingacloudoverlay AT cittolinsergio opportunisticusageofthecmsonlineclusterusingacloudoverlay AT craigsbenjamin opportunisticusageofthecmsonlineclusterusingacloudoverlay AT darleageorgianalavinia opportunisticusageofthecmsonlineclusterusingacloudoverlay AT deldicquechristian opportunisticusageofthecmsonlineclusterusingacloudoverlay AT demiraglizeynep opportunisticusageofthecmsonlineclusterusingacloudoverlay AT dobsonm opportunisticusageofthecmsonlineclusterusingacloudoverlay AT doualotnicolas opportunisticusageofthecmsonlineclusterusingacloudoverlay AT erhansamim opportunisticusageofthecmsonlineclusterusingacloudoverlay AT fulcherjonathanrichard opportunisticusageofthecmsonlineclusterusingacloudoverlay AT gigidominique opportunisticusageofthecmsonlineclusterusingacloudoverlay AT glegefrank opportunisticusageofthecmsonlineclusterusingacloudoverlay AT gomezceballosguillelmo opportunisticusageofthecmsonlineclusterusingacloudoverlay AT hegemanjeroen opportunisticusageofthecmsonlineclusterusingacloudoverlay AT holznerandregeorg opportunisticusageofthecmsonlineclusterusingacloudoverlay AT jimenezestupinanraul opportunisticusageofthecmsonlineclusterusingacloudoverlay AT masettilorenzo opportunisticusageofthecmsonlineclusterusingacloudoverlay AT meijersfrans opportunisticusageofthecmsonlineclusterusingacloudoverlay AT meschiemilio opportunisticusageofthecmsonlineclusterusingacloudoverlay AT mommsenremigius opportunisticusageofthecmsonlineclusterusingacloudoverlay AT morovicsrecko opportunisticusageofthecmsonlineclusterusingacloudoverlay AT odellvivian opportunisticusageofthecmsonlineclusterusingacloudoverlay AT orsiniluciano opportunisticusageofthecmsonlineclusterusingacloudoverlay AT pauschristoph opportunisticusageofthecmsonlineclusterusingacloudoverlay AT pierimarco opportunisticusageofthecmsonlineclusterusingacloudoverlay AT raczattila opportunisticusageofthecmsonlineclusterusingacloudoverlay AT sakulinhannes opportunisticusageofthecmsonlineclusterusingacloudoverlay AT schwickchristoph opportunisticusageofthecmsonlineclusterusingacloudoverlay AT reisthomas opportunisticusageofthecmsonlineclusterusingacloudoverlay AT simeleviciusdainius opportunisticusageofthecmsonlineclusterusingacloudoverlay AT zejdlpetr opportunisticusageofthecmsonlineclusterusingacloudoverlay |