Cargando…

Design, Results, Evolution and Status of the ATLAS simulation in Point1 project.

During the LHC long shutdown period (LS1), that started in 2013, the simulation in Point1 (Sim@P1) project takes advantage in an opportunistic way of the trigger and data acquisition (TDAQ) farm of the ATLAS experiment. The farm provides more than 1500 computer nodes, and they are particularly suita...

Descripción completa

Detalles Bibliográficos
Autores principales: Ballestrero, Sergio, Fressard-Batraneanu, Silvia Maria, Brasolin, Franco, Contescu, Alexandru Cristian, Fazio, Daniel, Di Girolamo, Alessandro, Lee, Christopher Jon, Pozo Astigarraga, Mikel Eukeni, Scannicchio, Diana, Sedov, Alexey, Twomey, Matthew Shaun, Wang, Fuquan, Zaytsev, Alexander
Lenguaje:eng
Publicado: 2015
Materias:
Acceso en línea:http://cds.cern.ch/record/2002455
_version_ 1780946051612540928
author Ballestrero, Sergio
Fressard-Batraneanu, Silvia Maria
Brasolin, Franco
Contescu, Alexandru Cristian
Fazio, Daniel
Di Girolamo, Alessandro
Lee, Christopher Jon
Pozo Astigarraga, Mikel Eukeni
Scannicchio, Diana
Sedov, Alexey
Twomey, Matthew Shaun
Wang, Fuquan
Zaytsev, Alexander
author_facet Ballestrero, Sergio
Fressard-Batraneanu, Silvia Maria
Brasolin, Franco
Contescu, Alexandru Cristian
Fazio, Daniel
Di Girolamo, Alessandro
Lee, Christopher Jon
Pozo Astigarraga, Mikel Eukeni
Scannicchio, Diana
Sedov, Alexey
Twomey, Matthew Shaun
Wang, Fuquan
Zaytsev, Alexander
author_sort Ballestrero, Sergio
collection CERN
description During the LHC long shutdown period (LS1), that started in 2013, the simulation in Point1 (Sim@P1) project takes advantage in an opportunistic way of the trigger and data acquisition (TDAQ) farm of the ATLAS experiment. The farm provides more than 1500 computer nodes, and they are particularly suitable for running event generation and Monte Carlo production jobs that are mostly CPU and not I/O bound. It is capable of running up to 2500 virtual machines (VM) provided with 8 CPU cores each, for a total of up to 20000 parallel running jobs. This contribution gives a thorough review of the design, the results and the evolution of the Sim@P1 project operating a large scale Openstack based virtualized platform deployed on top of the ATLAS TDAQ farm computing resources. During LS1, Sim@P1 was one of the most productive GRID sites: it delivered more than 50 million CPU-hours and it generated more than 1.7 billion Monte Carlo events to various analysis communities within the ATLAS collaboration. The particular design aspects are presented: the virtualization platform exploited by the Sim@P1 project permits to avoid interferences with TDAQ operations and, more important, it guarantees the security and the usability of the ATLAS private network. The Cloud infrastructure allows to decouple the needed support on both infrastructural (hardware, virtualization layer) and logical (Grid site support and handling the job lifecycle) levels. In particular in this note we focus on the operational aspects of such a large system for the upcoming LHC Run 2 period: customized, simple, reliable and efficient tools are needed to quickly switch from Sim@P1 to TDAQ mode and vice versa to exploit the TDAQ resources when they are not used for the data acquisition, even for short period. We also describe the evolution of the central Openstack infrastructure as it was upgraded from Folsom to Icehouse release and the scalability issues we have addressed. The success of the Sim@P1 project is due to the continuous combined efforts of the ATLAS TDAQ SysAdmins and NetAdmins teams, CERN IT and the RHIC & ATLAS Computing Facility (RACF) at BNL.
id cern-2002455
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2015
record_format invenio
spelling cern-20024552019-09-30T06:29:59Zhttp://cds.cern.ch/record/2002455engBallestrero, SergioFressard-Batraneanu, Silvia MariaBrasolin, FrancoContescu, Alexandru CristianFazio, DanielDi Girolamo, AlessandroLee, Christopher JonPozo Astigarraga, Mikel EukeniScannicchio, DianaSedov, AlexeyTwomey, Matthew ShaunWang, FuquanZaytsev, AlexanderDesign, Results, Evolution and Status of the ATLAS simulation in Point1 project.Particle Physics - ExperimentDuring the LHC long shutdown period (LS1), that started in 2013, the simulation in Point1 (Sim@P1) project takes advantage in an opportunistic way of the trigger and data acquisition (TDAQ) farm of the ATLAS experiment. The farm provides more than 1500 computer nodes, and they are particularly suitable for running event generation and Monte Carlo production jobs that are mostly CPU and not I/O bound. It is capable of running up to 2500 virtual machines (VM) provided with 8 CPU cores each, for a total of up to 20000 parallel running jobs. This contribution gives a thorough review of the design, the results and the evolution of the Sim@P1 project operating a large scale Openstack based virtualized platform deployed on top of the ATLAS TDAQ farm computing resources. During LS1, Sim@P1 was one of the most productive GRID sites: it delivered more than 50 million CPU-hours and it generated more than 1.7 billion Monte Carlo events to various analysis communities within the ATLAS collaboration. The particular design aspects are presented: the virtualization platform exploited by the Sim@P1 project permits to avoid interferences with TDAQ operations and, more important, it guarantees the security and the usability of the ATLAS private network. The Cloud infrastructure allows to decouple the needed support on both infrastructural (hardware, virtualization layer) and logical (Grid site support and handling the job lifecycle) levels. In particular in this note we focus on the operational aspects of such a large system for the upcoming LHC Run 2 period: customized, simple, reliable and efficient tools are needed to quickly switch from Sim@P1 to TDAQ mode and vice versa to exploit the TDAQ resources when they are not used for the data acquisition, even for short period. We also describe the evolution of the central Openstack infrastructure as it was upgraded from Folsom to Icehouse release and the scalability issues we have addressed. The success of the Sim@P1 project is due to the continuous combined efforts of the ATLAS TDAQ SysAdmins and NetAdmins teams, CERN IT and the RHIC & ATLAS Computing Facility (RACF) at BNL.ATL-SOFT-SLIDE-2015-078oai:cds.cern.ch:20024552015-03-20
spellingShingle Particle Physics - Experiment
Ballestrero, Sergio
Fressard-Batraneanu, Silvia Maria
Brasolin, Franco
Contescu, Alexandru Cristian
Fazio, Daniel
Di Girolamo, Alessandro
Lee, Christopher Jon
Pozo Astigarraga, Mikel Eukeni
Scannicchio, Diana
Sedov, Alexey
Twomey, Matthew Shaun
Wang, Fuquan
Zaytsev, Alexander
Design, Results, Evolution and Status of the ATLAS simulation in Point1 project.
title Design, Results, Evolution and Status of the ATLAS simulation in Point1 project.
title_full Design, Results, Evolution and Status of the ATLAS simulation in Point1 project.
title_fullStr Design, Results, Evolution and Status of the ATLAS simulation in Point1 project.
title_full_unstemmed Design, Results, Evolution and Status of the ATLAS simulation in Point1 project.
title_short Design, Results, Evolution and Status of the ATLAS simulation in Point1 project.
title_sort design, results, evolution and status of the atlas simulation in point1 project.
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2002455
work_keys_str_mv AT ballestrerosergio designresultsevolutionandstatusoftheatlassimulationinpoint1project
AT fressardbatraneanusilviamaria designresultsevolutionandstatusoftheatlassimulationinpoint1project
AT brasolinfranco designresultsevolutionandstatusoftheatlassimulationinpoint1project
AT contescualexandrucristian designresultsevolutionandstatusoftheatlassimulationinpoint1project
AT faziodaniel designresultsevolutionandstatusoftheatlassimulationinpoint1project
AT digirolamoalessandro designresultsevolutionandstatusoftheatlassimulationinpoint1project
AT leechristopherjon designresultsevolutionandstatusoftheatlassimulationinpoint1project
AT pozoastigarragamikeleukeni designresultsevolutionandstatusoftheatlassimulationinpoint1project
AT scannicchiodiana designresultsevolutionandstatusoftheatlassimulationinpoint1project
AT sedovalexey designresultsevolutionandstatusoftheatlassimulationinpoint1project
AT twomeymatthewshaun designresultsevolutionandstatusoftheatlassimulationinpoint1project
AT wangfuquan designresultsevolutionandstatusoftheatlassimulationinpoint1project
AT zaytsevalexander designresultsevolutionandstatusoftheatlassimulationinpoint1project