Cargando…

Integration Of PanDA Workload Management System With Supercomputers for ATLAS

The Large Hadron Collider (LHC), operating at the international CERN Laboratory in Geneva, Switzerland, is leading Big Data driven scientific explorations. Experiments at the LHC explore the fundamental nature of matter and the basic forces that shape our universe, and were recently credited for the...

Descripción completa

Detalles Bibliográficos
Autores principales: Oleynik, Danila, Klimentov, Alexei, De, Kaushik, Wenaus, Torre, Maeno, Tadashi, Barreiro Megino, Fernando Harald, Nilsson, Paul, Guan, Wen, Panitkin, Sergey
Lenguaje:eng
Publicado: 2016
Materias:
Acceso en línea:http://cds.cern.ch/record/2195771
_version_ 1780951073643560960
author Oleynik, Danila
Klimentov, Alexei
De, Kaushik
Wenaus, Torre
Maeno, Tadashi
Barreiro Megino, Fernando Harald
Nilsson, Paul
Guan, Wen
Panitkin, Sergey
author_facet Oleynik, Danila
Klimentov, Alexei
De, Kaushik
Wenaus, Torre
Maeno, Tadashi
Barreiro Megino, Fernando Harald
Nilsson, Paul
Guan, Wen
Panitkin, Sergey
author_sort Oleynik, Danila
collection CERN
description The Large Hadron Collider (LHC), operating at the international CERN Laboratory in Geneva, Switzerland, is leading Big Data driven scientific explorations. Experiments at the LHC explore the fundamental nature of matter and the basic forces that shape our universe, and were recently credited for the discovery of a Higgs boson. ATLAS, one of the largest collaborations ever assembled in the sciences, is at the forefront of research at the LHC. To address an unprecedented multi-petabyte data processing challenge, the ATLAS experiment is relying on a heterogeneous distributed computational infrastructure. The ATLAS experiment uses PanDA (Production ANd Distributed Analysis system) Workload Management System for managing the workflow for all data processing on over 150 data centers. Through PanDA, ATLAS physicists see a single computing facility that enables rapid scientific breakthroughs for the experiment, even though the data centers are physically scattered all over the world. While PanDA currently uses more than 250,000 cores with a peak performance of 0.3+ petaFLOPS, next LHC data taking runs will require more resources than Grid computing can possibly provide. To alleviate these challenges, LHC experiments are engaged in an ambitious program to expand the current computing model to include additional resources such as the opportunistic use of supercomputers. We will describe a project aimed at integration of PanDA WMS with supercomputers in United States, Europe and Russia (in particular with Titan supercomputer at Oak Ridge Leadership Computing Facility, MIRA supercomputer at Argonne Leadership Computing Facilities, and others). In our talk we will consider different approaches towards ATLAS data processing on supercomputers: using dedicated allocation of supercomputer time, working in backfill mode, and multi-step processing. Special attention will be devoted to AES (ATLAS event service) on HPC and multi-job pilot. We will present our recent accomplishments with running PanDA at supercomputers and demonstrate our ability to use PanDA as a portal independent of the computing facilities infrastructure for High Energy and Nuclear Physics as well as other data-intensive science applications, such as bioinformatics and astro-particle physics.
id cern-2195771
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2016
record_format invenio
spelling cern-21957712019-09-30T06:29:59Zhttp://cds.cern.ch/record/2195771engOleynik, DanilaKlimentov, AlexeiDe, KaushikWenaus, TorreMaeno, TadashiBarreiro Megino, Fernando HaraldNilsson, PaulGuan, WenPanitkin, SergeyIntegration Of PanDA Workload Management System With Supercomputers for ATLASParticle Physics - ExperimentThe Large Hadron Collider (LHC), operating at the international CERN Laboratory in Geneva, Switzerland, is leading Big Data driven scientific explorations. Experiments at the LHC explore the fundamental nature of matter and the basic forces that shape our universe, and were recently credited for the discovery of a Higgs boson. ATLAS, one of the largest collaborations ever assembled in the sciences, is at the forefront of research at the LHC. To address an unprecedented multi-petabyte data processing challenge, the ATLAS experiment is relying on a heterogeneous distributed computational infrastructure. The ATLAS experiment uses PanDA (Production ANd Distributed Analysis system) Workload Management System for managing the workflow for all data processing on over 150 data centers. Through PanDA, ATLAS physicists see a single computing facility that enables rapid scientific breakthroughs for the experiment, even though the data centers are physically scattered all over the world. While PanDA currently uses more than 250,000 cores with a peak performance of 0.3+ petaFLOPS, next LHC data taking runs will require more resources than Grid computing can possibly provide. To alleviate these challenges, LHC experiments are engaged in an ambitious program to expand the current computing model to include additional resources such as the opportunistic use of supercomputers. We will describe a project aimed at integration of PanDA WMS with supercomputers in United States, Europe and Russia (in particular with Titan supercomputer at Oak Ridge Leadership Computing Facility, MIRA supercomputer at Argonne Leadership Computing Facilities, and others). In our talk we will consider different approaches towards ATLAS data processing on supercomputers: using dedicated allocation of supercomputer time, working in backfill mode, and multi-step processing. Special attention will be devoted to AES (ATLAS event service) on HPC and multi-job pilot. We will present our recent accomplishments with running PanDA at supercomputers and demonstrate our ability to use PanDA as a portal independent of the computing facilities infrastructure for High Energy and Nuclear Physics as well as other data-intensive science applications, such as bioinformatics and astro-particle physics.ATL-SOFT-SLIDE-2016-371oai:cds.cern.ch:21957712016-07-01
spellingShingle Particle Physics - Experiment
Oleynik, Danila
Klimentov, Alexei
De, Kaushik
Wenaus, Torre
Maeno, Tadashi
Barreiro Megino, Fernando Harald
Nilsson, Paul
Guan, Wen
Panitkin, Sergey
Integration Of PanDA Workload Management System With Supercomputers for ATLAS
title Integration Of PanDA Workload Management System With Supercomputers for ATLAS
title_full Integration Of PanDA Workload Management System With Supercomputers for ATLAS
title_fullStr Integration Of PanDA Workload Management System With Supercomputers for ATLAS
title_full_unstemmed Integration Of PanDA Workload Management System With Supercomputers for ATLAS
title_short Integration Of PanDA Workload Management System With Supercomputers for ATLAS
title_sort integration of panda workload management system with supercomputers for atlas
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2195771
work_keys_str_mv AT oleynikdanila integrationofpandaworkloadmanagementsystemwithsupercomputersforatlas
AT klimentovalexei integrationofpandaworkloadmanagementsystemwithsupercomputersforatlas
AT dekaushik integrationofpandaworkloadmanagementsystemwithsupercomputersforatlas
AT wenaustorre integrationofpandaworkloadmanagementsystemwithsupercomputersforatlas
AT maenotadashi integrationofpandaworkloadmanagementsystemwithsupercomputersforatlas
AT barreiromeginofernandoharald integrationofpandaworkloadmanagementsystemwithsupercomputersforatlas
AT nilssonpaul integrationofpandaworkloadmanagementsystemwithsupercomputersforatlas
AT guanwen integrationofpandaworkloadmanagementsystemwithsupercomputersforatlas
AT panitkinsergey integrationofpandaworkloadmanagementsystemwithsupercomputersforatlas