Integration Of PanDA Workload Management System With Supercomputers
Main authors | Klimentov, Alexei; De, Kaushik; Maeno, Tadashi; Mashinistov, Ruslan; Nilsson, Paul; Oleynik, Danila; Panitkin, Sergey; Read, Kenneth; Ryabinkin, Evgeny; Wenaus, Torre |
Language | eng |
Published | 2015 |
Subjects | Particle Physics - Experiment |
Online access | http://cds.cern.ch/record/2017381 |
_version_ | 1780946726872416256 |
author | Klimentov, Alexei; De, Kaushik; Maeno, Tadashi; Mashinistov, Ruslan; Nilsson, Paul; Oleynik, Danila; Panitkin, Sergey; Read, Kenneth; Ryabinkin, Evgeny; Wenaus, Torre |
author_facet | Klimentov, Alexei; De, Kaushik; Maeno, Tadashi; Mashinistov, Ruslan; Nilsson, Paul; Oleynik, Danila; Panitkin, Sergey; Read, Kenneth; Ryabinkin, Evgeny; Wenaus, Torre |
author_sort | Klimentov, Alexei |
collection | CERN |
description | The Large Hadron Collider (LHC), operating at the international CERN Laboratory in Geneva, Switzerland, is leading Big Data-driven scientific exploration. Experiments at the LHC explore the fundamental nature of matter and the basic forces that shape our universe, and were recently credited with the discovery of a Higgs boson. ATLAS, one of the largest collaborations ever assembled in the sciences, is at the forefront of research at the LHC. To address an unprecedented multi-petabyte data-processing challenge, the ATLAS experiment relies on a heterogeneous distributed computational infrastructure. The ATLAS experiment uses the PanDA (Production and Data Analysis) Workload Management System to manage the workflow for all data processing at over 140 data centers. Through PanDA, ATLAS physicists see a single computing facility that enables rapid scientific breakthroughs for the experiment, even though the data centers are physically scattered all over the world. While PanDA currently uses more than 100,000 cores with a peak performance of 0.3 petaFLOPS, the next LHC data-taking runs will require more resources than Grid computing can possibly provide. To alleviate these challenges, the LHC experiments are engaged in an ambitious program to expand the current computing model to include additional resources such as the opportunistic use of supercomputers. We will describe a project aimed at the integration of the PanDA WMS with supercomputers in the United States, Europe and Russia (in particular with the Titan supercomputer at the Oak Ridge Leadership Computing Facility (OLCF), the supercomputer at the National Research Center “Kurchatov Institute”, IT4 in Ostrava, and others). The current approach utilizes a modified PanDA pilot framework for job submission to the supercomputers' batch queues and for local data management, with lightweight MPI wrappers to run single-threaded workloads in parallel on Titan's multi-core worker nodes.
This implementation was tested with a variety of Monte Carlo workloads on several supercomputing platforms. We will present our current accomplishments in running the PanDA WMS on supercomputers and demonstrate our ability to use PanDA as a portal independent of the computing facility's infrastructure for High Energy and Nuclear Physics as well as for other data-intensive science applications, such as bioinformatics and astroparticle physics. |
id | cern-2017381 |
institution | European Organization for Nuclear Research |
language | eng |
publishDate | 2015 |
record_format | invenio |
spelling | cern-2017381; 2019-09-30T06:29:59Z; http://cds.cern.ch/record/2017381; eng; Klimentov, Alexei; De, Kaushik; Maeno, Tadashi; Mashinistov, Ruslan; Nilsson, Paul; Oleynik, Danila; Panitkin, Sergey; Read, Kenneth; Ryabinkin, Evgeny; Wenaus, Torre; Integration Of PanDA Workload Management System With Supercomputers; Particle Physics - Experiment; ATL-SOFT-SLIDE-2015-258; oai:cds.cern.ch:2017381; 2015-05-20 |
spellingShingle | Particle Physics - Experiment; Klimentov, Alexei; De, Kaushik; Maeno, Tadashi; Mashinistov, Ruslan; Nilsson, Paul; Oleynik, Danila; Panitkin, Sergey; Read, Kenneth; Ryabinkin, Evgeny; Wenaus, Torre; Integration Of PanDA Workload Management System With Supercomputers |
title | Integration Of PanDA Workload Management System With Supercomputers |
title_full | Integration Of PanDA Workload Management System With Supercomputers |
title_fullStr | Integration Of PanDA Workload Management System With Supercomputers |
title_full_unstemmed | Integration Of PanDA Workload Management System With Supercomputers |
title_short | Integration Of PanDA Workload Management System With Supercomputers |
title_sort | integration of panda workload management system with supercomputers |
topic | Particle Physics - Experiment |
url | http://cds.cern.ch/record/2017381 |
work_keys_str_mv | AT klimentovalexei integrationofpandaworkloadmanagementsystemwithsupercomputers AT dekaushik integrationofpandaworkloadmanagementsystemwithsupercomputers AT maenotadashi integrationofpandaworkloadmanagementsystemwithsupercomputers AT mashinistovruslan integrationofpandaworkloadmanagementsystemwithsupercomputers AT nilssonpaul integrationofpandaworkloadmanagementsystemwithsupercomputers AT oleynikdanila integrationofpandaworkloadmanagementsystemwithsupercomputers AT panitkinsergey integrationofpandaworkloadmanagementsystemwithsupercomputers AT readkenneth integrationofpandaworkloadmanagementsystemwithsupercomputers AT ryabinkinevgeny integrationofpandaworkloadmanagementsystemwithsupercomputers AT wenaustorre integrationofpandaworkloadmanagementsystemwithsupercomputers |
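The abstract above describes lightweight MPI wrappers that let many single-threaded payloads run in parallel across a supercomputer's multi-core worker nodes under a single batch-queue job. As an illustration only (this is not the PanDA pilot code; the function name, the round-robin scheme, and the environment-variable fallbacks are assumptions for the sketch), a minimal rank-based wrapper in Python might look like:

```python
import os
import subprocess
import sys

def run_rank_payloads(payloads, rank=None, size=None):
    """Round-robin a list of single-threaded payload commands over MPI ranks.

    Each rank runs only the payloads assigned to it, as ordinary
    subprocesses, so N ranks execute up to N payloads concurrently.
    """
    # Rank/size would normally come from the MPI runtime; here we fall back
    # to environment variables set by some launchers (names vary by MPI
    # implementation -- illustrative only, not PanDA's actual mechanism).
    if rank is None:
        rank = int(os.environ.get("OMPI_COMM_WORLD_RANK", "0"))
    if size is None:
        size = int(os.environ.get("OMPI_COMM_WORLD_SIZE", "1"))
    return [
        subprocess.run(cmd).returncode
        for i, cmd in enumerate(payloads)
        if i % size == rank  # this rank's share of the work
    ]

if __name__ == "__main__":
    # Two trivial stand-in payloads; a real wrapper would instead launch
    # the experiment's single-threaded simulation binaries.
    demo = [[sys.executable, "-c", "pass"], [sys.executable, "-c", "pass"]]
    print(run_rank_payloads(demo, rank=0, size=1))
```

The point of the pattern is that the batch scheduler sees one large MPI job, while each rank independently executes an unmodified serial workload.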