Integration Of PanDA Workload Management System With Supercomputers for ATLAS and Data Intensive Science

The LHC, operating at CERN, is leading Big Data driven scientific explorations. Experiments at the LHC explore the fundamental nature of matter and the basic forces that shape our universe. ATLAS, one of the largest collaborations ever assembled in the sciences, is at the forefront of research at th...

Bibliographic Details
Main Authors: Klimentov, A, De, K, Jha, S, Maeno, T, Nilsson, P, Oleynik, D, Panitkin, S, Wells, J, Wenaus, T
Language: eng
Published: 2016
Subjects: Computing and Computers
Online Access: https://dx.doi.org/10.1088/1742-6596/762/1/012021
http://cds.cern.ch/record/2277801
_version_ 1780955337937911808
author Klimentov, A
De, K
Jha, S
Maeno, T
Nilsson, P
Oleynik, D
Panitkin, S
Wells, J
Wenaus, T
author_sort Klimentov, A
collection CERN
description The LHC, operating at CERN, is leading Big Data driven scientific explorations. Experiments at the LHC explore the fundamental nature of matter and the basic forces that shape our universe. ATLAS, one of the largest collaborations ever assembled in the sciences, is at the forefront of research at the LHC. To address an unprecedented multi-petabyte data processing challenge, the ATLAS experiment relies on a heterogeneous distributed computational infrastructure. The ATLAS experiment uses the PanDA (Production and Data Analysis) Workload Management System (WMS) to manage the workflow for all data processing at over 150 data centers. Through PanDA, ATLAS physicists see a single computing facility that enables rapid scientific breakthroughs for the experiment, even though the data centers are physically scattered all over the world. While PanDA currently uses more than 250,000 cores with a peak performance of 0.3 petaFLOPS, LHC data-taking runs require more resources than the grid can possibly provide. To alleviate these challenges, LHC experiments are engaged in an ambitious program to expand the current computing model to include additional resources, such as the opportunistic use of supercomputers. We will describe a project aimed at integrating the PanDA WMS with supercomputers in the United States, in particular with the Titan supercomputer at the Oak Ridge Leadership Computing Facility. The current approach uses a modified PanDA pilot framework for job submission to the supercomputers' batch queues and for local data management, with lightweight MPI wrappers to run single-threaded workloads in parallel on the LCFs' multi-core worker nodes. This implementation was tested with a variety of Monte Carlo workloads on several supercomputing platforms for the ALICE and ATLAS experiments, and it has been in full production for ATLAS since September 2015. We will present our current accomplishments in running PanDA at supercomputers and demonstrate our ability to use PanDA as a portal, independent of the computing facilities' infrastructure, for High Energy and Nuclear Physics as well as other data-intensive science applications, such as bioinformatics and astro-particle physics.
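The MPI-wrapper idea mentioned in the description can be illustrated with a minimal sketch. It is not the PanDA pilot's actual code: it assumes the optional `mpi4py` package is used to obtain the MPI rank, and the names `get_rank_and_size`, `run_payload`, `PAYLOAD_RANK`, and the `rank_NNNNN` directory layout are hypothetical choices made for this example. Each MPI rank starts one independent single-threaded payload in its own working directory, so one batch job can fill all the cores it was allocated.

```python
import os
import subprocess
import sys
import tempfile


def get_rank_and_size():
    """Return (rank, size) from MPI, or (0, 1) if mpi4py is not installed."""
    try:
        from mpi4py import MPI  # assumption: mpi4py is the MPI binding in use
    except ImportError:
        return 0, 1
    comm = MPI.COMM_WORLD
    return comm.Get_rank(), comm.Get_size()


def run_payload(rank, command, base_dir):
    """Run one single-threaded payload in a private per-rank directory.

    Returns the working directory and the payload's exit code.
    """
    workdir = os.path.join(base_dir, f"rank_{rank:05d}")
    os.makedirs(workdir, exist_ok=True)
    # PAYLOAD_RANK is a hypothetical variable name; the payload could use it
    # to pick its own input file or random seed.
    env = dict(os.environ, PAYLOAD_RANK=str(rank))
    with open(os.path.join(workdir, "payload.log"), "w") as log:
        result = subprocess.run(
            command, cwd=workdir, env=env,
            stdout=log, stderr=subprocess.STDOUT,
        )
    return workdir, result.returncode


if __name__ == "__main__":
    rank, size = get_rank_and_size()
    # Stand-in payload: a trivial Python one-liner instead of a real workload.
    payload = [sys.executable, "-c", "print('hello from payload')"]
    workdir, code = run_payload(rank, payload, tempfile.mkdtemp())
    print(f"rank {rank} of {size}: exit code {code}, log in {workdir}")
```

Started under an MPI launcher, every rank would execute the same script and run its own copy of the payload; without MPI it behaves as a single rank, which makes the wrapper easy to try on a laptop.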
id oai-inspirehep.net-1499967
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2016
record_format invenio
title Integration Of PanDA Workload Management System With Supercomputers for ATLAS and Data Intensive Science
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/762/1/012021
http://cds.cern.ch/record/2277801