Cargando…

Extending the ATLAS PanDA Workload Management System for New Big Data Applications

The LHC experiments are today at the leading edge of large scale distributed data-intensive computational science. The LHC's ATLAS experiment processes data volumes which are particularly extreme, over 130 PB to date, distributed worldwide at over of 120 sites. An important element in the succe...

Descripción completa

Detalles Bibliográficos
Autores principales: De, K, Klimentov, A, Maeno, T, Nilsson, P, Panitkin, S, Vaniachine, A, Wenaus, T, Yu, D
Lenguaje:eng
Publicado: 2013
Materias:
Acceso en línea:http://cds.cern.ch/record/1551737
_version_ 1780930253994065920
author De, K
Klimentov, A
Maeno, T
Nilsson, P
Panitkin, S
Vaniachine, A
Wenaus, T
Yu, D
author_facet De, K
Klimentov, A
Maeno, T
Nilsson, P
Panitkin, S
Vaniachine, A
Wenaus, T
Yu, D
author_sort De, K
collection CERN
description The LHC experiments are today at the leading edge of large scale distributed data-intensive computational science. The LHC's ATLAS experiment processes data volumes which are particularly extreme, over 130 PB to date, distributed worldwide at over of 120 sites. An important element in the success of the exciting physics results from ATLAS is the highly scalable integrated workflow and dataflow management afforded by the PanDA workload management system, used for all the distributed computing needs of the experiment. The PanDA design is not experiment specific and PanDA is now being extended to support other data intensive scientific applications. Alpha-Magnetic Spectrometer, an astro-particle experiment on the International Space Station, and the Compact Muon Solenoid, an LHC experiment, have successfully evaluated PanDA and are pursuing its adoption. PanDA was cited as an example of "a high performance, fault tolerant software for fast, scalable access to data repositories of many kinds" during the "Big Data Research and Development Initiative" announcement, a $200 million U.S. government investment in tools to handle huge volumes of digital data needed to spur science and engineering discoveries. In this talk, a description of the new program of work to develop a generic version of PanDA will be given, as well as the progress in extending PanDA's capabilities to support supercomputers, clouds, leverage intelligent networking, while accommodating the ever growing needs of current users. PanDA has already demonstrated at a very large scale the value of automated data-aware dynamic brokering of diverse workloads across distributed computing resources. The next generation of PanDA will allow many data-intensive sciences employing a variety of computing platforms to benefit from ATLAS' experience and proven tools in highly scalable processing.
id cern-1551737
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2013
record_format invenio
spelling cern-15517372019-09-30T06:29:59Zhttp://cds.cern.ch/record/1551737engDe, KKlimentov, AMaeno, TNilsson, PPanitkin, SVaniachine, AWenaus, TYu, DExtending the ATLAS PanDA Workload Management System for New Big Data ApplicationsDetectors and Experimental TechniquesThe LHC experiments are today at the leading edge of large scale distributed data-intensive computational science. The LHC's ATLAS experiment processes data volumes which are particularly extreme, over 130 PB to date, distributed worldwide at over of 120 sites. An important element in the success of the exciting physics results from ATLAS is the highly scalable integrated workflow and dataflow management afforded by the PanDA workload management system, used for all the distributed computing needs of the experiment. The PanDA design is not experiment specific and PanDA is now being extended to support other data intensive scientific applications. Alpha-Magnetic Spectrometer, an astro-particle experiment on the International Space Station, and the Compact Muon Solenoid, an LHC experiment, have successfully evaluated PanDA and are pursuing its adoption. PanDA was cited as an example of "a high performance, fault tolerant software for fast, scalable access to data repositories of many kinds" during the "Big Data Research and Development Initiative" announcement, a $200 million U.S. government investment in tools to handle huge volumes of digital data needed to spur science and engineering discoveries. In this talk, a description of the new program of work to develop a generic version of PanDA will be given, as well as the progress in extending PanDA's capabilities to support supercomputers, clouds, leverage intelligent networking, while accommodating the ever growing needs of current users. PanDA has already demonstrated at a very large scale the value of automated data-aware dynamic brokering of diverse workloads across distributed computing resources. The next generation of PanDA will allow many data-intensive sciences employing a variety of computing platforms to benefit from ATLAS' experience and proven tools in highly scalable processing.ATL-SOFT-SLIDE-2013-296oai:cds.cern.ch:15517372013-05-29
spellingShingle Detectors and Experimental Techniques
De, K
Klimentov, A
Maeno, T
Nilsson, P
Panitkin, S
Vaniachine, A
Wenaus, T
Yu, D
Extending the ATLAS PanDA Workload Management System for New Big Data Applications
title Extending the ATLAS PanDA Workload Management System for New Big Data Applications
title_full Extending the ATLAS PanDA Workload Management System for New Big Data Applications
title_fullStr Extending the ATLAS PanDA Workload Management System for New Big Data Applications
title_full_unstemmed Extending the ATLAS PanDA Workload Management System for New Big Data Applications
title_short Extending the ATLAS PanDA Workload Management System for New Big Data Applications
title_sort extending the atlas panda workload management system for new big data applications
topic Detectors and Experimental Techniques
url http://cds.cern.ch/record/1551737
work_keys_str_mv AT dek extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications
AT klimentova extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications
AT maenot extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications
AT nilssonp extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications
AT panitkins extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications
AT vaniachinea extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications
AT wenaust extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications
AT yud extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications