Cargando…
Extending the ATLAS PanDA Workload Management System for New Big Data Applications
The LHC experiments are today at the leading edge of large scale distributed data-intensive computational science. The LHC's ATLAS experiment processes data volumes which are particularly extreme, over 130 PB to date, distributed worldwide at over of 120 sites. An important element in the succe...
Autores principales: | , , , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2013
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/1551737 |
_version_ | 1780930253994065920 |
---|---|
author | De, K Klimentov, A Maeno, T Nilsson, P Panitkin, S Vaniachine, A Wenaus, T Yu, D |
author_facet | De, K Klimentov, A Maeno, T Nilsson, P Panitkin, S Vaniachine, A Wenaus, T Yu, D |
author_sort | De, K |
collection | CERN |
description | The LHC experiments are today at the leading edge of large scale distributed data-intensive computational science. The LHC's ATLAS experiment processes data volumes which are particularly extreme, over 130 PB to date, distributed worldwide at over of 120 sites. An important element in the success of the exciting physics results from ATLAS is the highly scalable integrated workflow and dataflow management afforded by the PanDA workload management system, used for all the distributed computing needs of the experiment. The PanDA design is not experiment specific and PanDA is now being extended to support other data intensive scientific applications. Alpha-Magnetic Spectrometer, an astro-particle experiment on the International Space Station, and the Compact Muon Solenoid, an LHC experiment, have successfully evaluated PanDA and are pursuing its adoption. PanDA was cited as an example of "a high performance, fault tolerant software for fast, scalable access to data repositories of many kinds" during the "Big Data Research and Development Initiative" announcement, a $200 million U.S. government investment in tools to handle huge volumes of digital data needed to spur science and engineering discoveries. In this talk, a description of the new program of work to develop a generic version of PanDA will be given, as well as the progress in extending PanDA's capabilities to support supercomputers, clouds, leverage intelligent networking, while accommodating the ever growing needs of current users. PanDA has already demonstrated at a very large scale the value of automated data-aware dynamic brokering of diverse workloads across distributed computing resources. The next generation of PanDA will allow many data-intensive sciences employing a variety of computing platforms to benefit from ATLAS' experience and proven tools in highly scalable processing. |
id | cern-1551737 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2013 |
record_format | invenio |
spelling | cern-15517372019-09-30T06:29:59Zhttp://cds.cern.ch/record/1551737engDe, KKlimentov, AMaeno, TNilsson, PPanitkin, SVaniachine, AWenaus, TYu, DExtending the ATLAS PanDA Workload Management System for New Big Data ApplicationsDetectors and Experimental TechniquesThe LHC experiments are today at the leading edge of large scale distributed data-intensive computational science. The LHC's ATLAS experiment processes data volumes which are particularly extreme, over 130 PB to date, distributed worldwide at over of 120 sites. An important element in the success of the exciting physics results from ATLAS is the highly scalable integrated workflow and dataflow management afforded by the PanDA workload management system, used for all the distributed computing needs of the experiment. The PanDA design is not experiment specific and PanDA is now being extended to support other data intensive scientific applications. Alpha-Magnetic Spectrometer, an astro-particle experiment on the International Space Station, and the Compact Muon Solenoid, an LHC experiment, have successfully evaluated PanDA and are pursuing its adoption. PanDA was cited as an example of "a high performance, fault tolerant software for fast, scalable access to data repositories of many kinds" during the "Big Data Research and Development Initiative" announcement, a $200 million U.S. government investment in tools to handle huge volumes of digital data needed to spur science and engineering discoveries. In this talk, a description of the new program of work to develop a generic version of PanDA will be given, as well as the progress in extending PanDA's capabilities to support supercomputers, clouds, leverage intelligent networking, while accommodating the ever growing needs of current users. PanDA has already demonstrated at a very large scale the value of automated data-aware dynamic brokering of diverse workloads across distributed computing resources. The next generation of PanDA will allow many data-intensive sciences employing a variety of computing platforms to benefit from ATLAS' experience and proven tools in highly scalable processing.ATL-SOFT-SLIDE-2013-296oai:cds.cern.ch:15517372013-05-29 |
spellingShingle | Detectors and Experimental Techniques De, K Klimentov, A Maeno, T Nilsson, P Panitkin, S Vaniachine, A Wenaus, T Yu, D Extending the ATLAS PanDA Workload Management System for New Big Data Applications |
title | Extending the ATLAS PanDA Workload Management System for New Big Data Applications |
title_full | Extending the ATLAS PanDA Workload Management System for New Big Data Applications |
title_fullStr | Extending the ATLAS PanDA Workload Management System for New Big Data Applications |
title_full_unstemmed | Extending the ATLAS PanDA Workload Management System for New Big Data Applications |
title_short | Extending the ATLAS PanDA Workload Management System for New Big Data Applications |
title_sort | extending the atlas panda workload management system for new big data applications |
topic | Detectors and Experimental Techniques |
url | http://cds.cern.ch/record/1551737 |
work_keys_str_mv | AT dek extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications AT klimentova extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications AT maenot extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications AT nilssonp extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications AT panitkins extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications AT vaniachinea extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications AT wenaust extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications AT yud extendingtheatlaspandaworkloadmanagementsystemfornewbigdataapplications |