Development of noSQL data storage for the ATLAS PanDA Monitoring System
For several years the PanDA Workload Management System has been the basis for distributed production and analysis for the ATLAS experiment at the LHC. Since the start of data taking, PanDA usage has ramped up steadily, typically exceeding 500k completed jobs/day by June 2011. The associated monitorin...
Main authors: | Ito, H; Potekhin, M; Wenaus, T |
---|---|
Language: | eng |
Published: | 2012 |
Subjects: | Computing and Computers |
Online access: | https://dx.doi.org/10.1088/1742-6596/396/5/052041 http://cds.cern.ch/record/1446655 |
_version_ | 1780924806896549888 |
---|---|
author | Ito, H Potekhin, M Wenaus, T |
author_facet | Ito, H Potekhin, M Wenaus, T |
author_sort | Ito, H |
collection | CERN |
description | For several years the PanDA Workload Management System has been the basis for distributed production and analysis for the ATLAS experiment at the LHC. Since the start of data taking, PanDA usage has ramped up steadily, typically exceeding 500k completed jobs/day by June 2011. The associated monitoring data volume has been rising as well, to levels that present a new set of challenges in the areas of database scalability and monitoring system performance and efficiency. These challenges are being met with an R&D effort aimed at implementing scalable and efficient monitoring data storage based on a noSQL solution (Cassandra). We present our motivations for using this technology, as well as the data design and the techniques used for efficient indexing of the data. We also discuss the hardware requirements as determined by testing with actual data and a realistic rate of queries. In conclusion, we present our experience with operating a Cassandra cluster over an extended period of time and with a data load adequate for the planned application. |
id | cern-1446655 |
institution | European Organization for Nuclear Research (CERN) |
language | eng |
publishDate | 2012 |
record_format | invenio |
spelling | cern-1446655 2019-09-30T06:29:59Z doi:10.1088/1742-6596/396/5/052041 http://cds.cern.ch/record/1446655 eng Ito, H Potekhin, M Wenaus, T Development of noSQL data storage for the ATLAS PanDA Monitoring System Computing and Computers For several years the PanDA Workload Management System has been the basis for distributed production and analysis for the ATLAS experiment at the LHC. Since the start of data taking PanDA usage has ramped up steadily, typically exceeding 500k completed jobs/day by June 2011. The associated monitoring data volume has been rising as well, to levels that present a new set of challenges in the areas of database scalability and monitoring system performance and efficiency. These challenges are being met with an R&D effort aimed at implementing a scalable and efficient monitoring data storage based on a noSQL solution (Cassandra). We present our motivations for using this technology, as well as data design and the techniques used for efficient indexing of the data. We also discuss the hardware requirements as they were determined by testing with actual data and realistic rate of queries. In conclusion, we present our experience with operating a Cassandra cluster over an extended period of time and with data load adequate for planned application. ATL-SOFT-PROC-2012-012 oai:cds.cern.ch:1446655 2012-05-08 |
spellingShingle | Computing and Computers Ito, H Potekhin, M Wenaus, T Development of noSQL data storage for the ATLAS PanDA Monitoring System |
title | Development of noSQL data storage for the ATLAS PanDA Monitoring System |
title_full | Development of noSQL data storage for the ATLAS PanDA Monitoring System |
title_fullStr | Development of noSQL data storage for the ATLAS PanDA Monitoring System |
title_full_unstemmed | Development of noSQL data storage for the ATLAS PanDA Monitoring System |
title_short | Development of noSQL data storage for the ATLAS PanDA Monitoring System |
title_sort | development of nosql data storage for the atlas panda monitoring system |
topic | Computing and Computers |
url | https://dx.doi.org/10.1088/1742-6596/396/5/052041 http://cds.cern.ch/record/1446655 |
work_keys_str_mv | AT itoh developmentofnosqldatastoragefortheatlaspandamonitoringsystem AT potekhinm developmentofnosqldatastoragefortheatlaspandamonitoringsystem AT wenaust developmentofnosqldatastoragefortheatlaspandamonitoringsystem |