Cargando…
A scalable online monitoring system based on Elasticsearch for distributed data acquisition in CMS
The part of the CMS Data Acquisition (DAQ) system responsible for data readout and event building is a complex network of interdependent distributed applications. To ensure successful data taking, these programs have to be constantly monitored in order to facilitate the timeliness of necessary corre...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2019
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1051/epjconf/201921401048 http://cds.cern.ch/record/2701775 |
_version_ | 1780964594301272064 |
---|---|
author | Andre, Jean-Marc Behrens, Ulf Branson, James Brummer, Philipp Chaze, Olivier Cittolin, Sergio da Silva Gomes, Diego Darlea, Georgiana-Lavinia Deldicque, Christian Demiragli, Zeynep Dobson, Marc Doualot, Nicolas Erhan, Samim Fulcher, Jonathan Richard Gigi, Dominique Gladki, Maciej Glege, Frank Gomez-Ceballos, Guillelmo Hegeman, Jeroen Holzner, Andre Janulis, Mindaugas Lettrich, Michael Mecionis, Audrius Meijers, Frans Meschi, Emilio Mommsen, Remigius K Morovic, Srecko O'Dell, Vivian Orsini, Luciano Papakrivopoulos, Ioannis Paus, Christoph Petrova, Petia Petrucci, Andrea Pieri, Marco Rabady, Dinyar Racz, Attila Rapsevicius, Valdas Reis, Thomas Sakulin, Hannes Schwick, Christoph Simelevicius, Dainius Stankevicius, Mantas Vazquez Velez, Cristina Vougioukas, Michail Wernet, Christian Zejdl, Petr |
author_facet | Andre, Jean-Marc Behrens, Ulf Branson, James Brummer, Philipp Chaze, Olivier Cittolin, Sergio da Silva Gomes, Diego Darlea, Georgiana-Lavinia Deldicque, Christian Demiragli, Zeynep Dobson, Marc Doualot, Nicolas Erhan, Samim Fulcher, Jonathan Richard Gigi, Dominique Gladki, Maciej Glege, Frank Gomez-Ceballos, Guillelmo Hegeman, Jeroen Holzner, Andre Janulis, Mindaugas Lettrich, Michael Mecionis, Audrius Meijers, Frans Meschi, Emilio Mommsen, Remigius K Morovic, Srecko O'Dell, Vivian Orsini, Luciano Papakrivopoulos, Ioannis Paus, Christoph Petrova, Petia Petrucci, Andrea Pieri, Marco Rabady, Dinyar Racz, Attila Rapsevicius, Valdas Reis, Thomas Sakulin, Hannes Schwick, Christoph Simelevicius, Dainius Stankevicius, Mantas Vazquez Velez, Cristina Vougioukas, Michail Wernet, Christian Zejdl, Petr |
author_sort | Andre, Jean-Marc |
collection | CERN |
description | The part of the CMS Data Acquisition (DAQ) system responsible for data readout and event building is a complex network of interdependent distributed applications. To ensure successful data taking, these programs have to be constantly monitored in order to facilitate the timeliness of necessary corrections in case of any deviation from specified behaviour. A large number of diverse monitoring data samples are periodically collected from multiple sources across the network. Monitoring data are kept in memory for online operations and optionally stored on disk for post-mortem analysis. We present a generic, reusable solution based on an open source NoSQL database, Elasticsearch, which is fully compatible and non-intrusive with respect to the existing system. The motivation is to benefit from an off-the-shelf software to facilitate the development, maintenance and support efforts. Elasticsearch provides failover and data redundancy capabilities as well as a programming language independent JSON-over-HTTP interface. The possibility of horizontal scaling matches the requirements of a DAQ monitoring system. The data load from all sources is balanced by redistribution over an Elasticsearch cluster that can be hosted on a computer cloud. In order to achieve the necessary robustness and to validate the scalability of the approach the above monitoring solution currently runs in parallel with an existing in-house developed DAQ monitoring system. |
id | oai-inspirehep.net-1760542 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2019 |
record_format | invenio |
spelling | oai-inspirehep.net-17605422022-08-10T12:21:55Zdoi:10.1051/epjconf/201921401048http://cds.cern.ch/record/2701775engAndre, Jean-MarcBehrens, UlfBranson, JamesBrummer, PhilippChaze, OlivierCittolin, Sergioda Silva Gomes, DiegoDarlea, Georgiana-LaviniaDeldicque, ChristianDemiragli, ZeynepDobson, MarcDoualot, NicolasErhan, SamimFulcher, Jonathan RichardGigi, DominiqueGladki, MaciejGlege, FrankGomez-Ceballos, GuillelmoHegeman, JeroenHolzner, AndreJanulis, MindaugasLettrich, MichaelMecionis, AudriusMeijers, FransMeschi, EmilioMommsen, Remigius KMorovic, SreckoO'Dell, VivianOrsini, LucianoPapakrivopoulos, IoannisPaus, ChristophPetrova, PetiaPetrucci, AndreaPieri, MarcoRabady, DinyarRacz, AttilaRapsevicius, ValdasReis, ThomasSakulin, HannesSchwick, ChristophSimelevicius, DainiusStankevicius, MantasVazquez Velez, CristinaVougioukas, MichailWernet, ChristianZejdl, PetrA scalable online monitoring system based on Elasticsearch for distributed data acquisition in CMSDetectors and Experimental TechniquesComputing and ComputersThe part of the CMS Data Acquisition (DAQ) system responsible for data readout and event building is a complex network of interdependent distributed applications. To ensure successful data taking, these programs have to be constantly monitored in order to facilitate the timeliness of necessary corrections in case of any deviation from specified behaviour. A large number of diverse monitoring data samples are periodically collected from multiple sources across the network. Monitoring data are kept in memory for online operations and optionally stored on disk for post-mortem analysis. We present a generic, reusable solution based on an open source NoSQL database, Elasticsearch, which is fully compatible and non-intrusive with respect to the existing system. The motivation is to benefit from an off-the-shelf software to facilitate the development, maintenance and support efforts. Elasticsearch provides failover and data redundancy capabilities as well as a programming language independent JSON-over-HTTP interface. The possibility of horizontal scaling matches the requirements of a DAQ monitoring system. The data load from all sources is balanced by redistribution over an Elasticsearch cluster that can be hosted on a computer cloud. In order to achieve the necessary robustness and to validate the scalability of the approach the above monitoring solution currently runs in parallel with an existing in-house developed DAQ monitoring system.oai:inspirehep.net:17605422019 |
spellingShingle | Detectors and Experimental Techniques Computing and Computers Andre, Jean-Marc Behrens, Ulf Branson, James Brummer, Philipp Chaze, Olivier Cittolin, Sergio da Silva Gomes, Diego Darlea, Georgiana-Lavinia Deldicque, Christian Demiragli, Zeynep Dobson, Marc Doualot, Nicolas Erhan, Samim Fulcher, Jonathan Richard Gigi, Dominique Gladki, Maciej Glege, Frank Gomez-Ceballos, Guillelmo Hegeman, Jeroen Holzner, Andre Janulis, Mindaugas Lettrich, Michael Mecionis, Audrius Meijers, Frans Meschi, Emilio Mommsen, Remigius K Morovic, Srecko O'Dell, Vivian Orsini, Luciano Papakrivopoulos, Ioannis Paus, Christoph Petrova, Petia Petrucci, Andrea Pieri, Marco Rabady, Dinyar Racz, Attila Rapsevicius, Valdas Reis, Thomas Sakulin, Hannes Schwick, Christoph Simelevicius, Dainius Stankevicius, Mantas Vazquez Velez, Cristina Vougioukas, Michail Wernet, Christian Zejdl, Petr A scalable online monitoring system based on Elasticsearch for distributed data acquisition in CMS |
title | A scalable online monitoring system based on Elasticsearch for distributed data acquisition in CMS |
title_full | A scalable online monitoring system based on Elasticsearch for distributed data acquisition in CMS |
title_fullStr | A scalable online monitoring system based on Elasticsearch for distributed data acquisition in CMS |
title_full_unstemmed | A scalable online monitoring system based on Elasticsearch for distributed data acquisition in CMS |
title_short | A scalable online monitoring system based on Elasticsearch for distributed data acquisition in CMS |
title_sort | scalable online monitoring system based on elasticsearch for distributed data acquisition in cms |
topic | Detectors and Experimental Techniques Computing and Computers |
url | https://dx.doi.org/10.1051/epjconf/201921401048 http://cds.cern.ch/record/2701775 |
work_keys_str_mv | AT andrejeanmarc ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT behrensulf ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT bransonjames ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT brummerphilipp ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT chazeolivier ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT cittolinsergio ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT dasilvagomesdiego ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT darleageorgianalavinia ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT deldicquechristian ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT demiraglizeynep ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT dobsonmarc ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT doualotnicolas ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT erhansamim ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT fulcherjonathanrichard ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT gigidominique ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT gladkimaciej ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT glegefrank ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT gomezceballosguillelmo ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT hegemanjeroen ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT holznerandre ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT janulismindaugas ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT lettrichmichael ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT mecionisaudrius ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT meijersfrans ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT meschiemilio ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT mommsenremigiusk ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT morovicsrecko ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT odellvivian ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT orsiniluciano ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT papakrivopoulosioannis ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT pauschristoph ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT petrovapetia ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT petrucciandrea ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT pierimarco ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT rabadydinyar ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT raczattila ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT rapseviciusvaldas ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT reisthomas ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT sakulinhannes ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT schwickchristoph ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT simeleviciusdainius ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT stankeviciusmantas ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT vazquezvelezcristina ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT vougioukasmichail ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT wernetchristian ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT zejdlpetr ascalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT andrejeanmarc scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT behrensulf scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT bransonjames scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT brummerphilipp scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT chazeolivier scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT cittolinsergio scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT dasilvagomesdiego scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT darleageorgianalavinia scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT deldicquechristian scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT demiraglizeynep scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT dobsonmarc scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT doualotnicolas scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT erhansamim scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT fulcherjonathanrichard scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT gigidominique scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT gladkimaciej scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT glegefrank scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT gomezceballosguillelmo scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT hegemanjeroen scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT holznerandre scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT janulismindaugas scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT lettrichmichael scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT mecionisaudrius scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT meijersfrans scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT meschiemilio scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT mommsenremigiusk scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT morovicsrecko scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT odellvivian scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT orsiniluciano scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT papakrivopoulosioannis scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT pauschristoph scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT petrovapetia scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT petrucciandrea scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT pierimarco scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT rabadydinyar scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT raczattila scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT rapseviciusvaldas scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT reisthomas scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT sakulinhannes scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT schwickchristoph scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT simeleviciusdainius scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT stankeviciusmantas scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT vazquezvelezcristina scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT vougioukasmichail scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT wernetchristian scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms AT zejdlpetr scalableonlinemonitoringsystembasedonelasticsearchfordistributeddataacquisitionincms |