
A scalable online monitoring system based on Elasticsearch for distributed data acquisition in CMS

The part of the CMS Data Acquisition (DAQ) system responsible for data readout and event building is a complex network of interdependent distributed applications. To ensure successful data taking, these programs have to be constantly monitored in order to facilitate the timeliness of necessary corre...


Bibliographic Details
Main authors: Andre, Jean-Marc, Behrens, Ulf, Branson, James, Brummer, Philipp, Chaze, Olivier, Cittolin, Sergio, da Silva Gomes, Diego, Darlea, Georgiana-Lavinia, Deldicque, Christian, Demiragli, Zeynep, Dobson, Marc, Doualot, Nicolas, Erhan, Samim, Fulcher, Jonathan Richard, Gigi, Dominique, Gladki, Maciej, Glege, Frank, Gomez-Ceballos, Guillelmo, Hegeman, Jeroen, Holzner, Andre, Janulis, Mindaugas, Lettrich, Michael, Mecionis, Audrius, Meijers, Frans, Meschi, Emilio, Mommsen, Remigius K, Morovic, Srecko, O'Dell, Vivian, Orsini, Luciano, Papakrivopoulos, Ioannis, Paus, Christoph, Petrova, Petia, Petrucci, Andrea, Pieri, Marco, Rabady, Dinyar, Racz, Attila, Rapsevicius, Valdas, Reis, Thomas, Sakulin, Hannes, Schwick, Christoph, Simelevicius, Dainius, Stankevicius, Mantas, Vazquez Velez, Cristina, Vougioukas, Michail, Wernet, Christian, Zejdl, Petr
Language: eng
Published: 2019
Subjects: Detectors and Experimental Techniques; Computing and Computers
Online access: https://dx.doi.org/10.1051/epjconf/201921401048
http://cds.cern.ch/record/2701775
author Andre, Jean-Marc
Behrens, Ulf
Branson, James
Brummer, Philipp
Chaze, Olivier
Cittolin, Sergio
da Silva Gomes, Diego
Darlea, Georgiana-Lavinia
Deldicque, Christian
Demiragli, Zeynep
Dobson, Marc
Doualot, Nicolas
Erhan, Samim
Fulcher, Jonathan Richard
Gigi, Dominique
Gladki, Maciej
Glege, Frank
Gomez-Ceballos, Guillelmo
Hegeman, Jeroen
Holzner, Andre
Janulis, Mindaugas
Lettrich, Michael
Mecionis, Audrius
Meijers, Frans
Meschi, Emilio
Mommsen, Remigius K
Morovic, Srecko
O'Dell, Vivian
Orsini, Luciano
Papakrivopoulos, Ioannis
Paus, Christoph
Petrova, Petia
Petrucci, Andrea
Pieri, Marco
Rabady, Dinyar
Racz, Attila
Rapsevicius, Valdas
Reis, Thomas
Sakulin, Hannes
Schwick, Christoph
Simelevicius, Dainius
Stankevicius, Mantas
Vazquez Velez, Cristina
Vougioukas, Michail
Wernet, Christian
Zejdl, Petr
author_sort Andre, Jean-Marc
collection CERN
description The part of the CMS Data Acquisition (DAQ) system responsible for data readout and event building is a complex network of interdependent distributed applications. To ensure successful data taking, these programs must be monitored constantly so that corrections can be made promptly whenever behaviour deviates from the specification. A large number of diverse monitoring data samples are collected periodically from multiple sources across the network. Monitoring data are kept in memory for online operations and optionally stored on disk for post-mortem analysis. We present a generic, reusable solution based on an open-source NoSQL database, Elasticsearch, which is fully compatible with the existing system and non-intrusive to it. The motivation is to benefit from off-the-shelf software in order to ease development, maintenance and support. Elasticsearch provides failover and data redundancy capabilities as well as a programming-language-independent JSON-over-HTTP interface. Its horizontal scaling matches the requirements of a DAQ monitoring system. The data load from all sources is balanced by redistribution over an Elasticsearch cluster that can be hosted on a computer cloud. To achieve the necessary robustness and to validate the scalability of the approach, the monitoring solution currently runs in parallel with an existing DAQ monitoring system developed in-house.
id oai-inspirehep.net-1760542
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2019
record_format invenio
title A scalable online monitoring system based on Elasticsearch for distributed data acquisition in CMS
topic Detectors and Experimental Techniques
Computing and Computers
url https://dx.doi.org/10.1051/epjconf/201921401048
http://cds.cern.ch/record/2701775