
A Scalable and Reliable Message Transport Service for the ATLAS Trigger and Data Acquisition System

The ATLAS Trigger and Data Acquisition (TDAQ) is a large distributed computing system composed of several thousand interconnected computers and tens of thousands of applications. During a run, TDAQ applications produce many control and information messages at variable rates, addressed to TDAQ...


Bibliographic Details
Main authors: Kazarov, A, Caprini, M, Kolos, S, Lehmann Miotto, G, Soloviev, I
Language: eng
Published: 2014
Subjects: Particle Physics - Experiment
Online access: https://dx.doi.org/10.1109/RTC.2014.7097447
http://cds.cern.ch/record/1703434
_version_ 1780936360317681664
author Kazarov, A
Caprini, M
Kolos, S
Lehmann Miotto, G
Soloviev, I
author_facet Kazarov, A
Caprini, M
Kolos, S
Lehmann Miotto, G
Soloviev, I
author_sort Kazarov, A
collection CERN
description The ATLAS Trigger and Data Acquisition (TDAQ) is a large distributed computing system composed of several thousand interconnected computers and tens of thousands of applications. During a run, TDAQ applications produce many control and information messages at variable rates, addressed to TDAQ operators or to other applications. Reliable, fast and accurate delivery of the messages is important for the functioning of the whole TDAQ system. The Message Transport Service (MTS) provides facilities for the reliable transport, filtering and routing of the messages, based on a publish-subscribe-notify communication pattern with content-based message filtering. During the ongoing LHC shutdown, the MTS was re-implemented, taking into account important requirements such as reliability, scalability and performance, the handling of slow subscribers, and simplicity of the design and implementation. MTS uses CORBA middleware, a common layer for the TDAQ infrastructure, and provides sending and subscribing APIs in the Java and C++ programming languages. The paper presents the design and implementation details of the MTS, as well as the results of performance and scalability tests executed on a computing farm with a number of workers and working conditions that reproduced a realistic TDAQ environment during ATLAS operations.
id cern-1703434
institution European Organization for Nuclear Research
language eng
publishDate 2014
record_format invenio
spelling cern-1703434 2019-09-30T06:29:59Z doi:10.1109/RTC.2014.7097447 http://cds.cern.ch/record/1703434 eng Kazarov, A; Caprini, M; Kolos, S; Lehmann Miotto, G; Soloviev, I. A Scalable and Reliable Message Transport Service for the ATLAS Trigger and Data Acquisition System. Particle Physics - Experiment. The ATLAS Trigger and Data Acquisition (TDAQ) is a large distributed computing system composed of several thousand interconnected computers and tens of thousands of applications. During a run, TDAQ applications produce many control and information messages at variable rates, addressed to TDAQ operators or to other applications. Reliable, fast and accurate delivery of the messages is important for the functioning of the whole TDAQ system. The Message Transport Service (MTS) provides facilities for the reliable transport, filtering and routing of the messages, based on a publish-subscribe-notify communication pattern with content-based message filtering. During the ongoing LHC shutdown, the MTS was re-implemented, taking into account important requirements such as reliability, scalability and performance, the handling of slow subscribers, and simplicity of the design and implementation. MTS uses CORBA middleware, a common layer for the TDAQ infrastructure, and provides sending and subscribing APIs in the Java and C++ programming languages. The paper presents the design and implementation details of the MTS, as well as the results of performance and scalability tests executed on a computing farm with a number of workers and working conditions that reproduced a realistic TDAQ environment during ATLAS operations. ATL-DAQ-PROC-2014-006 oai:cds.cern.ch:1703434 2014-05-22
spellingShingle Particle Physics - Experiment
Kazarov, A
Caprini, M
Kolos, S
Lehmann Miotto, G
Soloviev, I
A Scalable and Reliable Message Transport Service for the ATLAS Trigger and Data Acquisition System
title A Scalable and Reliable Message Transport Service for the ATLAS Trigger and Data Acquisition System
title_full A Scalable and Reliable Message Transport Service for the ATLAS Trigger and Data Acquisition System
title_fullStr A Scalable and Reliable Message Transport Service for the ATLAS Trigger and Data Acquisition System
title_full_unstemmed A Scalable and Reliable Message Transport Service for the ATLAS Trigger and Data Acquisition System
title_short A Scalable and Reliable Message Transport Service for the ATLAS Trigger and Data Acquisition System
title_sort scalable and reliable message transport service for the atlas trigger and data acquisition system
topic Particle Physics - Experiment
url https://dx.doi.org/10.1109/RTC.2014.7097447
http://cds.cern.ch/record/1703434
work_keys_str_mv AT kazarova ascalableandreliablemessagetransportservicefortheatlastriggeranddataacquisitionsystem
AT caprinim ascalableandreliablemessagetransportservicefortheatlastriggeranddataacquisitionsystem
AT koloss ascalableandreliablemessagetransportservicefortheatlastriggeranddataacquisitionsystem
AT lehmannmiottog ascalableandreliablemessagetransportservicefortheatlastriggeranddataacquisitionsystem
AT solovievi ascalableandreliablemessagetransportservicefortheatlastriggeranddataacquisitionsystem
AT kazarova scalableandreliablemessagetransportservicefortheatlastriggeranddataacquisitionsystem
AT caprinim scalableandreliablemessagetransportservicefortheatlastriggeranddataacquisitionsystem
AT koloss scalableandreliablemessagetransportservicefortheatlastriggeranddataacquisitionsystem
AT lehmannmiottog scalableandreliablemessagetransportservicefortheatlastriggeranddataacquisitionsystem
AT solovievi scalableandreliablemessagetransportservicefortheatlastriggeranddataacquisitionsystem