Cargando…
Towards an Event Streaming Service for ATLAS data processing
The ATLAS experiment at the LHC is gradually transitioning from the traditional file-based processing model to dynamic workflow management at the event level with the ATLAS Event Service (AES). The AES assigns fine-grained processing jobs to workers and streams out the data in quasi-real time, ensur...
Autores principales: | , , , , , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2018
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1051/epjconf/201921404034 http://cds.cern.ch/record/2649518 |
_version_ | 1780960743966900224 |
---|---|
author | Magini, Nicolo Di Girolamo, Alessandro Guan, Wen Wenaus, Torre Tsulaia, Vakhtang Lassnig, Mario Walker, Rodney Maeno, Tadashi Nilsson, Paul Brino, Alex |
author_facet | Magini, Nicolo Di Girolamo, Alessandro Guan, Wen Wenaus, Torre Tsulaia, Vakhtang Lassnig, Mario Walker, Rodney Maeno, Tadashi Nilsson, Paul Brino, Alex |
author_sort | Magini, Nicolo |
collection | CERN |
description | The ATLAS experiment at the LHC is gradually transitioning from the traditional file-based processing model to dynamic workflow management at the event level with the ATLAS Event Service (AES). The AES assigns fine-grained processing jobs to workers and streams out the data in quasi-real time, ensuring fully efficient utilization of all resources, including the most volatile. The next major step in this evolution is the possibility to intelligently stream the input data itself to workers. The Event Streaming Service (ESS) is now in development to asynchronously deliver only the input data required for processing when it is needed, protecting the application payload from WAN latency without creating expensive long-term replicas. In the current prototype implementation, ESS processes run on compute nodes in parallel to the payload, reading the input event ranges remotely over the network, and replicating them in small input files that are passed to the application. In this contribution, we present the performance of the ESS prototype for different types of workflows in comparison to tasks accessing remote data directly. Based on the experience gained with the current prototype, we are now moving to the development of a server-side component of the ESS. The service can evolve progressively into a powerful Content Delivery Network-like capability for data streaming, ultimately enabling the delivery of ’virtual data’ generated on demand. |
id | cern-2649518 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2018 |
record_format | invenio |
spelling | cern-26495182022-08-10T12:22:19Zdoi:10.1051/epjconf/201921404034http://cds.cern.ch/record/2649518engMagini, NicoloDi Girolamo, AlessandroGuan, WenWenaus, TorreTsulaia, VakhtangLassnig, MarioWalker, RodneyMaeno, TadashiNilsson, PaulBrino, AlexTowards an Event Streaming Service for ATLAS data processingParticle Physics - ExperimentThe ATLAS experiment at the LHC is gradually transitioning from the traditional file-based processing model to dynamic workflow management at the event level with the ATLAS Event Service (AES). The AES assigns fine-grained processing jobs to workers and streams out the data in quasi-real time, ensuring fully efficient utilization of all resources, including the most volatile. The next major step in this evolution is the possibility to intelligently stream the input data itself to workers. The Event Streaming Service (ESS) is now in development to asynchronously deliver only the input data required for processing when it is needed, protecting the application payload from WAN latency without creating expensive long-term replicas. In the current prototype implementation, ESS processes run on compute nodes in parallel to the payload, reading the input event ranges remotely over the network, and replicating them in small input files that are passed to the application. In this contribution, we present the performance of the ESS prototype for different types of workflows in comparison to tasks accessing remote data directly. Based on the experience gained with the current prototype, we are now moving to the development of a server-side component of the ESS. The service can evolve progressively into a powerful Content Delivery Network-like capability for data streaming, ultimately enabling the delivery of ’virtual data’ generated on demand.ATL-SOFT-PROC-2018-050oai:cds.cern.ch:26495182018-12-01 |
spellingShingle | Particle Physics - Experiment Magini, Nicolo Di Girolamo, Alessandro Guan, Wen Wenaus, Torre Tsulaia, Vakhtang Lassnig, Mario Walker, Rodney Maeno, Tadashi Nilsson, Paul Brino, Alex Towards an Event Streaming Service for ATLAS data processing |
title | Towards an Event Streaming Service for ATLAS data processing |
title_full | Towards an Event Streaming Service for ATLAS data processing |
title_fullStr | Towards an Event Streaming Service for ATLAS data processing |
title_full_unstemmed | Towards an Event Streaming Service for ATLAS data processing |
title_short | Towards an Event Streaming Service for ATLAS data processing |
title_sort | towards an event streaming service for atlas data processing |
topic | Particle Physics - Experiment |
url | https://dx.doi.org/10.1051/epjconf/201921404034 http://cds.cern.ch/record/2649518 |
work_keys_str_mv | AT magininicolo towardsaneventstreamingserviceforatlasdataprocessing AT digirolamoalessandro towardsaneventstreamingserviceforatlasdataprocessing AT guanwen towardsaneventstreamingserviceforatlasdataprocessing AT wenaustorre towardsaneventstreamingserviceforatlasdataprocessing AT tsulaiavakhtang towardsaneventstreamingserviceforatlasdataprocessing AT lassnigmario towardsaneventstreamingserviceforatlasdataprocessing AT walkerrodney towardsaneventstreamingserviceforatlasdataprocessing AT maenotadashi towardsaneventstreamingserviceforatlasdataprocessing AT nilssonpaul towardsaneventstreamingserviceforatlasdataprocessing AT brinoalex towardsaneventstreamingserviceforatlasdataprocessing |