Cargando…

Towards an Event Streaming Service for ATLAS data processing

The ATLAS experiment is gradually transitioning from the traditional file-based processing model to dynamic workflow management at the event level with the ATLAS Event Service (AES). The AES assigns fine-grained processing jobs to workers and streams out the data in quasi-real time, ensuring fully e...

Descripción completa

Detalles Bibliográficos
Autores principales: Magini, Nicolo, Tsulaia, Vakhtang, Di Girolamo, Alessandro, Wenaus, Torre, Guan, Wen, Lassnig, Mario, Walker, Rodney, Nilsson, Paul, Maeno, Tadashi
Lenguaje:eng
Publicado: 2018
Materias:
Acceso en línea:http://cds.cern.ch/record/2628504
_version_ 1780959172280451072
author Magini, Nicolo
Tsulaia, Vakhtang
Di Girolamo, Alessandro
Wenaus, Torre
Guan, Wen
Lassnig, Mario
Walker, Rodney
Nilsson, Paul
Maeno, Tadashi
author_facet Magini, Nicolo
Tsulaia, Vakhtang
Di Girolamo, Alessandro
Wenaus, Torre
Guan, Wen
Lassnig, Mario
Walker, Rodney
Nilsson, Paul
Maeno, Tadashi
author_sort Magini, Nicolo
collection CERN
description The ATLAS experiment is gradually transitioning from the traditional file-based processing model to dynamic workflow management at the event level with the ATLAS Event Service (AES). The AES assigns fine-grained processing jobs to workers and streams out the data in quasi-real time, ensuring fully efficient utilization of all resources, including the most volatile. The next major step in this evolution is the possibility to intelligently stream the input data itself to workers. The Event Streaming Service (ESS) is now in development to asynchronously deliver only the input data required for processing when it is needed, protecting the application payload from WAN latency without creating expensive long-term replicas. In the current prototype implementation, ESS processes run on compute nodes in parallel to the payload, reading the input event ranges remotely over the network, and replicating them in small input files that are passed to the application. In this contribution, we present the performance of the ESS prototype for different types of workflows in comparison to tasks accessing remote data directly. Based on the experience gained with the current prototype, we are now moving to the development of a server-side component of the ESS. The service can evolve progressively into a powerful CDN-like capability for data streaming, ultimately enabling the delivery of 'virtual data' generated on demand.
id cern-2628504
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2018
record_format invenio
spelling cern-26285042019-11-12T10:07:22Zhttp://cds.cern.ch/record/2628504engMagini, NicoloTsulaia, VakhtangDi Girolamo, AlessandroWenaus, TorreGuan, WenLassnig, MarioWalker, RodneyNilsson, PaulMaeno, TadashiTowards an Event Streaming Service for ATLAS data processingParticle Physics - ExperimentThe ATLAS experiment is gradually transitioning from the traditional file-based processing model to dynamic workflow management at the event level with the ATLAS Event Service (AES). The AES assigns fine-grained processing jobs to workers and streams out the data in quasi-real time, ensuring fully efficient utilization of all resources, including the most volatile. The next major step in this evolution is the possibility to intelligently stream the input data itself to workers. The Event Streaming Service (ESS) is now in development to asynchronously deliver only the input data required for processing when it is needed, protecting the application payload from WAN latency without creating expensive long-term replicas. In the current prototype implementation, ESS processes run on compute nodes in parallel to the payload, reading the input event ranges remotely over the network, and replicating them in small input files that are passed to the application. In this contribution, we present the performance of the ESS prototype for different types of workflows in comparison to tasks accessing remote data directly. Based on the experience gained with the current prototype, we are now moving to the development of a server-side component of the ESS. The service can evolve progressively into a powerful CDN-like capability for data streaming, ultimately enabling the delivery of 'virtual data' generated on demand.ATL-SOFT-SLIDE-2018-461oai:cds.cern.ch:26285042018-07-04
spellingShingle Particle Physics - Experiment
Magini, Nicolo
Tsulaia, Vakhtang
Di Girolamo, Alessandro
Wenaus, Torre
Guan, Wen
Lassnig, Mario
Walker, Rodney
Nilsson, Paul
Maeno, Tadashi
Towards an Event Streaming Service for ATLAS data processing
title Towards an Event Streaming Service for ATLAS data processing
title_full Towards an Event Streaming Service for ATLAS data processing
title_fullStr Towards an Event Streaming Service for ATLAS data processing
title_full_unstemmed Towards an Event Streaming Service for ATLAS data processing
title_short Towards an Event Streaming Service for ATLAS data processing
title_sort towards an event streaming service for atlas data processing
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2628504
work_keys_str_mv AT magininicolo towardsaneventstreamingserviceforatlasdataprocessing
AT tsulaiavakhtang towardsaneventstreamingserviceforatlasdataprocessing
AT digirolamoalessandro towardsaneventstreamingserviceforatlasdataprocessing
AT wenaustorre towardsaneventstreamingserviceforatlasdataprocessing
AT guanwen towardsaneventstreamingserviceforatlasdataprocessing
AT lassnigmario towardsaneventstreamingserviceforatlasdataprocessing
AT walkerrodney towardsaneventstreamingserviceforatlasdataprocessing
AT nilssonpaul towardsaneventstreamingserviceforatlasdataprocessing
AT maenotadashi towardsaneventstreamingserviceforatlasdataprocessing