Data intensive ATLAS workflows in the Cloud

This contribution reports on the feasibility of executing data intensive workflows on Cloud infrastructures. In order to assess this, the metric ETC = Events/Time/Cost is formed, which quantifies the different workflow and infrastructure configurations that are tested against each other. In these te...

Bibliographic Details
Main authors: Rzehorz, Gerhard Ferdinand; Keeble, Oliver; Quadt, Arnulf; Kawamura, Gen
Language: eng
Published: 2017
Subjects: Particle Physics - Experiment
Online access: https://dx.doi.org/10.1088/1742-6596/898/6/062008
http://cds.cern.ch/record/2242940
_version_ 1780953285755142144
author Rzehorz, Gerhard Ferdinand
Keeble, Oliver
Quadt, Arnulf
Kawamura, Gen
author_facet Rzehorz, Gerhard Ferdinand
Keeble, Oliver
Quadt, Arnulf
Kawamura, Gen
author_sort Rzehorz, Gerhard Ferdinand
collection CERN
description This contribution reports on the feasibility of executing data intensive workflows on Cloud infrastructures. In order to assess this, the metric ETC = Events/Time/Cost is formed, which quantifies the different workflow and infrastructure configurations that are tested against each other. In these tests ATLAS reconstruction Jobs are run, examining the effects of overcommitting (more parallel processes running than CPU cores available), scheduling (staggered execution) and scaling (number of cores). The desirability of commissioning storage in the Cloud is evaluated, in conjunction with a simple analytical model of the system, and correlated with questions about the network bandwidth, caches and what kind of storage to utilise. In the end a cost/benefit evaluation of different infrastructure configurations and workflows is undertaken, with the goal to find the maximum of the ETC value.
id cern-2242940
institution European Organization for Nuclear Research
language eng
publishDate 2017
record_format invenio
spelling cern-2242940
2019-10-15T15:17:12Z
doi:10.1088/1742-6596/898/6/062008
http://cds.cern.ch/record/2242940
eng
Rzehorz, Gerhard Ferdinand
Keeble, Oliver
Quadt, Arnulf
Kawamura, Gen
Data intensive ATLAS workflows in the Cloud
Particle Physics - Experiment
This contribution reports on the feasibility of executing data intensive workflows on Cloud infrastructures. In order to assess this, the metric ETC = Events/Time/Cost is formed, which quantifies the different workflow and infrastructure configurations that are tested against each other. In these tests ATLAS reconstruction Jobs are run, examining the effects of overcommitting (more parallel processes running than CPU cores available), scheduling (staggered execution) and scaling (number of cores). The desirability of commissioning storage in the Cloud is evaluated, in conjunction with a simple analytical model of the system, and correlated with questions about the network bandwidth, caches and what kind of storage to utilise. In the end a cost/benefit evaluation of different infrastructure configurations and workflows is undertaken, with the goal to find the maximum of the ETC value.
ATL-SOFT-PROC-2017-020
oai:cds.cern.ch:2242940
2017-01-25
spellingShingle Particle Physics - Experiment
Rzehorz, Gerhard Ferdinand
Keeble, Oliver
Quadt, Arnulf
Kawamura, Gen
Data intensive ATLAS workflows in the Cloud
title Data intensive ATLAS workflows in the Cloud
title_full Data intensive ATLAS workflows in the Cloud
title_fullStr Data intensive ATLAS workflows in the Cloud
title_full_unstemmed Data intensive ATLAS workflows in the Cloud
title_short Data intensive ATLAS workflows in the Cloud
title_sort data intensive atlas workflows in the cloud
topic Particle Physics - Experiment
url https://dx.doi.org/10.1088/1742-6596/898/6/062008
http://cds.cern.ch/record/2242940
work_keys_str_mv AT rzehorzgerhardferdinand dataintensiveatlasworkflowsinthecloud
AT keebleoliver dataintensiveatlasworkflowsinthecloud
AT quadtarnulf dataintensiveatlasworkflowsinthecloud
AT kawamuragen dataintensiveatlasworkflowsinthecloud