Cargando…
Data intensive ATLAS workflows in the Cloud
This contribution reports on the feasibility of executing data intensive workflows on Cloud infrastructures. In order to assess this, the metric ETC = Events/Time/Cost is formed, which quantifies the different workflow and infrastructure configurations that are tested against each other. In these te...
Autores principales: | , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2017
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/898/6/062008 http://cds.cern.ch/record/2242940 |
_version_ | 1780953285755142144 |
---|---|
author | Rzehorz, Gerhard Ferdinand Keeble, Oliver Quadt, Arnulf Kawamura, Gen |
author_facet | Rzehorz, Gerhard Ferdinand Keeble, Oliver Quadt, Arnulf Kawamura, Gen |
author_sort | Rzehorz, Gerhard Ferdinand |
collection | CERN |
description | This contribution reports on the feasibility of executing data intensive workflows on Cloud infrastructures. In order to assess this, the metric ETC = Events/Time/Cost is formed, which quantifies the different workflow and infrastructure configurations that are tested against each other. In these tests ATLAS reconstruction Jobs are run, examining the effects of overcommitting (more parallel processes running than CPU cores available), scheduling (staggered execution) and scaling (number of cores). The desirability of commissioning storage in the Cloud is evaluated, in conjunction with a simple analytical model of the system, and correlated with questions about the network bandwidth, caches and what kind of storage to utilise. In the end a cost/benefit evaluation of different infrastructure configurations and workflows is undertaken, with the goal to find the maximum of the ETC value. |
id | cern-2242940 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2017 |
record_format | invenio |
spelling | cern-22429402019-10-15T15:17:12Zdoi:10.1088/1742-6596/898/6/062008http://cds.cern.ch/record/2242940engRzehorz, Gerhard FerdinandKeeble, OliverQuadt, ArnulfKawamura, GenData intensive ATLAS workflows in the CloudParticle Physics - ExperimentThis contribution reports on the feasibility of executing data intensive workflows on Cloud infrastructures. In order to assess this, the metric ETC = Events/Time/Cost is formed, which quantifies the different workflow and infrastructure configurations that are tested against each other. In these tests ATLAS reconstruction Jobs are run, examining the effects of overcommitting (more parallel processes running than CPU cores available), scheduling (staggered execution) and scaling (number of cores). The desirability of commissioning storage in the Cloud is evaluated, in conjunction with a simple analytical model of the system, and correlated with questions about the network bandwidth, caches and what kind of storage to utilise. In the end a cost/benefit evaluation of different infrastructure configurations and workflows is undertaken, with the goal to find the maximum of the ETC value.ATL-SOFT-PROC-2017-020oai:cds.cern.ch:22429402017-01-25 |
spellingShingle | Particle Physics - Experiment Rzehorz, Gerhard Ferdinand Keeble, Oliver Quadt, Arnulf Kawamura, Gen Data intensive ATLAS workflows in the Cloud |
title | Data intensive ATLAS workflows in the Cloud |
title_full | Data intensive ATLAS workflows in the Cloud |
title_fullStr | Data intensive ATLAS workflows in the Cloud |
title_full_unstemmed | Data intensive ATLAS workflows in the Cloud |
title_short | Data intensive ATLAS workflows in the Cloud |
title_sort | data intensive atlas workflows in the cloud |
topic | Particle Physics - Experiment |
url | https://dx.doi.org/10.1088/1742-6596/898/6/062008 http://cds.cern.ch/record/2242940 |
work_keys_str_mv | AT rzehorzgerhardferdinand dataintensiveatlasworkflowsinthecloud AT keebleoliver dataintensiveatlasworkflowsinthecloud AT quadtarnulf dataintensiveatlasworkflowsinthecloud AT kawamuragen dataintensiveatlasworkflowsinthecloud |