First results from a combined analysis of CERN computing infrastructure metrics
The IT Analysis Working Group (AWG) has been formed at CERN across individual computing units and the experiments to attempt a cross-cutting analysis of computing infrastructure and application metrics. In this presentation we will describe the first results obtained using medium/long term data (1 m...
Main authors: | Duellmann, Dirk; Nieke, Christian |
---|---|
Language: | eng |
Published: | 2017 |
Subjects: | Computing and Computers |
Online access: | https://dx.doi.org/10.1088/1742-6596/898/7/072035 http://cds.cern.ch/record/2296798 |
_version_ | 1780956906016210944 |
---|---|
author | Duellmann, Dirk Nieke, Christian |
author_facet | Duellmann, Dirk Nieke, Christian |
author_sort | Duellmann, Dirk |
collection | CERN |
description | The IT Analysis Working Group (AWG) has been formed at CERN across individual computing units and the experiments to attempt a cross-cutting analysis of computing infrastructure and application metrics. In this presentation we will describe the first results obtained using medium/long-term data (1 month to 1 year) correlating box-level metrics, job-level metrics from LSF and HTCondor, IO metrics from the physics analysis disk pools (EOS), and networking and application-level metrics from the experiment dashboards. We will cover in particular the measurement of hardware performance and prediction of job duration, the latency sensitivity of different job types, and a search for bottlenecks with the production job mix in the current infrastructure. The presentation will conclude with the proposal of a small set of metrics to simplify drawing conclusions also in the more constrained environment of public cloud deployments. |
id | oai-inspirehep.net-1638561 |
institution | European Organization for Nuclear Research |
language | eng |
publishDate | 2017 |
record_format | invenio |
spelling | oai-inspirehep.net-1638561 2021-02-09T10:06:26Z doi:10.1088/1742-6596/898/7/072035 http://cds.cern.ch/record/2296798 eng Duellmann, Dirk Nieke, Christian First results from a combined analysis of CERN computing infrastructure metrics Computing and Computers The IT Analysis Working Group (AWG) has been formed at CERN across individual computing units and the experiments to attempt a cross-cutting analysis of computing infrastructure and application metrics. In this presentation we will describe the first results obtained using medium/long-term data (1 month to 1 year) correlating box-level metrics, job-level metrics from LSF and HTCondor, IO metrics from the physics analysis disk pools (EOS), and networking and application-level metrics from the experiment dashboards. We will cover in particular the measurement of hardware performance and prediction of job duration, the latency sensitivity of different job types, and a search for bottlenecks with the production job mix in the current infrastructure. The presentation will conclude with the proposal of a small set of metrics to simplify drawing conclusions also in the more constrained environment of public cloud deployments. oai:inspirehep.net:1638561 2017 |
spellingShingle | Computing and Computers Duellmann, Dirk Nieke, Christian First results from a combined analysis of CERN computing infrastructure metrics |
title | First results from a combined analysis of CERN computing infrastructure metrics |
title_full | First results from a combined analysis of CERN computing infrastructure metrics |
title_fullStr | First results from a combined analysis of CERN computing infrastructure metrics |
title_full_unstemmed | First results from a combined analysis of CERN computing infrastructure metrics |
title_short | First results from a combined analysis of CERN computing infrastructure metrics |
title_sort | first results from a combined analysis of cern computing infrastructure metrics |
topic | Computing and Computers |
url | https://dx.doi.org/10.1088/1742-6596/898/7/072035 http://cds.cern.ch/record/2296798 |
work_keys_str_mv | AT duellmanndirk firstresultsfromacombinedanalysisofcerncomputinginfrastructuremetrics AT niekechristian firstresultsfromacombinedanalysisofcerncomputinginfrastructuremetrics |