Cargando…

First results from a combined analysis of CERN computing infrastructure metrics

The IT Analysis Working Group (AWG) has been formed at CERN across individual computing units and the experiments to attempt a cross cutting analysis of computing infrastructure and application metrics. In this presentation we will describe the first results obtained using medium/long term data (1 m...

Descripción completa

Detalles Bibliográficos
Autores principales: Duellmann, Dirk, Nieke, Christian
Lenguaje:eng
Publicado: 2017
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/898/7/072035
http://cds.cern.ch/record/2296798
_version_ 1780956906016210944
author Duellmann, Dirk
Nieke, Christian
author_facet Duellmann, Dirk
Nieke, Christian
author_sort Duellmann, Dirk
collection CERN
description The IT Analysis Working Group (AWG) has been formed at CERN across individual computing units and the experiments to attempt a cross cutting analysis of computing infrastructure and application metrics. In this presentation we will describe the first results obtained using medium/long term data (1 months — 1 year) correlating box level metrics, job level metrics from LSF and HTCondor, IO metrics from the physics analysis disk pools (EOS) and networking and application level metrics from the experiment dashboards. We will cover in particular the measurement of hardware performance and prediction of job duration, the latency sensitivity of different job types and a search for bottlenecks with the production job mix in the current infrastructure. The presentation will conclude with the proposal of a small set of metrics to simplify drawing conclusions also in the more constrained environment of public cloud deployments.
id oai-inspirehep.net-1638561
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2017
record_format invenio
spelling oai-inspirehep.net-16385612021-02-09T10:06:26Zdoi:10.1088/1742-6596/898/7/072035http://cds.cern.ch/record/2296798engDuellmann, DirkNieke, ChristianFirst results from a combined analysis of CERN computing infrastructure metricsComputing and ComputersThe IT Analysis Working Group (AWG) has been formed at CERN across individual computing units and the experiments to attempt a cross cutting analysis of computing infrastructure and application metrics. In this presentation we will describe the first results obtained using medium/long term data (1 months — 1 year) correlating box level metrics, job level metrics from LSF and HTCondor, IO metrics from the physics analysis disk pools (EOS) and networking and application level metrics from the experiment dashboards. We will cover in particular the measurement of hardware performance and prediction of job duration, the latency sensitivity of different job types and a search for bottlenecks with the production job mix in the current infrastructure. The presentation will conclude with the proposal of a small set of metrics to simplify drawing conclusions also in the more constrained environment of public cloud deployments.oai:inspirehep.net:16385612017
spellingShingle Computing and Computers
Duellmann, Dirk
Nieke, Christian
First results from a combined analysis of CERN computing infrastructure metrics
title First results from a combined analysis of CERN computing infrastructure metrics
title_full First results from a combined analysis of CERN computing infrastructure metrics
title_fullStr First results from a combined analysis of CERN computing infrastructure metrics
title_full_unstemmed First results from a combined analysis of CERN computing infrastructure metrics
title_short First results from a combined analysis of CERN computing infrastructure metrics
title_sort first results from a combined analysis of cern computing infrastructure metrics
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/898/7/072035
http://cds.cern.ch/record/2296798
work_keys_str_mv AT duellmanndirk firstresultsfromacombinedanalysisofcerncomputinginfrastructuremetrics
AT niekechristian firstresultsfromacombinedanalysisofcerncomputinginfrastructuremetrics