Cargando…

A History-based Estimation for LHCb job requirements

The main goal of a Workload Management System (WMS) is to find and allocate resources for the given tasks. The more and better job information the WMS receives, the easier will be to accomplish its task, which directly translates into higher utilization of resources. Traditionally, the information a...

Descripción completa

Detalles Bibliográficos
Autor principal: Rauschmayr, Nathalie
Lenguaje:eng
Publicado: 2015
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/664/6/062050
http://cds.cern.ch/record/2134611
_version_ 1780949915716812800
author Rauschmayr, Nathalie
author_facet Rauschmayr, Nathalie
author_sort Rauschmayr, Nathalie
collection CERN
description The main goal of a Workload Management System (WMS) is to find and allocate resources for the given tasks. The more and better job information the WMS receives, the easier will be to accomplish its task, which directly translates into higher utilization of resources. Traditionally, the information associated with each job, like expected runtime, is defined beforehand by the Production Manager in best case and fixed arbitrary values by default. In the case of LHCb's Workload Management System no mechanisms are provided which automate the estimation of job requirements. As a result, much more CPU time is normally requested than actually needed. Particularly, in the context of multicore jobs this presents a major problem, since single- and multicore jobs shall share the same resources. Consequently, grid sites need to rely on estimations given by the VOs in order to not decrease the utilization of their worker nodes when making multicore job slots available. The main reason for going to multicore jobs is the reduction of the overall memory footprint. Therefore, it also needs to be studied how memory consumption of jobs can be estimated.A detailed workload analysis of past LHCb jobs is presented. It includes a study of job features and their correlation with runtime and memory consumption. Following the features, a supervised learning algorithm is developed based on a history based prediction. The aim is to learn over time how jobs’ runtime and memory evolve influenced due to changes in experiment conditions and software versions. It will be shown that estimation can be notably improved if experiment conditions are taken into account.
id oai-inspirehep.net-1413968
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2015
record_format invenio
spelling oai-inspirehep.net-14139682022-08-10T13:01:00Zdoi:10.1088/1742-6596/664/6/062050http://cds.cern.ch/record/2134611engRauschmayr, NathalieA History-based Estimation for LHCb job requirementsComputing and ComputersThe main goal of a Workload Management System (WMS) is to find and allocate resources for the given tasks. The more and better job information the WMS receives, the easier will be to accomplish its task, which directly translates into higher utilization of resources. Traditionally, the information associated with each job, like expected runtime, is defined beforehand by the Production Manager in best case and fixed arbitrary values by default. In the case of LHCb's Workload Management System no mechanisms are provided which automate the estimation of job requirements. As a result, much more CPU time is normally requested than actually needed. Particularly, in the context of multicore jobs this presents a major problem, since single- and multicore jobs shall share the same resources. Consequently, grid sites need to rely on estimations given by the VOs in order to not decrease the utilization of their worker nodes when making multicore job slots available. The main reason for going to multicore jobs is the reduction of the overall memory footprint. Therefore, it also needs to be studied how memory consumption of jobs can be estimated.A detailed workload analysis of past LHCb jobs is presented. It includes a study of job features and their correlation with runtime and memory consumption. Following the features, a supervised learning algorithm is developed based on a history based prediction. The aim is to learn over time how jobs’ runtime and memory evolve influenced due to changes in experiment conditions and software versions. It will be shown that estimation can be notably improved if experiment conditions are taken into account.oai:inspirehep.net:14139682015
spellingShingle Computing and Computers
Rauschmayr, Nathalie
A History-based Estimation for LHCb job requirements
title A History-based Estimation for LHCb job requirements
title_full A History-based Estimation for LHCb job requirements
title_fullStr A History-based Estimation for LHCb job requirements
title_full_unstemmed A History-based Estimation for LHCb job requirements
title_short A History-based Estimation for LHCb job requirements
title_sort history-based estimation for lhcb job requirements
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/664/6/062050
http://cds.cern.ch/record/2134611
work_keys_str_mv AT rauschmayrnathalie ahistorybasedestimationforlhcbjobrequirements
AT rauschmayrnathalie historybasedestimationforlhcbjobrequirements