Cargando…

Estimating job runtime for CMS analysis jobs

The basic premise of pilot systems is to create an overlay scheduling system on top of leased resources. And by definition, leases have a limited lifetime, so any job that is scheduled on such resources must finish before the lease is over, or it will be killed and all the computation wasted. In ord...

Descripción completa

Detalles Bibliográficos
Autor principal: Sfiligoi, Igor
Lenguaje:eng
Publicado: 2013
Materias:
Acceso en línea:http://cds.cern.ch/record/1619654
_version_ 1780933023984779264
author Sfiligoi, Igor
author_facet Sfiligoi, Igor
author_sort Sfiligoi, Igor
collection CERN
description The basic premise of pilot systems is to create an overlay scheduling system on top of leased resources. And by definition, leases have a limited lifetime, so any job that is scheduled on such resources must finish before the lease is over, or it will be killed and all the computation wasted. In order to effectively schedule jobs to resources, the pilot system thus requires the expected runtime of the users jobs. Past studies have shown that relying on user provided estimates is not a valid strategy, so the system should try to make an estimate by itself. This paper provides a study of the historical data obtained from the CMS Analysis Operations submission system. Clear patterns are observed, suggesting that making prediction of an expected job lifetime range is achievable with high confidence level in this environment.
id cern-1619654
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2013
record_format invenio
spelling cern-16196542019-09-30T06:29:59Zhttp://cds.cern.ch/record/1619654engSfiligoi, IgorEstimating job runtime for CMS analysis jobsDetectors and Experimental TechniquesThe basic premise of pilot systems is to create an overlay scheduling system on top of leased resources. And by definition, leases have a limited lifetime, so any job that is scheduled on such resources must finish before the lease is over, or it will be killed and all the computation wasted. In order to effectively schedule jobs to resources, the pilot system thus requires the expected runtime of the users jobs. Past studies have shown that relying on user provided estimates is not a valid strategy, so the system should try to make an estimate by itself. This paper provides a study of the historical data obtained from the CMS Analysis Operations submission system. Clear patterns are observed, suggesting that making prediction of an expected job lifetime range is achievable with high confidence level in this environment.CMS-CR-2013-340oai:cds.cern.ch:16196542013-10-18
spellingShingle Detectors and Experimental Techniques
Sfiligoi, Igor
Estimating job runtime for CMS analysis jobs
title Estimating job runtime for CMS analysis jobs
title_full Estimating job runtime for CMS analysis jobs
title_fullStr Estimating job runtime for CMS analysis jobs
title_full_unstemmed Estimating job runtime for CMS analysis jobs
title_short Estimating job runtime for CMS analysis jobs
title_sort estimating job runtime for cms analysis jobs
topic Detectors and Experimental Techniques
url http://cds.cern.ch/record/1619654
work_keys_str_mv AT sfiligoiigor estimatingjobruntimeforcmsanalysisjobs