Cargando…

ATLAS job submission system for Salomon HPC based on ARC-CE

The ATLAS experiment at CERN is using HPCs opportunistically to extend its computing capacity for years. To the Salomon HPC, ATLAS jobs come via ARC-CE machines located in the computing center of the Institute of Physics of the Czech Academy of Sciences. The ARC-CE serves as an interface between job...

Descripción completa

Detalles Bibliográficos
Autores principales: Svatos, Michal, Chudoba, Jiri, Vokac, Petr
Lenguaje:eng
Publicado: 2020
Materias:
Acceso en línea:http://cds.cern.ch/record/2707263
_version_ 1780964924192718848
author Svatos, Michal
Chudoba, Jiri
Vokac, Petr
author_facet Svatos, Michal
Chudoba, Jiri
Vokac, Petr
author_sort Svatos, Michal
collection CERN
description The ATLAS experiment at CERN is using HPCs opportunistically to extend its computing capacity for years. To the Salomon HPC, ATLAS jobs come via ARC-CE machines located in the computing center of the Institute of Physics of the Czech Academy of Sciences. The ARC-CE serves as an interface between job management systems of the ATLAS and the HPC. Commands of the PBSpro batch system are submitted via ssh. Scripts and input files are shared between the ARC-CE and shared file system located at the HPC via sshfs. There are several aspects of interaction between ARC-CE machines and Salomon's batch system which are important for performance of the whole system. First, the allowed amount of requests to PBSpro is limited and the ARC-CE needed to be adapted to this fact. Second, the sshfs connection speed seems to be a limiting factor for job turnaround. Some possibilities of sshfs parameters tuning were investigated. Moreover, monitoring allows quick detection of issues and therefore helps the performance of the system. The ARC-CE based job submission system has adapted to conditions of the Salomon HPC and utilizes successfully its resources.
id cern-2707263
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2020
record_format invenio
spelling cern-27072632020-01-27T19:39:46Zhttp://cds.cern.ch/record/2707263engSvatos, MichalChudoba, JiriVokac, PetrATLAS job submission system for Salomon HPC based on ARC-CEParticle Physics - ExperimentThe ATLAS experiment at CERN is using HPCs opportunistically to extend its computing capacity for years. To the Salomon HPC, ATLAS jobs come via ARC-CE machines located in the computing center of the Institute of Physics of the Czech Academy of Sciences. The ARC-CE serves as an interface between job management systems of the ATLAS and the HPC. Commands of the PBSpro batch system are submitted via ssh. Scripts and input files are shared between the ARC-CE and shared file system located at the HPC via sshfs. There are several aspects of interaction between ARC-CE machines and Salomon's batch system which are important for performance of the whole system. First, the allowed amount of requests to PBSpro is limited and the ARC-CE needed to be adapted to this fact. Second, the sshfs connection speed seems to be a limiting factor for job turnaround. Some possibilities of sshfs parameters tuning were investigated. Moreover, monitoring allows quick detection of issues and therefore helps the performance of the system. The ARC-CE based job submission system has adapted to conditions of the Salomon HPC and utilizes successfully its resources.ATL-SOFT-PROC-2020-001oai:cds.cern.ch:27072632020-01-27
spellingShingle Particle Physics - Experiment
Svatos, Michal
Chudoba, Jiri
Vokac, Petr
ATLAS job submission system for Salomon HPC based on ARC-CE
title ATLAS job submission system for Salomon HPC based on ARC-CE
title_full ATLAS job submission system for Salomon HPC based on ARC-CE
title_fullStr ATLAS job submission system for Salomon HPC based on ARC-CE
title_full_unstemmed ATLAS job submission system for Salomon HPC based on ARC-CE
title_short ATLAS job submission system for Salomon HPC based on ARC-CE
title_sort atlas job submission system for salomon hpc based on arc-ce
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2707263
work_keys_str_mv AT svatosmichal atlasjobsubmissionsystemforsalomonhpcbasedonarcce
AT chudobajiri atlasjobsubmissionsystemforsalomonhpcbasedonarcce
AT vokacpetr atlasjobsubmissionsystemforsalomonhpcbasedonarcce