Cargando…

Support system for ATLAS distributed computing operations

The ATLAS distributed computing system has allowed the experiment to successfully meet the challenges of LHC Run 2. In order for distributed computing to operate smoothly and efficiently, several support teams are organized in the ATLAS experiment. The ADCoS (ATLAS Distributed Computing Operation Sh...

Descripción completa

Detalles Bibliográficos
Autor principal: Kishimoto, Tomoe
Lenguaje:eng
Publicado: 2018
Materias:
Acceso en línea:http://cds.cern.ch/record/2621615
_version_ 1780958521382141952
author Kishimoto, Tomoe
author_facet Kishimoto, Tomoe
author_sort Kishimoto, Tomoe
collection CERN
description The ATLAS distributed computing system has allowed the experiment to successfully meet the challenges of LHC Run 2. In order for distributed computing to operate smoothly and efficiently, several support teams are organized in the ATLAS experiment. The ADCoS (ATLAS Distributed Computing Operation Shifts) is a dedicated group of shifters who follow and report failing jobs, failing data transfers between sites, degradation of ATLAS central computing services, and more. The DAST (Distributed Analysis Support Team) provides user support to resolve issues related to running distributed analysis on the grid. The CRC (Computing Run Coordinator) maintains a global view of the day-to-day operations. In this presentation, the status and operational experience of the support system for ATLAS distributed computing in LHC Run 2 will be reported. This report also includes operations experience from the grid site point of view, and an analysis of the errors that create the biggest waste of wallclock time. The report of operations experience will focus on some of the more time-consuming tasks for shifters and grid sites, and on the introduction of new technologies, such as machine learning, to ease the work.
id cern-2621615
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2018
record_format invenio
spelling cern-26216152019-11-21T10:35:31Zhttp://cds.cern.ch/record/2621615engKishimoto, TomoeSupport system for ATLAS distributed computing operationsParticle Physics - ExperimentThe ATLAS distributed computing system has allowed the experiment to successfully meet the challenges of LHC Run 2. In order for distributed computing to operate smoothly and efficiently, several support teams are organized in the ATLAS experiment. The ADCoS (ATLAS Distributed Computing Operation Shifts) is a dedicated group of shifters who follow and report failing jobs, failing data transfers between sites, degradation of ATLAS central computing services, and more. The DAST (Distributed Analysis Support Team) provides user support to resolve issues related to running distributed analysis on the grid. The CRC (Computing Run Coordinator) maintains a global view of the day-to-day operations. In this presentation, the status and operational experience of the support system for ATLAS distributed computing in LHC Run 2 will be reported. This report also includes operations experience from the grid site point of view, and an analysis of the errors that create the biggest waste of wallclock time. The report of operations experience will focus on some of the more time-consuming tasks for shifters and grid sites, and on the introduction of new technologies, such as machine learning, to ease the work.ATL-SOFT-SLIDE-2018-327oai:cds.cern.ch:26216152018-06-04
spellingShingle Particle Physics - Experiment
Kishimoto, Tomoe
Support system for ATLAS distributed computing operations
title Support system for ATLAS distributed computing operations
title_full Support system for ATLAS distributed computing operations
title_fullStr Support system for ATLAS distributed computing operations
title_full_unstemmed Support system for ATLAS distributed computing operations
title_short Support system for ATLAS distributed computing operations
title_sort support system for atlas distributed computing operations
topic Particle Physics - Experiment
url http://cds.cern.ch/record/2621615
work_keys_str_mv AT kishimototomoe supportsystemforatlasdistributedcomputingoperations