Cargando…

Visual Cluster Analysis for Computing Tasks at Workflow Management System of the ATLAS Experiment at the LHC

Hundreds of petabytes of experimental data in high energy and nuclear physics (HENP) have already been obtained by unique scientific facilities, such as LHC, RHIC, KEK. As the accelerators are being modernized (energy and luminosity are increasing), data volumes are rapidly growing and have reached...

Descripción completa

Detalles Bibliográficos
Autores principales: Grigoryeva, Maria, Korchuganova, Tatiana, Padolski, Siarhei, Klimentov, Alexei, Titov, Mikhail
Lenguaje:eng
Publicado: 2018
Materias:
Acceso en línea:http://cds.cern.ch/record/2639961
Descripción
Sumario:Hundreds of petabytes of experimental data in high energy and nuclear physics (HENP) have already been obtained by unique scientific facilities, such as LHC, RHIC, KEK. As the accelerators are being modernized (energy and luminosity are increasing), data volumes are rapidly growing and have reached the exabyte scale, that also leads to an increasing the number of analysis and data processing tasks, that are competing continuously for computational resources. The increase in processing tasks causes an increase in the performance of the computing infrastructure by the involvement of high-performance computing resources and forming a heterogeneous distributed computing environment (hundreds of distributed computing centers). With a distributed model of data processing and analysis, the optimization of data management and workload systems becomes a critical task, and the absence of an adequate solution leads to economic, functional and time losses. This work describes the first stage of the study aiming at solving the task of increasing the stability and efficiency of workflow management systems for mega-science experiments by using visual analytics methods. Using the case of the ATLAS experiment at LHC the visual methods for cluster analysis of the workload management system computing tasks/jobs will be applied. The interdependencies and correlations between various tasks/jobs parameters will be investigated and graphically interpreted in N-dimensional space using 3D projections. Visual analysis allows to identify similar jobs, as well as anomalous jobs, and to determine what causes such anomaly. A further evolvement of the work in this direction will be focused on the increasing the amount of analysed computing jobs and the development of the appropriate infrastructure.