
Analysis of data integrity and storage quality of a distributed storage system

CERN uses the world's largest scientific computing grid, WLCG, for distributed data storage and processing. Monitoring of the CPU and storage resources is essential to detect operational issues in its systems, for example in the storage elements, and to...


Bibliographic Details
Main author: Negru, Adrian-Eduard
Language: eng
Published: 2021
Subjects:
Online access: http://cds.cern.ch/record/2767588
_version_ 1780971314409897984
author Negru, Adrian-Eduard
author_facet Negru, Adrian-Eduard
author_sort Negru, Adrian-Eduard
collection CERN
description CERN uses the world's largest scientific computing grid, WLCG, for distributed data storage and processing. Monitoring of the CPU and storage resources is essential to detect operational issues in its systems, for example in the storage elements, and to ensure their proper and efficient function. The processing of experiment data depends strongly on the quality of data access as well as on data integrity, and both of these key parameters must be assured for the lifetime of the data. Given the substantial amount of data, O(200 PB), already collected by ALICE and kept at various storage elements around the globe, scanning every single data chunk would be very expensive, both in computing resources and in execution time. In this paper, we describe a distributed file crawler that addresses these natural limits by periodically extracting and analyzing statistically significant samples of files from storage elements. The crawler evaluates the results and is integrated with the existing monitoring solution, MonALISA.
id cern-2767588
institution European Organization for Nuclear Research (CERN)
language eng
publishDate 2021
record_format invenio
spelling cern-2767588 2022-11-02T22:25:25Z http://cds.cern.ch/record/2767588 eng Negru, Adrian-Eduard Analysis of data integrity and storage quality of a distributed storage system 25th International Conference on Computing in High Energy & Nuclear Physics Conferences oai:cds.cern.ch:2767588 2021
spellingShingle Conferences
Negru, Adrian-Eduard
Analysis of data integrity and storage quality of a distributed storage system
title Analysis of data integrity and storage quality of a distributed storage system
title_full Analysis of data integrity and storage quality of a distributed storage system
title_fullStr Analysis of data integrity and storage quality of a distributed storage system
title_full_unstemmed Analysis of data integrity and storage quality of a distributed storage system
title_short Analysis of data integrity and storage quality of a distributed storage system
title_sort analysis of data integrity and storage quality of a distributed storage system
topic Conferences
url http://cds.cern.ch/record/2767588
work_keys_str_mv AT negruadrianeduard analysisofdataintegrityandstoragequalityofadistributedstoragesystem
AT negruadrianeduard 25thinternationalconferenceoncomputinginhighenergynuclearphysics