Cargando…

A Probabilistic Analysis of Data Popularity in ATLAS Data Caching

One of the most important aspects in any distribution system is efficient data replication over storage / computing centers, that guarantees high data availability and low cost of resources utilization. In this paper we propose a data distribution scheme for the production and distributed analysis s...

Descripción completa

Detalles Bibliográficos
Autores principales: Tito, M, Záruba, G, Klimentov, A, De, K
Lenguaje:eng
Publicado: 2012
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/396/3/032106
http://cds.cern.ch/record/1448603
_version_ 1780924861982441472
author Tito, M
Záruba, G
Klimentov, A
De, K
author_facet Tito, M
Záruba, G
Klimentov, A
De, K
author_sort Tito, M
collection CERN
description One of the most important aspects in any distribution system is efficient data replication over storage / computing centers, that guarantees high data availability and low cost of resources utilization. In this paper we propose a data distribution scheme for the production and distributed analysis system PanDA at the ATLAS experiment. Our proposed scheme is based on an investigation of data usage. Thus, the paper is focused on the main concepts of data popularity at the PanDA system and their utilization. Data popularity is represented as the set of parameters that are used to predict the future data state in terms of popularity levels.
id cern-1448603
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2012
record_format invenio
spelling cern-14486032019-09-30T06:29:59Zdoi:10.1088/1742-6596/396/3/032106http://cds.cern.ch/record/1448603engTito, MZáruba, GKlimentov, ADe, KA Probabilistic Analysis of Data Popularity in ATLAS Data CachingDetectors and Experimental TechniquesOne of the most important aspects in any distribution system is efficient data replication over storage / computing centers, that guarantees high data availability and low cost of resources utilization. In this paper we propose a data distribution scheme for the production and distributed analysis system PanDA at the ATLAS experiment. Our proposed scheme is based on an investigation of data usage. Thus, the paper is focused on the main concepts of data popularity at the PanDA system and their utilization. Data popularity is represented as the set of parameters that are used to predict the future data state in terms of popularity levels.ATL-SOFT-PROC-2012-022oai:cds.cern.ch:14486032012-05-14
spellingShingle Detectors and Experimental Techniques
Tito, M
Záruba, G
Klimentov, A
De, K
A Probabilistic Analysis of Data Popularity in ATLAS Data Caching
title A Probabilistic Analysis of Data Popularity in ATLAS Data Caching
title_full A Probabilistic Analysis of Data Popularity in ATLAS Data Caching
title_fullStr A Probabilistic Analysis of Data Popularity in ATLAS Data Caching
title_full_unstemmed A Probabilistic Analysis of Data Popularity in ATLAS Data Caching
title_short A Probabilistic Analysis of Data Popularity in ATLAS Data Caching
title_sort probabilistic analysis of data popularity in atlas data caching
topic Detectors and Experimental Techniques
url https://dx.doi.org/10.1088/1742-6596/396/3/032106
http://cds.cern.ch/record/1448603
work_keys_str_mv AT titom aprobabilisticanalysisofdatapopularityinatlasdatacaching
AT zarubag aprobabilisticanalysisofdatapopularityinatlasdatacaching
AT klimentova aprobabilisticanalysisofdatapopularityinatlasdatacaching
AT dek aprobabilisticanalysisofdatapopularityinatlasdatacaching
AT titom probabilisticanalysisofdatapopularityinatlasdatacaching
AT zarubag probabilisticanalysisofdatapopularityinatlasdatacaching
AT klimentova probabilisticanalysisofdatapopularityinatlasdatacaching
AT dek probabilisticanalysisofdatapopularityinatlasdatacaching