Cargando…
A Probabilistic Analysis of Data Popularity in ATLAS Data Caching
One of the most important aspects in any distribution system is efficient data replication over storage / computing centers, that guarantees high data availability and low cost of resources utilization. In this paper we propose a data distribution scheme for the production and distributed analysis s...
Autores principales: | , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2012
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/396/3/032106 http://cds.cern.ch/record/1448603 |
_version_ | 1780924861982441472 |
---|---|
author | Tito, M Záruba, G Klimentov, A De, K |
author_facet | Tito, M Záruba, G Klimentov, A De, K |
author_sort | Tito, M |
collection | CERN |
description | One of the most important aspects in any distribution system is efficient data replication over storage / computing centers, that guarantees high data availability and low cost of resources utilization. In this paper we propose a data distribution scheme for the production and distributed analysis system PanDA at the ATLAS experiment. Our proposed scheme is based on an investigation of data usage. Thus, the paper is focused on the main concepts of data popularity at the PanDA system and their utilization. Data popularity is represented as the set of parameters that are used to predict the future data state in terms of popularity levels. |
id | cern-1448603 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2012 |
record_format | invenio |
spelling | cern-14486032019-09-30T06:29:59Zdoi:10.1088/1742-6596/396/3/032106http://cds.cern.ch/record/1448603engTito, MZáruba, GKlimentov, ADe, KA Probabilistic Analysis of Data Popularity in ATLAS Data CachingDetectors and Experimental TechniquesOne of the most important aspects in any distribution system is efficient data replication over storage / computing centers, that guarantees high data availability and low cost of resources utilization. In this paper we propose a data distribution scheme for the production and distributed analysis system PanDA at the ATLAS experiment. Our proposed scheme is based on an investigation of data usage. Thus, the paper is focused on the main concepts of data popularity at the PanDA system and their utilization. Data popularity is represented as the set of parameters that are used to predict the future data state in terms of popularity levels.ATL-SOFT-PROC-2012-022oai:cds.cern.ch:14486032012-05-14 |
spellingShingle | Detectors and Experimental Techniques Tito, M Záruba, G Klimentov, A De, K A Probabilistic Analysis of Data Popularity in ATLAS Data Caching |
title | A Probabilistic Analysis of Data Popularity in ATLAS Data Caching |
title_full | A Probabilistic Analysis of Data Popularity in ATLAS Data Caching |
title_fullStr | A Probabilistic Analysis of Data Popularity in ATLAS Data Caching |
title_full_unstemmed | A Probabilistic Analysis of Data Popularity in ATLAS Data Caching |
title_short | A Probabilistic Analysis of Data Popularity in ATLAS Data Caching |
title_sort | probabilistic analysis of data popularity in atlas data caching |
topic | Detectors and Experimental Techniques |
url | https://dx.doi.org/10.1088/1742-6596/396/3/032106 http://cds.cern.ch/record/1448603 |
work_keys_str_mv | AT titom aprobabilisticanalysisofdatapopularityinatlasdatacaching AT zarubag aprobabilisticanalysisofdatapopularityinatlasdatacaching AT klimentova aprobabilisticanalysisofdatapopularityinatlasdatacaching AT dek aprobabilisticanalysisofdatapopularityinatlasdatacaching AT titom probabilisticanalysisofdatapopularityinatlasdatacaching AT zarubag probabilisticanalysisofdatapopularityinatlasdatacaching AT klimentova probabilisticanalysisofdatapopularityinatlasdatacaching AT dek probabilisticanalysisofdatapopularityinatlasdatacaching |