A Probabilistic Analysis of Data Popularity in ATLAS Data Caching
Main authors: | , , , |
---|---|
Language: | eng |
Published: | 2012 |
Subjects: | |
Online access: | https://dx.doi.org/10.1088/1742-6596/396/3/032106 http://cds.cern.ch/record/1448603 |
Summary: | One of the most important aspects of any distribution system is efficient data replication across storage and computing centers, which guarantees high data availability and low resource-utilization cost. In this paper we propose a data distribution scheme for PanDA, the production and distributed analysis system of the ATLAS experiment. The proposed scheme is based on an investigation of data usage, so the paper focuses on the main concepts of data popularity in the PanDA system and how they are used. Data popularity is represented as a set of parameters used to predict the future state of the data in terms of popularity levels. |