Cargando…

Popularity Prediction Tool for ATLAS Distributed Data Management

This paper describes a popularity prediction tool for data-intensive data management systems, such as ATLAS distributed data management (DDM). It is fed by the DDM popularity system, which produces historical reports about ATLAS data usage, providing information about files, datasets, users and site...

Descripción completa

Detalles Bibliográficos
Autores principales: Beermann, T, Maettig, P, Stewart, G, Lassnig, M, Garonne, V, Barisits, M, Vigne, R, Serfon, C, Goossens, L, Nairz, A, Molfetas, A
Lenguaje:eng
Publicado: 2013
Materias:
Acceso en línea:http://cds.cern.ch/record/1605794
_version_ 1780931654747947008
author Beermann, T
Maettig, P
Stewart, G
Lassnig, M
Garonne, V
Barisits, M
Vigne, R
Serfon, C
Goossens, L
Nairz, A
Molfetas, A
author_facet Beermann, T
Maettig, P
Stewart, G
Lassnig, M
Garonne, V
Barisits, M
Vigne, R
Serfon, C
Goossens, L
Nairz, A
Molfetas, A
author_sort Beermann, T
collection CERN
description This paper describes a popularity prediction tool for data-intensive data management systems, such as ATLAS distributed data management (DDM). It is fed by the DDM popularity system, which produces historical reports about ATLAS data usage, providing information about files, datasets, users and sites where data was accessed. The tool described in this contribution uses this historical information to make a prediction about the future popularity of data. It finds trends in the usage of data using a set of neural networks and a set of input parameters and predicts the number of accesses in the near term future. This information can then be used in a second step to improve the distribution of replicas at sites, taking into account the cost of creating new replicas (bandwidth and load on the storage system) compared to gain of having new ones (faster access of data for analysis). To evaluate the benefit of the redistribution a grid simulator is introduced that is able replay real workload on different data distributions. This article describes the popularity prediction method and the simulator that is used to evaluate the redistribution.
id cern-1605794
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2013
record_format invenio
spelling cern-16057942019-09-30T06:29:59Zhttp://cds.cern.ch/record/1605794engBeermann, TMaettig, PStewart, GLassnig, MGaronne, VBarisits, MVigne, RSerfon, CGoossens, LNairz, AMolfetas, APopularity Prediction Tool for ATLAS Distributed Data ManagementDetectors and Experimental TechniquesThis paper describes a popularity prediction tool for data-intensive data management systems, such as ATLAS distributed data management (DDM). It is fed by the DDM popularity system, which produces historical reports about ATLAS data usage, providing information about files, datasets, users and sites where data was accessed. The tool described in this contribution uses this historical information to make a prediction about the future popularity of data. It finds trends in the usage of data using a set of neural networks and a set of input parameters and predicts the number of accesses in the near term future. This information can then be used in a second step to improve the distribution of replicas at sites, taking into account the cost of creating new replicas (bandwidth and load on the storage system) compared to gain of having new ones (faster access of data for analysis). To evaluate the benefit of the redistribution a grid simulator is introduced that is able replay real workload on different data distributions. This article describes the popularity prediction method and the simulator that is used to evaluate the redistribution.ATL-SOFT-SLIDE-2013-782oai:cds.cern.ch:16057942013-10-04
spellingShingle Detectors and Experimental Techniques
Beermann, T
Maettig, P
Stewart, G
Lassnig, M
Garonne, V
Barisits, M
Vigne, R
Serfon, C
Goossens, L
Nairz, A
Molfetas, A
Popularity Prediction Tool for ATLAS Distributed Data Management
title Popularity Prediction Tool for ATLAS Distributed Data Management
title_full Popularity Prediction Tool for ATLAS Distributed Data Management
title_fullStr Popularity Prediction Tool for ATLAS Distributed Data Management
title_full_unstemmed Popularity Prediction Tool for ATLAS Distributed Data Management
title_short Popularity Prediction Tool for ATLAS Distributed Data Management
title_sort popularity prediction tool for atlas distributed data management
topic Detectors and Experimental Techniques
url http://cds.cern.ch/record/1605794
work_keys_str_mv AT beermannt popularitypredictiontoolforatlasdistributeddatamanagement
AT maettigp popularitypredictiontoolforatlasdistributeddatamanagement
AT stewartg popularitypredictiontoolforatlasdistributeddatamanagement
AT lassnigm popularitypredictiontoolforatlasdistributeddatamanagement
AT garonnev popularitypredictiontoolforatlasdistributeddatamanagement
AT barisitsm popularitypredictiontoolforatlasdistributeddatamanagement
AT vigner popularitypredictiontoolforatlasdistributeddatamanagement
AT serfonc popularitypredictiontoolforatlasdistributeddatamanagement
AT goossensl popularitypredictiontoolforatlasdistributeddatamanagement
AT nairza popularitypredictiontoolforatlasdistributeddatamanagement
AT molfetasa popularitypredictiontoolforatlasdistributeddatamanagement