Cargando…
Popularity Prediction Tool for ATLAS Distributed Data Management
This paper describes a popularity prediction tool for data-intensive data management systems, such as ATLAS distributed data management (DDM). It is fed by the DDM popularity system, which produces historical reports about ATLAS data usage, providing information about files, datasets, users and site...
Autores principales: | , , , , , , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2013
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/1605794 |
_version_ | 1780931654747947008 |
---|---|
author | Beermann, T Maettig, P Stewart, G Lassnig, M Garonne, V Barisits, M Vigne, R Serfon, C Goossens, L Nairz, A Molfetas, A |
author_facet | Beermann, T Maettig, P Stewart, G Lassnig, M Garonne, V Barisits, M Vigne, R Serfon, C Goossens, L Nairz, A Molfetas, A |
author_sort | Beermann, T |
collection | CERN |
description | This paper describes a popularity prediction tool for data-intensive data management systems, such as ATLAS distributed data management (DDM). It is fed by the DDM popularity system, which produces historical reports about ATLAS data usage, providing information about files, datasets, users and sites where data was accessed. The tool described in this contribution uses this historical information to make a prediction about the future popularity of data. It finds trends in the usage of data using a set of neural networks and a set of input parameters and predicts the number of accesses in the near term future. This information can then be used in a second step to improve the distribution of replicas at sites, taking into account the cost of creating new replicas (bandwidth and load on the storage system) compared to gain of having new ones (faster access of data for analysis). To evaluate the benefit of the redistribution a grid simulator is introduced that is able replay real workload on different data distributions. This article describes the popularity prediction method and the simulator that is used to evaluate the redistribution. |
id | cern-1605794 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2013 |
record_format | invenio |
spelling | cern-16057942019-09-30T06:29:59Zhttp://cds.cern.ch/record/1605794engBeermann, TMaettig, PStewart, GLassnig, MGaronne, VBarisits, MVigne, RSerfon, CGoossens, LNairz, AMolfetas, APopularity Prediction Tool for ATLAS Distributed Data ManagementDetectors and Experimental TechniquesThis paper describes a popularity prediction tool for data-intensive data management systems, such as ATLAS distributed data management (DDM). It is fed by the DDM popularity system, which produces historical reports about ATLAS data usage, providing information about files, datasets, users and sites where data was accessed. The tool described in this contribution uses this historical information to make a prediction about the future popularity of data. It finds trends in the usage of data using a set of neural networks and a set of input parameters and predicts the number of accesses in the near term future. This information can then be used in a second step to improve the distribution of replicas at sites, taking into account the cost of creating new replicas (bandwidth and load on the storage system) compared to gain of having new ones (faster access of data for analysis). To evaluate the benefit of the redistribution a grid simulator is introduced that is able replay real workload on different data distributions. This article describes the popularity prediction method and the simulator that is used to evaluate the redistribution.ATL-SOFT-SLIDE-2013-782oai:cds.cern.ch:16057942013-10-04 |
spellingShingle | Detectors and Experimental Techniques Beermann, T Maettig, P Stewart, G Lassnig, M Garonne, V Barisits, M Vigne, R Serfon, C Goossens, L Nairz, A Molfetas, A Popularity Prediction Tool for ATLAS Distributed Data Management |
title | Popularity Prediction Tool for ATLAS Distributed Data Management |
title_full | Popularity Prediction Tool for ATLAS Distributed Data Management |
title_fullStr | Popularity Prediction Tool for ATLAS Distributed Data Management |
title_full_unstemmed | Popularity Prediction Tool for ATLAS Distributed Data Management |
title_short | Popularity Prediction Tool for ATLAS Distributed Data Management |
title_sort | popularity prediction tool for atlas distributed data management |
topic | Detectors and Experimental Techniques |
url | http://cds.cern.ch/record/1605794 |
work_keys_str_mv | AT beermannt popularitypredictiontoolforatlasdistributeddatamanagement AT maettigp popularitypredictiontoolforatlasdistributeddatamanagement AT stewartg popularitypredictiontoolforatlasdistributeddatamanagement AT lassnigm popularitypredictiontoolforatlasdistributeddatamanagement AT garonnev popularitypredictiontoolforatlasdistributeddatamanagement AT barisitsm popularitypredictiontoolforatlasdistributeddatamanagement AT vigner popularitypredictiontoolforatlasdistributeddatamanagement AT serfonc popularitypredictiontoolforatlasdistributeddatamanagement AT goossensl popularitypredictiontoolforatlasdistributeddatamanagement AT nairza popularitypredictiontoolforatlasdistributeddatamanagement AT molfetasa popularitypredictiontoolforatlasdistributeddatamanagement |