Cargando…

Web enabled data management with DPM & LFC

The Disk Pool Manager (DPM) and LCG File Catalog (LFC) are two grid data management components currently used in production with more than 240 endpoints. Together with a set of grid client tools they give the users a unified view of their data, hiding most details concerning data location and access...

Descripción completa

Detalles Bibliográficos
Autores principales: Alvarez Ayllon, A, Beche, A, Fabrizio, F, Hellmich, M, Keeble, O, Brito da Rocha, R
Lenguaje:eng
Publicado: 2012
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/396/5/052006
http://cds.cern.ch/record/1457962
_version_ 1780925146885783552
author Alvarez Ayllon, A
Beche, A
Fabrizio, F
Hellmich, M
Keeble, O
Brito da Rocha, R
author_facet Alvarez Ayllon, A
Beche, A
Fabrizio, F
Hellmich, M
Keeble, O
Brito da Rocha, R
author_sort Alvarez Ayllon, A
collection CERN
description The Disk Pool Manager (DPM) and LCG File Catalog (LFC) are two grid data management components currently used in production with more than 240 endpoints. Together with a set of grid client tools they give the users a unified view of their data, hiding most details concerning data location and access. Recently we’ve put a lot of effort in developing a reliable and high performance HTTP/WebDAV frontend to both our grid catalog and storage components, exposing the existing functionality to users accessing the services via standard clients - e.g. web browsers, curl - present in all operating systems, giving users a simple and straigh-forward way of interaction. In addition, as other relevant grid storage components (like dCache) expose their data using the same protocol, for the first time we had the opportunity of attempting a unified view of all grid storage using HTTP. We describe the mechanism used to integrate the grid catalog(s) with the multiple storage components - HTTP redirection -, including details on some assumptions made to allow integration with other implementations. We describe the way we hide the details regarding site availability or catalog inconsistencies, by switching the standard HTTP client automatically between multiple replicas. We also present measurements of access performance, and the relevant factors regarding replica selection - current throughput and load, geographic proximity, etc. Finally, we report on some additional work done to have this system as a viable alternative to GridFTP, providing multi-stream transfers and exploiting some additional features of WebDAV to enable third party copies - essential for managing data movements between storage systems - with equivalent performance.
id cern-1457962
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2012
record_format invenio
spelling cern-14579622022-08-17T13:32:57Zdoi:10.1088/1742-6596/396/5/052006http://cds.cern.ch/record/1457962engAlvarez Ayllon, ABeche, AFabrizio, FHellmich, MKeeble, OBrito da Rocha, RWeb enabled data management with DPM & LFCComputing and ComputersThe Disk Pool Manager (DPM) and LCG File Catalog (LFC) are two grid data management components currently used in production with more than 240 endpoints. Together with a set of grid client tools they give the users a unified view of their data, hiding most details concerning data location and access. Recently we’ve put a lot of effort in developing a reliable and high performance HTTP/WebDAV frontend to both our grid catalog and storage components, exposing the existing functionality to users accessing the services via standard clients - e.g. web browsers, curl - present in all operating systems, giving users a simple and straigh-forward way of interaction. In addition, as other relevant grid storage components (like dCache) expose their data using the same protocol, for the first time we had the opportunity of attempting a unified view of all grid storage using HTTP. We describe the mechanism used to integrate the grid catalog(s) with the multiple storage components - HTTP redirection -, including details on some assumptions made to allow integration with other implementations. We describe the way we hide the details regarding site availability or catalog inconsistencies, by switching the standard HTTP client automatically between multiple replicas. We also present measurements of access performance, and the relevant factors regarding replica selection - current throughput and load, geographic proximity, etc. Finally, we report on some additional work done to have this system as a viable alternative to GridFTP, providing multi-stream transfers and exploiting some additional features of WebDAV to enable third party copies - essential for managing data movements between storage systems - with equivalent performance.CERN-IT-Note-2012-008oai:cds.cern.ch:14579622012-06-21
spellingShingle Computing and Computers
Alvarez Ayllon, A
Beche, A
Fabrizio, F
Hellmich, M
Keeble, O
Brito da Rocha, R
Web enabled data management with DPM & LFC
title Web enabled data management with DPM & LFC
title_full Web enabled data management with DPM & LFC
title_fullStr Web enabled data management with DPM & LFC
title_full_unstemmed Web enabled data management with DPM & LFC
title_short Web enabled data management with DPM & LFC
title_sort web enabled data management with dpm & lfc
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/396/5/052006
http://cds.cern.ch/record/1457962
work_keys_str_mv AT alvarezayllona webenableddatamanagementwithdpmlfc
AT bechea webenableddatamanagementwithdpmlfc
AT fabriziof webenableddatamanagementwithdpmlfc
AT hellmichm webenableddatamanagementwithdpmlfc
AT keebleo webenableddatamanagementwithdpmlfc
AT britodarochar webenableddatamanagementwithdpmlfc