Cargando…
Web enabled data management with DPM & LFC
The Disk Pool Manager (DPM) and LCG File Catalog (LFC) are two grid data management components currently used in production with more than 240 endpoints. Together with a set of grid client tools they give the users a unified view of their data, hiding most details concerning data location and access...
Autores principales: | , , , , , |
---|---|
Lenguaje: | eng |
Publicado: |
2012
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/396/5/052006 http://cds.cern.ch/record/1457962 |
_version_ | 1780925146885783552 |
---|---|
author | Alvarez Ayllon, A Beche, A Fabrizio, F Hellmich, M Keeble, O Brito da Rocha, R |
author_facet | Alvarez Ayllon, A Beche, A Fabrizio, F Hellmich, M Keeble, O Brito da Rocha, R |
author_sort | Alvarez Ayllon, A |
collection | CERN |
description | The Disk Pool Manager (DPM) and LCG File Catalog (LFC) are two grid data management components currently used in production with more than 240 endpoints. Together with a set of grid client tools they give the users a unified view of their data, hiding most details concerning data location and access. Recently we’ve put a lot of effort in developing a reliable and high performance HTTP/WebDAV frontend to both our grid catalog and storage components, exposing the existing functionality to users accessing the services via standard clients - e.g. web browsers, curl - present in all operating systems, giving users a simple and straigh-forward way of interaction. In addition, as other relevant grid storage components (like dCache) expose their data using the same protocol, for the first time we had the opportunity of attempting a unified view of all grid storage using HTTP. We describe the mechanism used to integrate the grid catalog(s) with the multiple storage components - HTTP redirection -, including details on some assumptions made to allow integration with other implementations. We describe the way we hide the details regarding site availability or catalog inconsistencies, by switching the standard HTTP client automatically between multiple replicas. We also present measurements of access performance, and the relevant factors regarding replica selection - current throughput and load, geographic proximity, etc. Finally, we report on some additional work done to have this system as a viable alternative to GridFTP, providing multi-stream transfers and exploiting some additional features of WebDAV to enable third party copies - essential for managing data movements between storage systems - with equivalent performance. |
id | cern-1457962 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2012 |
record_format | invenio |
spelling | cern-14579622022-08-17T13:32:57Zdoi:10.1088/1742-6596/396/5/052006http://cds.cern.ch/record/1457962engAlvarez Ayllon, ABeche, AFabrizio, FHellmich, MKeeble, OBrito da Rocha, RWeb enabled data management with DPM & LFCComputing and ComputersThe Disk Pool Manager (DPM) and LCG File Catalog (LFC) are two grid data management components currently used in production with more than 240 endpoints. Together with a set of grid client tools they give the users a unified view of their data, hiding most details concerning data location and access. Recently we’ve put a lot of effort in developing a reliable and high performance HTTP/WebDAV frontend to both our grid catalog and storage components, exposing the existing functionality to users accessing the services via standard clients - e.g. web browsers, curl - present in all operating systems, giving users a simple and straigh-forward way of interaction. In addition, as other relevant grid storage components (like dCache) expose their data using the same protocol, for the first time we had the opportunity of attempting a unified view of all grid storage using HTTP. We describe the mechanism used to integrate the grid catalog(s) with the multiple storage components - HTTP redirection -, including details on some assumptions made to allow integration with other implementations. We describe the way we hide the details regarding site availability or catalog inconsistencies, by switching the standard HTTP client automatically between multiple replicas. We also present measurements of access performance, and the relevant factors regarding replica selection - current throughput and load, geographic proximity, etc. Finally, we report on some additional work done to have this system as a viable alternative to GridFTP, providing multi-stream transfers and exploiting some additional features of WebDAV to enable third party copies - essential for managing data movements between storage systems - with equivalent performance.CERN-IT-Note-2012-008oai:cds.cern.ch:14579622012-06-21 |
spellingShingle | Computing and Computers Alvarez Ayllon, A Beche, A Fabrizio, F Hellmich, M Keeble, O Brito da Rocha, R Web enabled data management with DPM & LFC |
title | Web enabled data management with DPM & LFC |
title_full | Web enabled data management with DPM & LFC |
title_fullStr | Web enabled data management with DPM & LFC |
title_full_unstemmed | Web enabled data management with DPM & LFC |
title_short | Web enabled data management with DPM & LFC |
title_sort | web enabled data management with dpm & lfc |
topic | Computing and Computers |
url | https://dx.doi.org/10.1088/1742-6596/396/5/052006 http://cds.cern.ch/record/1457962 |
work_keys_str_mv | AT alvarezayllona webenableddatamanagementwithdpmlfc AT bechea webenableddatamanagementwithdpmlfc AT fabriziof webenableddatamanagementwithdpmlfc AT hellmichm webenableddatamanagementwithdpmlfc AT keebleo webenableddatamanagementwithdpmlfc AT britodarochar webenableddatamanagementwithdpmlfc |