Cargando…

File and object replication in data grids

Data replication is a key issue in a data grid and can be managed in different ways and at different levels of granularity: for example, at the file level or the object level. In the high-energy physics community, data grids are being developed to support the distributed analysis of experimental dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Stockinger, H E, Samar, A, Allcock, B, Foster, I, Holtman, K, Tierney, B L
Lenguaje:eng
Publicado: 2001
Materias:
Acceso en línea:http://cds.cern.ch/record/560404
_version_ 1780899071092850688
author Stockinger, H E
Samar, A
Allcock, B
Foster, I
Holtman, K
Tierney, B L
author_facet Stockinger, H E
Samar, A
Allcock, B
Foster, I
Holtman, K
Tierney, B L
author_sort Stockinger, H E
collection CERN
description Data replication is a key issue in a data grid and can be managed in different ways and at different levels of granularity: for example, at the file level or the object level. In the high-energy physics community, data grids are being developed to support the distributed analysis of experimental data. We have produced a prototype data replication tool, the Grid Data Management Pilot (GDMP) that is in production use in one physics experiment, with middleware provided by the Globus toolkit used for authentication, data movement and other purposes. We present a new, enhanced GDMP architecture and prototype implementation that uses Globus data-grid tools for efficient file replication. We also explain how this architecture can address object replication issues in an object-oriented database management system. File transfer over wide-area networks requires specific performance tuning in order to gain optimal data transfer rates. We present performance results obtained with GridFTP, an enhanced version of FTP, and discuss tuning parameters. (37 refs).
id cern-560404
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2001
record_format invenio
spelling cern-5604042019-09-30T06:29:59Zhttp://cds.cern.ch/record/560404engStockinger, H ESamar, AAllcock, BFoster, IHoltman, KTierney, B LFile and object replication in data gridsComputing and ComputersData replication is a key issue in a data grid and can be managed in different ways and at different levels of granularity: for example, at the file level or the object level. In the high-energy physics community, data grids are being developed to support the distributed analysis of experimental data. We have produced a prototype data replication tool, the Grid Data Management Pilot (GDMP) that is in production use in one physics experiment, with middleware provided by the Globus toolkit used for authentication, data movement and other purposes. We present a new, enhanced GDMP architecture and prototype implementation that uses Globus data-grid tools for efficient file replication. We also explain how this architecture can address object replication issues in an object-oriented database management system. File transfer over wide-area networks requires specific performance tuning in order to gain optimal data transfer rates. We present performance results obtained with GridFTP, an enhanced version of FTP, and discuss tuning parameters. (37 refs).oai:cds.cern.ch:5604042001
spellingShingle Computing and Computers
Stockinger, H E
Samar, A
Allcock, B
Foster, I
Holtman, K
Tierney, B L
File and object replication in data grids
title File and object replication in data grids
title_full File and object replication in data grids
title_fullStr File and object replication in data grids
title_full_unstemmed File and object replication in data grids
title_short File and object replication in data grids
title_sort file and object replication in data grids
topic Computing and Computers
url http://cds.cern.ch/record/560404
work_keys_str_mv AT stockingerhe fileandobjectreplicationindatagrids
AT samara fileandobjectreplicationindatagrids
AT allcockb fileandobjectreplicationindatagrids
AT fosteri fileandobjectreplicationindatagrids
AT holtmank fileandobjectreplicationindatagrids
AT tierneybl fileandobjectreplicationindatagrids