Cargando…

The replica consistency problem in data grids

Fast and reliable data access is a crucial aspect in distributed computing and is often achieved using data replication techniques. In Grid architectures, data are replicated in many nodes of the Grid, and users usually access the best replica in terms of availability and network latency. When repli...

Descripción completa

Detalles Bibliográficos
Autor principal: Pucciani, Gianni
Lenguaje:eng
Publicado: Pisa U. 2008
Materias:
Acceso en línea:http://cds.cern.ch/record/1110291
_version_ 1780914276966334464
author Pucciani, Gianni
author_facet Pucciani, Gianni
author_sort Pucciani, Gianni
collection CERN
description Fast and reliable data access is a crucial aspect in distributed computing and is often achieved using data replication techniques. In Grid architectures, data are replicated in many nodes of the Grid, and users usually access the best replica in terms of availability and network latency. When replicas are modifiable, a change made to one replica will break the consistency with the other replicas that, at that point, become stale. Replica synchronisation protocols exist and are applied in several distributed architectures, for example in distributed databases. Grid middleware solutions provide well established support for replicating data. Nevertheless, replicas are still considered read-only, and no support is provided to the user for updating a replica while maintaining the consistency with the other replicas. In this thesis, done in collaboration with the Italian National Institute of Nuclear Physics (INFN) and the European Organisation for Nuclear Research (CERN), we study the replica consistency problem in Grid computing and propose a service, called CONStanza, that is able to synchronise both les and heterogeneous (different vendors) databases in a Grid environment. We analyse and implement a specific use case that arises in high energy Physics, where conditions databases are replicated using databases of different makes. We provide detailed performance results, and show how CONStanza can be used together with Oracle Streams to provide multitier replication of conditions databases using Oracle and MySQL databases.
id cern-1110291
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2008
publisher Pisa U.
record_format invenio
spelling cern-11102912019-09-30T06:29:59Zhttp://cds.cern.ch/record/1110291engPucciani, GianniThe replica consistency problem in data gridsComputing and ComputersFast and reliable data access is a crucial aspect in distributed computing and is often achieved using data replication techniques. In Grid architectures, data are replicated in many nodes of the Grid, and users usually access the best replica in terms of availability and network latency. When replicas are modifiable, a change made to one replica will break the consistency with the other replicas that, at that point, become stale. Replica synchronisation protocols exist and are applied in several distributed architectures, for example in distributed databases. Grid middleware solutions provide well established support for replicating data. Nevertheless, replicas are still considered read-only, and no support is provided to the user for updating a replica while maintaining the consistency with the other replicas. In this thesis, done in collaboration with the Italian National Institute of Nuclear Physics (INFN) and the European Organisation for Nuclear Research (CERN), we study the replica consistency problem in Grid computing and propose a service, called CONStanza, that is able to synchronise both les and heterogeneous (different vendors) databases in a Grid environment. We analyse and implement a specific use case that arises in high energy Physics, where conditions databases are replicated using databases of different makes. We provide detailed performance results, and show how CONStanza can be used together with Oracle Streams to provide multitier replication of conditions databases using Oracle and MySQL databases.Pisa U.CERN-THESIS-2008-049oai:cds.cern.ch:11102912008
spellingShingle Computing and Computers
Pucciani, Gianni
The replica consistency problem in data grids
title The replica consistency problem in data grids
title_full The replica consistency problem in data grids
title_fullStr The replica consistency problem in data grids
title_full_unstemmed The replica consistency problem in data grids
title_short The replica consistency problem in data grids
title_sort replica consistency problem in data grids
topic Computing and Computers
url http://cds.cern.ch/record/1110291
work_keys_str_mv AT puccianigianni thereplicaconsistencyproblemindatagrids
AT puccianigianni replicaconsistencyproblemindatagrids