Cargando…

Software Agents in Data and Workflow Management

CMS currently uses a number of tools to transfer data which, taken together, form the basis of a heterogeneous datagrid. The range of tools used, and the directed, rather than optimized nature of CMS recent large scale data challenge required the creation of a simple infrastructure that allowed a ra...

Descripción completa

Detalles Bibliográficos
Autores principales: Barrass, T A, Maroney, O, Metson, S, Newbold, D, Jank, W, García-Abia, P, Hernández, J M, Afaq, A, Ernst, M, Fisk, I, Wu, Y, Charlot, C, Semeniouk, I N, Bonacorsi, D, Fanfani, A, Grandi, C, De Filippis, N, Rabbertz, K, Rehn, J, Tuura, L, Wildish, T
Lenguaje:eng
Publicado: CERN 2005
Materias:
Acceso en línea:https://dx.doi.org/10.5170/CERN-2005-002.838
http://cds.cern.ch/record/896896
_version_ 1780908623490187264
author Barrass, T A
Maroney, O
Metson, S
Newbold, D
Jank, W
García-Abia, P
Hernández, J M
Afaq, A
Ernst, M
Fisk, I
Wu, Y
Charlot, C
Semeniouk, I N
Bonacorsi, D
Fanfani, A
Grandi, C
De Filippis, N
Rabbertz, K
Rehn, J
Tuura, L
Wildish, T
Newbold, D
author_facet Barrass, T A
Maroney, O
Metson, S
Newbold, D
Jank, W
García-Abia, P
Hernández, J M
Afaq, A
Ernst, M
Fisk, I
Wu, Y
Charlot, C
Semeniouk, I N
Bonacorsi, D
Fanfani, A
Grandi, C
De Filippis, N
Rabbertz, K
Rehn, J
Tuura, L
Wildish, T
Newbold, D
author_sort Barrass, T A
collection CERN
description CMS currently uses a number of tools to transfer data which, taken together, form the basis of a heterogeneous datagrid. The range of tools used, and the directed, rather than optimized nature of CMS recent large scale data challenge required the creation of a simple infrastructure that allowed a range of tools to operate in a complementary way. The system created comprises a hierarchy of simple processes (named agents) that propagate files through a number of transfer states. File locations and some application metadata were stored in POOL file catalogues, with LCG LRC or MySQL back-ends. Agents were assigned limited responsibilities, and were restricted to communicating state 9in a well-defined, indirect fashion through a central transfer management database. In this way, the task of distributing data was easily divided between different groups for implementation. The prototype system w as developed rapidly, and achieved the required sustained transfer rate of ~10 MBps, with O(10^6) files distributed to 6 sites from CERN. Experience with the system during the data challenge raised issues with underlying technology (MSS write/read, stability of the LRC, maintenance of file catalogues, synchronization of filespaces), all of which have been successfully identified and handled. The development of this prototype infrastructure allows us to plan the evolution of backbone CMS data distribution from a simple hierarchy to a more autonomous, scalable model drawing on emerging agent and grid technology.
id cern-896896
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2005
publisher CERN
record_format invenio
spelling cern-8968962019-09-30T06:29:59Zdoi:10.5170/CERN-2005-002.838http://cds.cern.ch/record/896896engBarrass, T AMaroney, OMetson, SNewbold, DJank, WGarcía-Abia, PHernández, J MAfaq, AErnst, MFisk, IWu, YCharlot, CSemeniouk, I NBonacorsi, DFanfani, AGrandi, CDe Filippis, NRabbertz, KRehn, JTuura, LWildish, TNewbold, DSoftware Agents in Data and Workflow ManagementComputing and ComputersCMS currently uses a number of tools to transfer data which, taken together, form the basis of a heterogeneous datagrid. The range of tools used, and the directed, rather than optimized nature of CMS recent large scale data challenge required the creation of a simple infrastructure that allowed a range of tools to operate in a complementary way. The system created comprises a hierarchy of simple processes (named agents) that propagate files through a number of transfer states. File locations and some application metadata were stored in POOL file catalogues, with LCG LRC or MySQL back-ends. Agents were assigned limited responsibilities, and were restricted to communicating state 9in a well-defined, indirect fashion through a central transfer management database. In this way, the task of distributing data was easily divided between different groups for implementation. The prototype system w as developed rapidly, and achieved the required sustained transfer rate of ~10 MBps, with O(10^6) files distributed to 6 sites from CERN. Experience with the system during the data challenge raised issues with underlying technology (MSS write/read, stability of the LRC, maintenance of file catalogues, synchronization of filespaces), all of which have been successfully identified and handled. The development of this prototype infrastructure allows us to plan the evolution of backbone CMS data distribution from a simple hierarchy to a more autonomous, scalable model drawing on emerging agent and grid technology.CERNCMS-CR-2004-053oai:cds.cern.ch:8968962005
spellingShingle Computing and Computers
Barrass, T A
Maroney, O
Metson, S
Newbold, D
Jank, W
García-Abia, P
Hernández, J M
Afaq, A
Ernst, M
Fisk, I
Wu, Y
Charlot, C
Semeniouk, I N
Bonacorsi, D
Fanfani, A
Grandi, C
De Filippis, N
Rabbertz, K
Rehn, J
Tuura, L
Wildish, T
Newbold, D
Software Agents in Data and Workflow Management
title Software Agents in Data and Workflow Management
title_full Software Agents in Data and Workflow Management
title_fullStr Software Agents in Data and Workflow Management
title_full_unstemmed Software Agents in Data and Workflow Management
title_short Software Agents in Data and Workflow Management
title_sort software agents in data and workflow management
topic Computing and Computers
url https://dx.doi.org/10.5170/CERN-2005-002.838
http://cds.cern.ch/record/896896
work_keys_str_mv AT barrassta softwareagentsindataandworkflowmanagement
AT maroneyo softwareagentsindataandworkflowmanagement
AT metsons softwareagentsindataandworkflowmanagement
AT newboldd softwareagentsindataandworkflowmanagement
AT jankw softwareagentsindataandworkflowmanagement
AT garciaabiap softwareagentsindataandworkflowmanagement
AT hernandezjm softwareagentsindataandworkflowmanagement
AT afaqa softwareagentsindataandworkflowmanagement
AT ernstm softwareagentsindataandworkflowmanagement
AT fiski softwareagentsindataandworkflowmanagement
AT wuy softwareagentsindataandworkflowmanagement
AT charlotc softwareagentsindataandworkflowmanagement
AT semenioukin softwareagentsindataandworkflowmanagement
AT bonacorsid softwareagentsindataandworkflowmanagement
AT fanfania softwareagentsindataandworkflowmanagement
AT grandic softwareagentsindataandworkflowmanagement
AT defilippisn softwareagentsindataandworkflowmanagement
AT rabbertzk softwareagentsindataandworkflowmanagement
AT rehnj softwareagentsindataandworkflowmanagement
AT tuural softwareagentsindataandworkflowmanagement
AT wildisht softwareagentsindataandworkflowmanagement
AT newboldd softwareagentsindataandworkflowmanagement