Cargando…

Data analysis with the DIANA meta-scheduling approach

The concepts, design and evaluation of the Data Intensive and Network Aware (DIANA) meta-scheduling approach for solving the challenges of data analysis being faced by CERN experiments are discussed in this paper. Our results suggest that data analysis can be made robust by employing fault tolerant...

Descripción completa

Detalles Bibliográficos
Autores principales: Anjum, A, McClatchey, R, Willers, I
Lenguaje:eng
Publicado: 2008
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/119/7/072004
http://cds.cern.ch/record/1177371
_version_ 1780916271332720640
author Anjum, A
McClatchey, R
Willers, I
author_facet Anjum, A
McClatchey, R
Willers, I
author_sort Anjum, A
collection CERN
description The concepts, design and evaluation of the Data Intensive and Network Aware (DIANA) meta-scheduling approach for solving the challenges of data analysis being faced by CERN experiments are discussed in this paper. Our results suggest that data analysis can be made robust by employing fault tolerant and decentralized meta-scheduling algorithms supported in our DIANA meta-scheduler. The DIANA meta-scheduler supports data intensive bulk scheduling, is network aware and follows a policy centric meta-scheduling. In this paper, we demonstrate that a decentralized and dynamic meta-scheduling approach is an effective strategy to cope with increasing numbers of users, jobs and datasets. We present 'quality of service' related statistics for physics analysis through the application of a policy centric fair-share scheduling model. The DIANA meta-schedulers create a peer-to-peer hierarchy of schedulers to accomplish resource management that changes with evolving loads and is dynamic and adapts to the volatile nature of the resources.
id cern-1177371
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2008
record_format invenio
spelling cern-11773712022-08-17T13:37:19Zdoi:10.1088/1742-6596/119/7/072004http://cds.cern.ch/record/1177371engAnjum, AMcClatchey, RWillers, IData analysis with the DIANA meta-scheduling approachComputing and ComputersThe concepts, design and evaluation of the Data Intensive and Network Aware (DIANA) meta-scheduling approach for solving the challenges of data analysis being faced by CERN experiments are discussed in this paper. Our results suggest that data analysis can be made robust by employing fault tolerant and decentralized meta-scheduling algorithms supported in our DIANA meta-scheduler. The DIANA meta-scheduler supports data intensive bulk scheduling, is network aware and follows a policy centric meta-scheduling. In this paper, we demonstrate that a decentralized and dynamic meta-scheduling approach is an effective strategy to cope with increasing numbers of users, jobs and datasets. We present 'quality of service' related statistics for physics analysis through the application of a policy centric fair-share scheduling model. The DIANA meta-schedulers create a peer-to-peer hierarchy of schedulers to accomplish resource management that changes with evolving loads and is dynamic and adapts to the volatile nature of the resources.oai:cds.cern.ch:11773712008
spellingShingle Computing and Computers
Anjum, A
McClatchey, R
Willers, I
Data analysis with the DIANA meta-scheduling approach
title Data analysis with the DIANA meta-scheduling approach
title_full Data analysis with the DIANA meta-scheduling approach
title_fullStr Data analysis with the DIANA meta-scheduling approach
title_full_unstemmed Data analysis with the DIANA meta-scheduling approach
title_short Data analysis with the DIANA meta-scheduling approach
title_sort data analysis with the diana meta-scheduling approach
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/119/7/072004
http://cds.cern.ch/record/1177371
work_keys_str_mv AT anjuma dataanalysiswiththedianametaschedulingapproach
AT mcclatcheyr dataanalysiswiththedianametaschedulingapproach
AT willersi dataanalysiswiththedianametaschedulingapproach