Distributed Monitoring Infrastructure for Worldwide LHC Computing Grid
The journey of a monitoring probe from its development phase to the moment its execution result is presented in an availability report is a complex process. It goes through multiple phases such as development, testing, integration, release, deployment, execution, data aggregation, computation, and r...
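Main authors: | Andrade, Pedro; Babik, Marian; Bhatt, Kislay; Chand, Phool; Collados, David; Duggal, Vibhuti; Fuente, Paloma; Hayashi, Soichi; Imamagic, Emir; Joshi, Pradyumna; Kalmady, Rajesh; Karnani, Urvashi; Kumar, Vaibhav; Lapka, Wojciech; Quick, Robert; Tarragon, Jacobo; Teige, Scott; Triantafyllidis, Christos |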
---|---|
Language: | eng |
Published: | 2012 |
Subjects: | Computing and Computers |
Online access: | https://dx.doi.org/10.1088/1742-6596/396/3/032002 http://cds.cern.ch/record/1457986 |
_version_ | 1780925149078355968 |
---|---|
author | Andrade, Pedro Babik, Marian Bhatt, Kislay Chand, Phool Collados, David Duggal, Vibhuti Fuente, Paloma Hayashi, Soichi Imamagic, Emir Joshi, Pradyumna Kalmady, Rajesh Karnani, Urvashi Kumar, Vaibhav Lapka, Wojciech Quick, Robert Tarragon, Jacobo Teige, Scott Triantafyllidis, Christos |
author_facet | Andrade, Pedro Babik, Marian Bhatt, Kislay Chand, Phool Collados, David Duggal, Vibhuti Fuente, Paloma Hayashi, Soichi Imamagic, Emir Joshi, Pradyumna Kalmady, Rajesh Karnani, Urvashi Kumar, Vaibhav Lapka, Wojciech Quick, Robert Tarragon, Jacobo Teige, Scott Triantafyllidis, Christos |
author_sort | Andrade, Pedro |
collection | CERN |
description | The journey of a monitoring probe from its development phase to the moment its execution result is presented in an availability report is a complex process. It goes through multiple phases such as development, testing, integration, release, deployment, execution, data aggregation, computation, and reporting. Further, it involves people with different roles (developers, site managers, VO managers, service managers, management), from different middleware providers (ARC, dCache, gLite, UNICORE and VDT), consortiums (WLCG, EMI, EGI, OSG), and operational teams (GOC, OMB, OTAG, CSIRT). The seamless harmonization of these distributed actors is in daily use for monitoring of the WLCG infrastructure. In this paper we describe the monitoring of the WLCG infrastructure from the operational perspective. We explain the complexity of the journey of a monitoring probe from its execution on a grid node to the visualization on the MyWLCG portal where it is exposed to other clients. This monitoring workflow profits from the interoperability established between the SAM and RSV frameworks. We show how these two distributed structures are capable of uniting technologies and hiding the complexity around them, making them easy to be used by the community. Finally, the different supported deployment strategies, tailored not only for monitoring the entire infrastructure but also for monitoring sites and virtual organizations, are presented and the associated operational benefits highlighted. |
id | cern-1457986 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2012 |
record_format | invenio |
spelling | cern-1457986 2022-08-17T13:32:22Z doi:10.1088/1742-6596/396/3/032002 http://cds.cern.ch/record/1457986 eng Andrade, Pedro Babik, Marian Bhatt, Kislay Chand, Phool Collados, David Duggal, Vibhuti Fuente, Paloma Hayashi, Soichi Imamagic, Emir Joshi, Pradyumna Kalmady, Rajesh Karnani, Urvashi Kumar, Vaibhav Lapka, Wojciech Quick, Robert Tarragon, Jacobo Teige, Scott Triantafyllidis, Christos Distributed Monitoring Infrastructure for Worldwide LHC Computing Grid Computing and Computers The journey of a monitoring probe from its development phase to the moment its execution result is presented in an availability report is a complex process. It goes through multiple phases such as development, testing, integration, release, deployment, execution, data aggregation, computation, and reporting. Further, it involves people with different roles (developers, site managers, VO managers, service managers, management), from different middleware providers (ARC, dCache, gLite, UNICORE and VDT), consortiums (WLCG, EMI, EGI, OSG), and operational teams (GOC, OMB, OTAG, CSIRT). The seamless harmonization of these distributed actors is in daily use for monitoring of the WLCG infrastructure. In this paper we describe the monitoring of the WLCG infrastructure from the operational perspective. We explain the complexity of the journey of a monitoring probe from its execution on a grid node to the visualization on the MyWLCG portal where it is exposed to other clients. This monitoring workflow profits from the interoperability established between the SAM and RSV frameworks. We show how these two distributed structures are capable of uniting technologies and hiding the complexity around them, making them easy to be used by the community. Finally, the different supported deployment strategies, tailored not only for monitoring the entire infrastructure but also for monitoring sites and virtual organizations, are presented and the associated operational benefits highlighted. CERN-IT-Note-2012-015 oai:cds.cern.ch:1457986 2012-06-26 |
spellingShingle | Computing and Computers Andrade, Pedro Babik, Marian Bhatt, Kislay Chand, Phool Collados, David Duggal, Vibhuti Fuente, Paloma Hayashi, Soichi Imamagic, Emir Joshi, Pradyumna Kalmady, Rajesh Karnani, Urvashi Kumar, Vaibhav Lapka, Wojciech Quick, Robert Tarragon, Jacobo Teige, Scott Triantafyllidis, Christos Distributed Monitoring Infrastructure for Worldwide LHC Computing Grid |
title | Distributed Monitoring Infrastructure for Worldwide LHC Computing Grid |
title_full | Distributed Monitoring Infrastructure for Worldwide LHC Computing Grid |
title_fullStr | Distributed Monitoring Infrastructure for Worldwide LHC Computing Grid |
title_full_unstemmed | Distributed Monitoring Infrastructure for Worldwide LHC Computing Grid |
title_short | Distributed Monitoring Infrastructure for Worldwide LHC Computing Grid |
title_sort | distributed monitoring infrastructure for worldwide lhc computing grid |
topic | Computing and Computers |
url | https://dx.doi.org/10.1088/1742-6596/396/3/032002 http://cds.cern.ch/record/1457986 |
work_keys_str_mv | AT andradepedro distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT babikmarian distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT bhattkislay distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT chandphool distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT colladosdavid distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT duggalvibhuti distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT fuentepaloma distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT hayashisoichi distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT imamagicemir distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT joshipradyumna distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT kalmadyrajesh distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT karnaniurvashi distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT kumarvaibhav distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT lapkawojciech distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT quickrobert distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT tarragonjacobo distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT teigescott distributedmonitoringinfrastructureforworldwidelhccomputinggrid AT triantafyllidischristos distributedmonitoringinfrastructureforworldwidelhccomputinggrid |