
Distributed Monitoring Infrastructure for Worldwide LHC Computing Grid

The journey of a monitoring probe from its development phase to the moment its execution result is presented in an availability report is a complex process. It goes through multiple phases such as development, testing, integration, release, deployment, execution, data aggregation, computation, and r...


Bibliographic Details
Main authors: Andrade, Pedro, Babik, Marian, Bhatt, Kislay, Chand, Phool, Collados, David, Duggal, Vibhuti, Fuente, Paloma, Hayashi, Soichi, Imamagic, Emir, Joshi, Pradyumna, Kalmady, Rajesh, Karnani, Urvashi, Kumar, Vaibhav, Lapka, Wojciech, Quick, Robert, Tarragon, Jacobo, Teige, Scott, Triantafyllidis, Christos
Language: eng
Published: 2012
Subjects: Computing and Computers
Online access: https://dx.doi.org/10.1088/1742-6596/396/3/032002
http://cds.cern.ch/record/1457986
_version_ 1780925149078355968
author Andrade, Pedro
Babik, Marian
Bhatt, Kislay
Chand, Phool
Collados, David
Duggal, Vibhuti
Fuente, Paloma
Hayashi, Soichi
Imamagic, Emir
Joshi, Pradyumna
Kalmady, Rajesh
Karnani, Urvashi
Kumar, Vaibhav
Lapka, Wojciech
Quick, Robert
Tarragon, Jacobo
Teige, Scott
Triantafyllidis, Christos
author_sort Andrade, Pedro
collection CERN
description The journey of a monitoring probe from its development phase to the moment its execution result is presented in an availability report is a complex process. It goes through multiple phases such as development, testing, integration, release, deployment, execution, data aggregation, computation, and reporting. Further, it involves people with different roles (developers, site managers, VO managers, service managers, management), from different middleware providers (ARC, dCache, gLite, UNICORE and VDT), consortiums (WLCG, EMI, EGI, OSG), and operational teams (GOC, OMB, OTAG, CSIRT). The seamless harmonization of these distributed actors is in daily use for monitoring of the WLCG infrastructure. In this paper we describe the monitoring of the WLCG infrastructure from the operational perspective. We explain the complexity of the journey of a monitoring probe from its execution on a grid node to the visualization on the MyWLCG portal where it is exposed to other clients. This monitoring workflow profits from the interoperability established between the SAM and RSV frameworks. We show how these two distributed structures are capable of uniting technologies and hiding the complexity around them, making them easy to be used by the community. Finally, the different supported deployment strategies, tailored not only for monitoring the entire infrastructure but also for monitoring sites and virtual organizations, are presented and the associated operational benefits highlighted.
id cern-1457986
institution European Organization for Nuclear Research (CERN)
language eng
publishDate 2012
record_format invenio
spelling cern-1457986
2022-08-17T13:32:22Z
doi:10.1088/1742-6596/396/3/032002
http://cds.cern.ch/record/1457986
eng
CERN-IT-Note-2012-015
oai:cds.cern.ch:1457986
2012-06-26
title Distributed Monitoring Infrastructure for Worldwide LHC Computing Grid
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/396/3/032002
http://cds.cern.ch/record/1457986