Cargando…

Towards Monitoring-as-a-service for Scientific Computing Cloud applications using the ElasticSearch ecosystem

The INFN computing centre in Torino hosts a private Cloud, which is managed with the OpenNebula cloud controller. The infrastructure offers Infrastructure-as-a-Service (IaaS) and Platform-as-a-Service (PaaS) services to different scientific computing applications. The main stakeholders of the facili...

Descripción completa

Detalles Bibliográficos
Autores principales: Bagnasco, S, Berzano, D, Guarise, A, Lusso, S, Masera, M, Vallero, S
Lenguaje:eng
Publicado: 2015
Materias:
Acceso en línea:https://dx.doi.org/10.1088/1742-6596/664/2/022040
http://cds.cern.ch/record/2134541
_version_ 1780949901136363520
author Bagnasco, S
Berzano, D
Guarise, A
Lusso, S
Masera, M
Vallero, S
author_facet Bagnasco, S
Berzano, D
Guarise, A
Lusso, S
Masera, M
Vallero, S
author_sort Bagnasco, S
collection CERN
description The INFN computing centre in Torino hosts a private Cloud, which is managed with the OpenNebula cloud controller. The infrastructure offers Infrastructure-as-a-Service (IaaS) and Platform-as-a-Service (PaaS) services to different scientific computing applications. The main stakeholders of the facility are a grid Tier-2 site for the ALICE collaboration at LHC, an interactive analysis facility for the same experiment and a grid Tier-2 site for the BESIII collaboration, plus an increasing number of other small tenants. The dynamic allocation of resources to tenants is partially automated. This feature requires detailed monitoring and accounting of the resource usage. We set up a monitoring framework to inspect the site activities both in terms of IaaS and applications running on the hosted virtual instances. For this purpose we used the ElasticSearch, Logstash and Kibana (ELK) stack. The infrastructure relies on a MySQL database back-end for data preservation and to ensure flexibility to choose a different monitoring solution if needed. The heterogeneous accounting information is transferred from the database to the ElasticSearch engine via a custom Logstash plugin. Each use-case is indexed separately in ElasticSearch and we setup a set of Kibana dashboards with pre-defined queries in order to monitor the relevant information in each case. For the IaaS metering, we developed sensors for the OpenNebula API. The IaaS level information gathered through the API is sent to the MySQL database through an ad-hoc developed RESTful web service. Moreover, we have developed a billing system for our private Cloud, which relies on the RabbitMQ message queue for asynchronous communication to the database and on the ELK stack for its graphical interface. The Italian Grid accounting framework is also migrating to a similar set-up. Concerning the application level, we used the Root plugin TProofMonSenderSQL to collect accounting data from the interactive analysis facility. The BESIII virtual instances used to be monitored with Zabbix, as a proof of concept we also retrieve the information contained in the Zabbix database. In this way we have achieved a uniform monitoring interface for both the IaaS and the scientific applications, mostly leveraging off-the-shelf tools. At present, we are working to define a model for monitoring-as-a-service, based on the tools described above, which the Cloud tenants can easily configure to suit their specific needs.
id oai-inspirehep.net-1413801
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2015
record_format invenio
spelling oai-inspirehep.net-14138012022-08-10T13:00:50Zdoi:10.1088/1742-6596/664/2/022040http://cds.cern.ch/record/2134541engBagnasco, SBerzano, DGuarise, ALusso, SMasera, MVallero, STowards Monitoring-as-a-service for Scientific Computing Cloud applications using the ElasticSearch ecosystemComputing and ComputersThe INFN computing centre in Torino hosts a private Cloud, which is managed with the OpenNebula cloud controller. The infrastructure offers Infrastructure-as-a-Service (IaaS) and Platform-as-a-Service (PaaS) services to different scientific computing applications. The main stakeholders of the facility are a grid Tier-2 site for the ALICE collaboration at LHC, an interactive analysis facility for the same experiment and a grid Tier-2 site for the BESIII collaboration, plus an increasing number of other small tenants. The dynamic allocation of resources to tenants is partially automated. This feature requires detailed monitoring and accounting of the resource usage. We set up a monitoring framework to inspect the site activities both in terms of IaaS and applications running on the hosted virtual instances. For this purpose we used the ElasticSearch, Logstash and Kibana (ELK) stack. The infrastructure relies on a MySQL database back-end for data preservation and to ensure flexibility to choose a different monitoring solution if needed. The heterogeneous accounting information is transferred from the database to the ElasticSearch engine via a custom Logstash plugin. Each use-case is indexed separately in ElasticSearch and we setup a set of Kibana dashboards with pre-defined queries in order to monitor the relevant information in each case. For the IaaS metering, we developed sensors for the OpenNebula API. The IaaS level information gathered through the API is sent to the MySQL database through an ad-hoc developed RESTful web service. Moreover, we have developed a billing system for our private Cloud, which relies on the RabbitMQ message queue for asynchronous communication to the database and on the ELK stack for its graphical interface. The Italian Grid accounting framework is also migrating to a similar set-up. Concerning the application level, we used the Root plugin TProofMonSenderSQL to collect accounting data from the interactive analysis facility. The BESIII virtual instances used to be monitored with Zabbix, as a proof of concept we also retrieve the information contained in the Zabbix database. In this way we have achieved a uniform monitoring interface for both the IaaS and the scientific applications, mostly leveraging off-the-shelf tools. At present, we are working to define a model for monitoring-as-a-service, based on the tools described above, which the Cloud tenants can easily configure to suit their specific needs.oai:inspirehep.net:14138012015
spellingShingle Computing and Computers
Bagnasco, S
Berzano, D
Guarise, A
Lusso, S
Masera, M
Vallero, S
Towards Monitoring-as-a-service for Scientific Computing Cloud applications using the ElasticSearch ecosystem
title Towards Monitoring-as-a-service for Scientific Computing Cloud applications using the ElasticSearch ecosystem
title_full Towards Monitoring-as-a-service for Scientific Computing Cloud applications using the ElasticSearch ecosystem
title_fullStr Towards Monitoring-as-a-service for Scientific Computing Cloud applications using the ElasticSearch ecosystem
title_full_unstemmed Towards Monitoring-as-a-service for Scientific Computing Cloud applications using the ElasticSearch ecosystem
title_short Towards Monitoring-as-a-service for Scientific Computing Cloud applications using the ElasticSearch ecosystem
title_sort towards monitoring-as-a-service for scientific computing cloud applications using the elasticsearch ecosystem
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/664/2/022040
http://cds.cern.ch/record/2134541
work_keys_str_mv AT bagnascos towardsmonitoringasaserviceforscientificcomputingcloudapplicationsusingtheelasticsearchecosystem
AT berzanod towardsmonitoringasaserviceforscientificcomputingcloudapplicationsusingtheelasticsearchecosystem
AT guarisea towardsmonitoringasaserviceforscientificcomputingcloudapplicationsusingtheelasticsearchecosystem
AT lussos towardsmonitoringasaserviceforscientificcomputingcloudapplicationsusingtheelasticsearchecosystem
AT maseram towardsmonitoringasaserviceforscientificcomputingcloudapplicationsusingtheelasticsearchecosystem
AT valleros towardsmonitoringasaserviceforscientificcomputingcloudapplicationsusingtheelasticsearchecosystem