Cargando…

The SysMES Framework: System Management for Networked Embedded Systems and Clusters

Automated system management for large distributed and heterogeneous environments is a common challenge in modern computer sciences. Desired properties of such a management system are, among others, a minimal dependency on human operators for problem recognition and solution, adaptability to increasi...

Descripción completa

Detalles Bibliográficos
Autor principal: Lara Martinez, Camilo Ernesto
Lenguaje:eng
Publicado: 2012
Materias:
Acceso en línea:http://cds.cern.ch/record/1454269
_version_ 1780925011904692224
author Lara Martinez, Camilo Ernesto
author_facet Lara Martinez, Camilo Ernesto
author_sort Lara Martinez, Camilo Ernesto
collection CERN
description Automated system management for large distributed and heterogeneous environments is a common challenge in modern computer sciences. Desired properties of such a management system are, among others, a minimal dependency on human operators for problem recognition and solution, adaptability to increasing loads, fault tolerance and the flexibility to integrate new management resources at runtime. Existing tools address parts of these requirements however there is no single integrated framework which possesses all mentioned characteristics. SysMES was developed as an integrated framework for automated monitoring and management of networked devices. In order to achieve the requirements of scalability and fault tolerance, a fully distributed and decentralized architecture has been chosen. The framework comprises a monitoring module, a rule engine and an executive module for the execution of actions. A formal language has been defined which allows administrators to define complex spatial and temporal rule conditions for failure states and according reactions. These rules are used in order to reduce the number and duration of manual interventions in the managed environment by automated problem solution. SysMES is based on standards ensuring interoperability and manufacturer independence. The object-oriented modeling of management resources allows several abstraction levels for handling the complexity of managing large and heterogeneous environments. Management resources can be extended and (re)configured without downtime for increased flexibility. Multiple tests and a reference installation demonstrate the suitability of SysMES for automated management of large heterogeneous environments.
id cern-1454269
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2012
record_format invenio
spelling cern-14542692019-09-30T06:29:59Zhttp://cds.cern.ch/record/1454269engLara Martinez, Camilo ErnestoThe SysMES Framework: System Management for Networked Embedded Systems and ClustersComputing and ComputersAutomated system management for large distributed and heterogeneous environments is a common challenge in modern computer sciences. Desired properties of such a management system are, among others, a minimal dependency on human operators for problem recognition and solution, adaptability to increasing loads, fault tolerance and the flexibility to integrate new management resources at runtime. Existing tools address parts of these requirements however there is no single integrated framework which possesses all mentioned characteristics. SysMES was developed as an integrated framework for automated monitoring and management of networked devices. In order to achieve the requirements of scalability and fault tolerance, a fully distributed and decentralized architecture has been chosen. The framework comprises a monitoring module, a rule engine and an executive module for the execution of actions. A formal language has been defined which allows administrators to define complex spatial and temporal rule conditions for failure states and according reactions. These rules are used in order to reduce the number and duration of manual interventions in the managed environment by automated problem solution. SysMES is based on standards ensuring interoperability and manufacturer independence. The object-oriented modeling of management resources allows several abstraction levels for handling the complexity of managing large and heterogeneous environments. Management resources can be extended and (re)configured without downtime for increased flexibility. Multiple tests and a reference installation demonstrate the suitability of SysMES for automated management of large heterogeneous environments.CERN-THESIS-2011-242oai:cds.cern.ch:14542692012-06-05T19:59:58Z
spellingShingle Computing and Computers
Lara Martinez, Camilo Ernesto
The SysMES Framework: System Management for Networked Embedded Systems and Clusters
title The SysMES Framework: System Management for Networked Embedded Systems and Clusters
title_full The SysMES Framework: System Management for Networked Embedded Systems and Clusters
title_fullStr The SysMES Framework: System Management for Networked Embedded Systems and Clusters
title_full_unstemmed The SysMES Framework: System Management for Networked Embedded Systems and Clusters
title_short The SysMES Framework: System Management for Networked Embedded Systems and Clusters
title_sort sysmes framework: system management for networked embedded systems and clusters
topic Computing and Computers
url http://cds.cern.ch/record/1454269
work_keys_str_mv AT laramartinezcamiloernesto thesysmesframeworksystemmanagementfornetworkedembeddedsystemsandclusters
AT laramartinezcamiloernesto sysmesframeworksystemmanagementfornetworkedembeddedsystemsandclusters