Cargando…

Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking

Providers of computing services such as data science clouds need to maintain large hardware infrastructures often with thousands of nodes. Using commodity hardware leads to heterogeneous setups that differ significantly in individual nodes' performance, which must be understood to allow for acc...

Descripción completa

Detalles Bibliográficos
Autores principales: Nieke, Christian, Balke, Wolf-Tilo
Lenguaje:eng
Publicado: 2017
Materias:
Acceso en línea:https://dx.doi.org/10.1109/CLOUD.2017.32
http://cds.cern.ch/record/2320710
_version_ 1780958564522655744
author Nieke, Christian
Balke, Wolf-Tilo
author_facet Nieke, Christian
Balke, Wolf-Tilo
author_sort Nieke, Christian
collection CERN
description Providers of computing services such as data science clouds need to maintain large hardware infrastructures often with thousands of nodes. Using commodity hardware leads to heterogeneous setups that differ significantly in individual nodes' performance, which must be understood to allow for accounting, strategic planning, and to identify problems and bottle-necks. Today's method of choice are active benchmarks, but they disturb normal operations and are too expensive to run continuously. They also struggle to be representative of an ever changing workload. We therefore design a passive benchmark-ing technique, which computes expressive and accurate perfor-mance metrics based on actual workloads. We prove the quality and performance benefits of our passive benchmark on a prac-tical workload in one of the world's largest scientific computing infrastructures, the CERN Computing Center. In fact, our approach allows continuous benchmarking of the active system, while avoiding costs in terms of downtime and achieves prediction quality comparable to the state-of-the-art approach of active benchmarking
id oai-inspirehep.net-1675010
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2017
record_format invenio
spelling oai-inspirehep.net-16750102019-09-30T06:29:59Zdoi:10.1109/CLOUD.2017.32http://cds.cern.ch/record/2320710engNieke, ChristianBalke, Wolf-TiloMonitoring Performance in Large Scale Computing Clouds with Passive BenchmarkingComputing and ComputersProviders of computing services such as data science clouds need to maintain large hardware infrastructures often with thousands of nodes. Using commodity hardware leads to heterogeneous setups that differ significantly in individual nodes' performance, which must be understood to allow for accounting, strategic planning, and to identify problems and bottle-necks. Today's method of choice are active benchmarks, but they disturb normal operations and are too expensive to run continuously. They also struggle to be representative of an ever changing workload. We therefore design a passive benchmark-ing technique, which computes expressive and accurate perfor-mance metrics based on actual workloads. We prove the quality and performance benefits of our passive benchmark on a prac-tical workload in one of the world's largest scientific computing infrastructures, the CERN Computing Center. In fact, our approach allows continuous benchmarking of the active system, while avoiding costs in terms of downtime and achieves prediction quality comparable to the state-of-the-art approach of active benchmarkingoai:inspirehep.net:16750102017
spellingShingle Computing and Computers
Nieke, Christian
Balke, Wolf-Tilo
Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking
title Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking
title_full Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking
title_fullStr Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking
title_full_unstemmed Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking
title_short Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking
title_sort monitoring performance in large scale computing clouds with passive benchmarking
topic Computing and Computers
url https://dx.doi.org/10.1109/CLOUD.2017.32
http://cds.cern.ch/record/2320710
work_keys_str_mv AT niekechristian monitoringperformanceinlargescalecomputingcloudswithpassivebenchmarking
AT balkewolftilo monitoringperformanceinlargescalecomputingcloudswithpassivebenchmarking