Cargando…
Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking
Providers of computing services such as data science clouds need to maintain large hardware infrastructures often with thousands of nodes. Using commodity hardware leads to heterogeneous setups that differ significantly in individual nodes' performance, which must be understood to allow for acc...
Autores principales: | , |
---|---|
Lenguaje: | eng |
Publicado: |
2017
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1109/CLOUD.2017.32 http://cds.cern.ch/record/2320710 |
_version_ | 1780958564522655744 |
---|---|
author | Nieke, Christian Balke, Wolf-Tilo |
author_facet | Nieke, Christian Balke, Wolf-Tilo |
author_sort | Nieke, Christian |
collection | CERN |
description | Providers of computing services such as data science clouds need to maintain large hardware infrastructures often with thousands of nodes. Using commodity hardware leads to heterogeneous setups that differ significantly in individual nodes' performance, which must be understood to allow for accounting, strategic planning, and to identify problems and bottle-necks. Today's method of choice are active benchmarks, but they disturb normal operations and are too expensive to run continuously. They also struggle to be representative of an ever changing workload. We therefore design a passive benchmark-ing technique, which computes expressive and accurate perfor-mance metrics based on actual workloads. We prove the quality and performance benefits of our passive benchmark on a prac-tical workload in one of the world's largest scientific computing infrastructures, the CERN Computing Center. In fact, our approach allows continuous benchmarking of the active system, while avoiding costs in terms of downtime and achieves prediction quality comparable to the state-of-the-art approach of active benchmarking |
id | oai-inspirehep.net-1675010 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2017 |
record_format | invenio |
spelling | oai-inspirehep.net-16750102019-09-30T06:29:59Zdoi:10.1109/CLOUD.2017.32http://cds.cern.ch/record/2320710engNieke, ChristianBalke, Wolf-TiloMonitoring Performance in Large Scale Computing Clouds with Passive BenchmarkingComputing and ComputersProviders of computing services such as data science clouds need to maintain large hardware infrastructures often with thousands of nodes. Using commodity hardware leads to heterogeneous setups that differ significantly in individual nodes' performance, which must be understood to allow for accounting, strategic planning, and to identify problems and bottle-necks. Today's method of choice are active benchmarks, but they disturb normal operations and are too expensive to run continuously. They also struggle to be representative of an ever changing workload. We therefore design a passive benchmark-ing technique, which computes expressive and accurate perfor-mance metrics based on actual workloads. We prove the quality and performance benefits of our passive benchmark on a prac-tical workload in one of the world's largest scientific computing infrastructures, the CERN Computing Center. In fact, our approach allows continuous benchmarking of the active system, while avoiding costs in terms of downtime and achieves prediction quality comparable to the state-of-the-art approach of active benchmarkingoai:inspirehep.net:16750102017 |
spellingShingle | Computing and Computers Nieke, Christian Balke, Wolf-Tilo Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking |
title | Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking |
title_full | Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking |
title_fullStr | Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking |
title_full_unstemmed | Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking |
title_short | Monitoring Performance in Large Scale Computing Clouds with Passive Benchmarking |
title_sort | monitoring performance in large scale computing clouds with passive benchmarking |
topic | Computing and Computers |
url | https://dx.doi.org/10.1109/CLOUD.2017.32 http://cds.cern.ch/record/2320710 |
work_keys_str_mv | AT niekechristian monitoringperformanceinlargescalecomputingcloudswithpassivebenchmarking AT balkewolftilo monitoringperformanceinlargescalecomputingcloudswithpassivebenchmarking |