Cargando…

CMSSW Performance Monitoring on processors based on the Intel Core and Nehalem Microarchitectures

CPU clock frequency is not likely to be increased significantly in the coming years, and data analysis speed can be improved by using more processors or buying new machines, only if one is willing to change the programming paradigm to a parallel one. Therefore, performance monitoring procedures and...

Descripción completa

Detalles Bibliográficos
Autor principal: Kruse, Daniele Francesco
Lenguaje:eng
Publicado: 2011
Materias:
Acceso en línea:http://cds.cern.ch/record/1345301
_version_ 1780922198757736448
author Kruse, Daniele Francesco
author_facet Kruse, Daniele Francesco
author_sort Kruse, Daniele Francesco
collection CERN
description CPU clock frequency is not likely to be increased significantly in the coming years, and data analysis speed can be improved by using more processors or buying new machines, only if one is willing to change the programming paradigm to a parallel one. Therefore, performance monitoring procedures and tools are needed to help programmers to optimize existing software running on current and future hardware, without having to redesign it completely. Low level information from hardware performance counters is vital to spot specific performance problems slowing program execution. HEP software is often huge and complex, and existing tools are unable to give results with the required granularity. I will report on the approach I have chosen to solve this problem on CMSSW. It involves decomposing the application into parts and monitoring each one of them separately. Both counting and sampling methods are used to allow an analysis with the required custom granularity: from global level, up to the function level. A set of tools (based on perfmon2 - a software interface to hardware counters) has been developed and deployed. This document is a report concerning the work done at CERN on performance monitoring of CMSSW, using a 4-way analysis based on hardware performance counters. I will present the Intel Core and Nehalem microarchitectures and the methodology used to analyse the software, the Cycle Accounting Analysis. Next I will describe the 4 analysis approaches used (Overall, Symbol, Module Level and Modular Symbol Level analysis) and their complementary usefulness in performance evaluation. A few paragraphs are dedicated to a basic explanation of the inner workings of the tools implemented
id cern-1345301
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2011
record_format invenio
spelling cern-13453012019-09-30T06:29:59Zhttp://cds.cern.ch/record/1345301engKruse, Daniele FrancescoCMSSW Performance Monitoring on processors based on the Intel Core and Nehalem MicroarchitecturesDetectors and Experimental TechniquesCPU clock frequency is not likely to be increased significantly in the coming years, and data analysis speed can be improved by using more processors or buying new machines, only if one is willing to change the programming paradigm to a parallel one. Therefore, performance monitoring procedures and tools are needed to help programmers to optimize existing software running on current and future hardware, without having to redesign it completely. Low level information from hardware performance counters is vital to spot specific performance problems slowing program execution. HEP software is often huge and complex, and existing tools are unable to give results with the required granularity. I will report on the approach I have chosen to solve this problem on CMSSW. It involves decomposing the application into parts and monitoring each one of them separately. Both counting and sampling methods are used to allow an analysis with the required custom granularity: from global level, up to the function level. A set of tools (based on perfmon2 - a software interface to hardware counters) has been developed and deployed. This document is a report concerning the work done at CERN on performance monitoring of CMSSW, using a 4-way analysis based on hardware performance counters. I will present the Intel Core and Nehalem microarchitectures and the methodology used to analyse the software, the Cycle Accounting Analysis. Next I will describe the 4 analysis approaches used (Overall, Symbol, Module Level and Modular Symbol Level analysis) and their complementary usefulness in performance evaluation. A few paragraphs are dedicated to a basic explanation of the inner workings of the tools implementedCMS-NOTE-2011-001oai:cds.cern.ch:13453012011-03-25
spellingShingle Detectors and Experimental Techniques
Kruse, Daniele Francesco
CMSSW Performance Monitoring on processors based on the Intel Core and Nehalem Microarchitectures
title CMSSW Performance Monitoring on processors based on the Intel Core and Nehalem Microarchitectures
title_full CMSSW Performance Monitoring on processors based on the Intel Core and Nehalem Microarchitectures
title_fullStr CMSSW Performance Monitoring on processors based on the Intel Core and Nehalem Microarchitectures
title_full_unstemmed CMSSW Performance Monitoring on processors based on the Intel Core and Nehalem Microarchitectures
title_short CMSSW Performance Monitoring on processors based on the Intel Core and Nehalem Microarchitectures
title_sort cmssw performance monitoring on processors based on the intel core and nehalem microarchitectures
topic Detectors and Experimental Techniques
url http://cds.cern.ch/record/1345301
work_keys_str_mv AT krusedanielefrancesco cmsswperformancemonitoringonprocessorsbasedontheintelcoreandnehalemmicroarchitectures