Cargando…

Faster RooFitting: Automated parallel calculation of collaborative statistical models

RooFit [1, 2] is the main statistical modeling and fitting package used to extract physical parameters from reduced particle collision data, e.g. the Higgs boson experiments at the LHC [3, 4]. RooFit aims to separate particle physics model building and fitting (the users’ goals) from their technical...

Descripción completa

Detalles Bibliográficos
Autores principales: Bos, E G Patrick, Burgard, Carsten D, Croft, Vincent A, Hageboeck, Stephan, Moneta, Lorenzo, Pelupessy, Inti, Attema, Jisk J, Verkerke, Wouter
Lenguaje:eng
Publicado: 2020
Materias:
Acceso en línea:https://dx.doi.org/10.1051/epjconf/202024506027
http://cds.cern.ch/record/2752190
_version_ 1780969335351672832
author Bos, E G Patrick
Burgard, Carsten D
Croft, Vincent A
Hageboeck, Stephan
Moneta, Lorenzo
Pelupessy, Inti
Attema, Jisk J
Verkerke, Wouter
author_facet Bos, E G Patrick
Burgard, Carsten D
Croft, Vincent A
Hageboeck, Stephan
Moneta, Lorenzo
Pelupessy, Inti
Attema, Jisk J
Verkerke, Wouter
author_sort Bos, E G Patrick
collection CERN
description RooFit [1, 2] is the main statistical modeling and fitting package used to extract physical parameters from reduced particle collision data, e.g. the Higgs boson experiments at the LHC [3, 4]. RooFit aims to separate particle physics model building and fitting (the users’ goals) from their technical implementation and optimization in the back-end. In this paper, we outline our efforts to further optimize this back-end by automatically running parts of user models in parallel on multi-core machines. A major challenge is that RooFit allows users to define many different types of models, with different types of computational bottlenecks. Our automatic parallelization framework must then be flexible, while still reducing run time by at least an order of magnitude, preferably more. We have performed extensive benchmarks and identified at least three bottlenecks that will benefit from parallelization. We designed a parallelization framework that allows us to parallelize likelihood minimization with high performance by splitting over partial derivatives in the minimizer. The basis of the framework is a task queue approach. Preliminary results show speed-ups of factor 2 to 20, depending on the exact model and parallelization strategy.
id oai-inspirehep.net-1832224
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2020
record_format invenio
spelling oai-inspirehep.net-18322242021-02-18T19:23:10Zdoi:10.1051/epjconf/202024506027http://cds.cern.ch/record/2752190engBos, E G PatrickBurgard, Carsten DCroft, Vincent AHageboeck, StephanMoneta, LorenzoPelupessy, IntiAttema, Jisk JVerkerke, WouterFaster RooFitting: Automated parallel calculation of collaborative statistical modelsComputing and ComputersRooFit [1, 2] is the main statistical modeling and fitting package used to extract physical parameters from reduced particle collision data, e.g. the Higgs boson experiments at the LHC [3, 4]. RooFit aims to separate particle physics model building and fitting (the users’ goals) from their technical implementation and optimization in the back-end. In this paper, we outline our efforts to further optimize this back-end by automatically running parts of user models in parallel on multi-core machines. A major challenge is that RooFit allows users to define many different types of models, with different types of computational bottlenecks. Our automatic parallelization framework must then be flexible, while still reducing run time by at least an order of magnitude, preferably more. We have performed extensive benchmarks and identified at least three bottlenecks that will benefit from parallelization. We designed a parallelization framework that allows us to parallelize likelihood minimization with high performance by splitting over partial derivatives in the minimizer. The basis of the framework is a task queue approach. Preliminary results show speed-ups of factor 2 to 20, depending on the exact model and parallelization strategy.oai:inspirehep.net:18322242020
spellingShingle Computing and Computers
Bos, E G Patrick
Burgard, Carsten D
Croft, Vincent A
Hageboeck, Stephan
Moneta, Lorenzo
Pelupessy, Inti
Attema, Jisk J
Verkerke, Wouter
Faster RooFitting: Automated parallel calculation of collaborative statistical models
title Faster RooFitting: Automated parallel calculation of collaborative statistical models
title_full Faster RooFitting: Automated parallel calculation of collaborative statistical models
title_fullStr Faster RooFitting: Automated parallel calculation of collaborative statistical models
title_full_unstemmed Faster RooFitting: Automated parallel calculation of collaborative statistical models
title_short Faster RooFitting: Automated parallel calculation of collaborative statistical models
title_sort faster roofitting: automated parallel calculation of collaborative statistical models
topic Computing and Computers
url https://dx.doi.org/10.1051/epjconf/202024506027
http://cds.cern.ch/record/2752190
work_keys_str_mv AT bosegpatrick fasterroofittingautomatedparallelcalculationofcollaborativestatisticalmodels
AT burgardcarstend fasterroofittingautomatedparallelcalculationofcollaborativestatisticalmodels
AT croftvincenta fasterroofittingautomatedparallelcalculationofcollaborativestatisticalmodels
AT hageboeckstephan fasterroofittingautomatedparallelcalculationofcollaborativestatisticalmodels
AT monetalorenzo fasterroofittingautomatedparallelcalculationofcollaborativestatisticalmodels
AT pelupessyinti fasterroofittingautomatedparallelcalculationofcollaborativestatisticalmodels
AT attemajiskj fasterroofittingautomatedparallelcalculationofcollaborativestatisticalmodels
AT verkerkewouter fasterroofittingautomatedparallelcalculationofcollaborativestatisticalmodels