Cargando…

BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST

ROOT is a data analysis framework used in many different experiments across CERN, and beyond. The TMVA library, which forms part of ROOT, is in particu- lar a machine learning toolkit for multivariate analysis. In particular, amongst its various machine learning tools, it provides a robust boosted de...

Descripción completa

Detalles Bibliográficos
Autor principal: Mifsud, Xandru
Lenguaje:eng
Publicado: 2021
Materias:
Acceso en línea:http://cds.cern.ch/record/2781067
_version_ 1780971916339707904
author Mifsud, Xandru
author_facet Mifsud, Xandru
author_sort Mifsud, Xandru
collection CERN
description ROOT is a data analysis framework used in many different experiments across CERN, and beyond. The TMVA library, which forms part of ROOT, is in particu- lar a machine learning toolkit for multivariate analysis. In particular, amongst its various machine learning tools, it provides a robust boosted decisions trees (BDT) implementation. The primary task at hand was then to benchmark the CPU and memory performance of TMVA’s BDT implementation, and see how it fairs. Ideally, we would benchmark against some known implementation which is con- sidered to be the ‘gold standard’, which in this case is XGBoost. Different machine learning libraries have different specifications on how data is to be formatted for input. To this extent, besides benchmarking, we also required a means by which we can convert training and testing data–sets prepared by TMVA into a format which is readable by XGBoost. Lastly, we required that any written benchmarks should employ the use of Google Benchmark and be integrated into rootbench, a benchmarking library for ROOT.
id cern-2781067
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2021
record_format invenio
spelling cern-27810672021-09-14T21:14:26Zhttp://cds.cern.ch/record/2781067engMifsud, XandruBENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOSTPhysics in GeneralROOT is a data analysis framework used in many different experiments across CERN, and beyond. The TMVA library, which forms part of ROOT, is in particu- lar a machine learning toolkit for multivariate analysis. In particular, amongst its various machine learning tools, it provides a robust boosted decisions trees (BDT) implementation. The primary task at hand was then to benchmark the CPU and memory performance of TMVA’s BDT implementation, and see how it fairs. Ideally, we would benchmark against some known implementation which is con- sidered to be the ‘gold standard’, which in this case is XGBoost. Different machine learning libraries have different specifications on how data is to be formatted for input. To this extent, besides benchmarking, we also required a means by which we can convert training and testing data–sets prepared by TMVA into a format which is readable by XGBoost. Lastly, we required that any written benchmarks should employ the use of Google Benchmark and be integrated into rootbench, a benchmarking library for ROOT.CERN-STUDENTS-Note-2021-156oai:cds.cern.ch:27810672021-09-14
spellingShingle Physics in General
Mifsud, Xandru
BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST
title BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST
title_full BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST
title_fullStr BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST
title_full_unstemmed BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST
title_short BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST
title_sort benchmarking of root boosted decision trees and xgboost
topic Physics in General
url http://cds.cern.ch/record/2781067
work_keys_str_mv AT mifsudxandru benchmarkingofrootboosteddecisiontreesandxgboost