Cargando…
BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST
ROOT is a data analysis framework used in many different experiments across CERN, and beyond. The TMVA library, which forms part of ROOT, is in particu- lar a machine learning toolkit for multivariate analysis. In particular, amongst its various machine learning tools, it provides a robust boosted de...
Autor principal: | |
---|---|
Lenguaje: | eng |
Publicado: |
2021
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2781067 |
_version_ | 1780971916339707904 |
---|---|
author | Mifsud, Xandru |
author_facet | Mifsud, Xandru |
author_sort | Mifsud, Xandru |
collection | CERN |
description | ROOT is a data analysis framework used in many different experiments across CERN, and beyond. The TMVA library, which forms part of ROOT, is in particu- lar a machine learning toolkit for multivariate analysis. In particular, amongst its various machine learning tools, it provides a robust boosted decisions trees (BDT) implementation. The primary task at hand was then to benchmark the CPU and memory performance of TMVA’s BDT implementation, and see how it fairs. Ideally, we would benchmark against some known implementation which is con- sidered to be the ‘gold standard’, which in this case is XGBoost. Different machine learning libraries have different specifications on how data is to be formatted for input. To this extent, besides benchmarking, we also required a means by which we can convert training and testing data–sets prepared by TMVA into a format which is readable by XGBoost. Lastly, we required that any written benchmarks should employ the use of Google Benchmark and be integrated into rootbench, a benchmarking library for ROOT. |
id | cern-2781067 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2021 |
record_format | invenio |
spelling | cern-27810672021-09-14T21:14:26Zhttp://cds.cern.ch/record/2781067engMifsud, XandruBENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOSTPhysics in GeneralROOT is a data analysis framework used in many different experiments across CERN, and beyond. The TMVA library, which forms part of ROOT, is in particu- lar a machine learning toolkit for multivariate analysis. In particular, amongst its various machine learning tools, it provides a robust boosted decisions trees (BDT) implementation. The primary task at hand was then to benchmark the CPU and memory performance of TMVA’s BDT implementation, and see how it fairs. Ideally, we would benchmark against some known implementation which is con- sidered to be the ‘gold standard’, which in this case is XGBoost. Different machine learning libraries have different specifications on how data is to be formatted for input. To this extent, besides benchmarking, we also required a means by which we can convert training and testing data–sets prepared by TMVA into a format which is readable by XGBoost. Lastly, we required that any written benchmarks should employ the use of Google Benchmark and be integrated into rootbench, a benchmarking library for ROOT.CERN-STUDENTS-Note-2021-156oai:cds.cern.ch:27810672021-09-14 |
spellingShingle | Physics in General Mifsud, Xandru BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST |
title | BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST |
title_full | BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST |
title_fullStr | BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST |
title_full_unstemmed | BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST |
title_short | BENCHMARKING OF ROOT BOOSTED DECISION TREES AND XGBOOST |
title_sort | benchmarking of root boosted decision trees and xgboost |
topic | Physics in General |
url | http://cds.cern.ch/record/2781067 |
work_keys_str_mv | AT mifsudxandru benchmarkingofrootboosteddecisiontreesandxgboost |