Cargando…

Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples

<!--HTML-->The common approach for constructing a classifier for particle selection assumes reasonable consistency between train data samples and the target data sample used for the particular analysis. However, train and target data may have very different properties, like energy spectra for...

Descripción completa

Detalles Bibliográficos
Autor principal: Philippov, Anton
Lenguaje:eng
Publicado: 2021
Materias:
Acceso en línea:http://cds.cern.ch/record/2767116
_version_ 1780971270175719424
author Philippov, Anton
author_facet Philippov, Anton
author_sort Philippov, Anton
collection CERN
description <!--HTML-->The common approach for constructing a classifier for particle selection assumes reasonable consistency between train data samples and the target data sample used for the particular analysis. However, train and target data may have very different properties, like energy spectra for signal and background contributions. We suggest using ensemble of pre-trained classifiers, each of which is trained on exclusive subset of the total dataset, data baskets. Appropriate separate adjustment of separation thresholds for every basket classifier allows to dynamically adjust combined classifier and make optimal prediction for data with differing properties without re-training of the classifier. The approach is illustrated with a toy example. Quality dependency on the number of used data baskets is also presented
id cern-2767116
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2021
record_format invenio
spelling cern-27671162022-11-02T22:25:40Zhttp://cds.cern.ch/record/2767116engPhilippov, AntonBasket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples25th International Conference on Computing in High Energy & Nuclear PhysicsConferences<!--HTML-->The common approach for constructing a classifier for particle selection assumes reasonable consistency between train data samples and the target data sample used for the particular analysis. However, train and target data may have very different properties, like energy spectra for signal and background contributions. We suggest using ensemble of pre-trained classifiers, each of which is trained on exclusive subset of the total dataset, data baskets. Appropriate separate adjustment of separation thresholds for every basket classifier allows to dynamically adjust combined classifier and make optimal prediction for data with differing properties without re-training of the classifier. The approach is illustrated with a toy example. Quality dependency on the number of used data baskets is also presentedoai:cds.cern.ch:27671162021
spellingShingle Conferences
Philippov, Anton
Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples
title Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples
title_full Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples
title_fullStr Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples
title_full_unstemmed Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples
title_short Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples
title_sort basket classifier: fast and optimal restructuring of theclassifier for differing train and target samples
topic Conferences
url http://cds.cern.ch/record/2767116
work_keys_str_mv AT philippovanton basketclassifierfastandoptimalrestructuringoftheclassifierfordifferingtrainandtargetsamples
AT philippovanton 25thinternationalconferenceoncomputinginhighenergynuclearphysics