Cargando…
Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples
<!--HTML-->The common approach for constructing a classifier for particle selection assumes reasonable consistency between train data samples and the target data sample used for the particular analysis. However, train and target data may have very different properties, like energy spectra for...
Autor principal: | |
---|---|
Lenguaje: | eng |
Publicado: |
2021
|
Materias: | |
Acceso en línea: | http://cds.cern.ch/record/2767116 |
_version_ | 1780971270175719424 |
---|---|
author | Philippov, Anton |
author_facet | Philippov, Anton |
author_sort | Philippov, Anton |
collection | CERN |
description | <!--HTML-->The common approach for constructing a classifier for particle selection assumes reasonable consistency between train data samples and the target data sample used for the particular analysis. However, train and target data may have very different properties, like energy spectra for signal and background contributions. We suggest using ensemble of pre-trained classifiers, each of which is trained on exclusive subset of the total dataset, data baskets. Appropriate separate adjustment of separation thresholds for every basket classifier allows to dynamically adjust combined classifier and make optimal prediction for data with differing properties without re-training of the classifier. The approach is illustrated with a toy example. Quality dependency on the number of used data baskets is also presented |
id | cern-2767116 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2021 |
record_format | invenio |
spelling | cern-27671162022-11-02T22:25:40Zhttp://cds.cern.ch/record/2767116engPhilippov, AntonBasket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples25th International Conference on Computing in High Energy & Nuclear PhysicsConferences<!--HTML-->The common approach for constructing a classifier for particle selection assumes reasonable consistency between train data samples and the target data sample used for the particular analysis. However, train and target data may have very different properties, like energy spectra for signal and background contributions. We suggest using ensemble of pre-trained classifiers, each of which is trained on exclusive subset of the total dataset, data baskets. Appropriate separate adjustment of separation thresholds for every basket classifier allows to dynamically adjust combined classifier and make optimal prediction for data with differing properties without re-training of the classifier. The approach is illustrated with a toy example. Quality dependency on the number of used data baskets is also presentedoai:cds.cern.ch:27671162021 |
spellingShingle | Conferences Philippov, Anton Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples |
title | Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples |
title_full | Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples |
title_fullStr | Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples |
title_full_unstemmed | Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples |
title_short | Basket Classifier: Fast and Optimal Restructuring of theClassifier for Differing Train and Target Samples |
title_sort | basket classifier: fast and optimal restructuring of theclassifier for differing train and target samples |
topic | Conferences |
url | http://cds.cern.ch/record/2767116 |
work_keys_str_mv | AT philippovanton basketclassifierfastandoptimalrestructuringoftheclassifierfordifferingtrainandtargetsamples AT philippovanton 25thinternationalconferenceoncomputinginhighenergynuclearphysics |