Cargando…

KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications

We study the problem of automatically detecting if a given multi-class classifier operates outside of its specifications (out-of-specs), i.e. on input data from a different distribution than what it was trained for. This is an important problem to solve on the road towards creating reliable computer...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Rémy, Lampert, Christoph H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7152589/
https://www.ncbi.nlm.nih.gov/pubmed/32313381
http://dx.doi.org/10.1007/s11263-019-01232-x
_version_ 1783521510138314752
author Sun, Rémy
Lampert, Christoph H.
author_facet Sun, Rémy
Lampert, Christoph H.
author_sort Sun, Rémy
collection PubMed
description We study the problem of automatically detecting if a given multi-class classifier operates outside of its specifications (out-of-specs), i.e. on input data from a different distribution than what it was trained for. This is an important problem to solve on the road towards creating reliable computer vision systems for real-world applications, because the quality of a classifier’s predictions cannot be guaranteed if it operates out-of-specs. Previously proposed methods for out-of-specs detection make decisions on the level of single inputs. This, however, is insufficient to achieve low false positive rate and high false negative rates at the same time. In this work, we describe a new procedure named KS(conf), based on statistical reasoning. Its main component is a classical Kolmogorov–Smirnov test that is applied to the set of predicted confidence values for batches of samples. Working with batches instead of single samples allows increasing the true positive rate without negatively affecting the false positive rate, thereby overcoming a crucial limitation of single sample tests. We show by extensive experiments using a variety of convolutional network architectures and datasets that KS(conf) reliably detects out-of-specs situations even under conditions where other tests fail. It furthermore has a number of properties that make it an excellent candidate for practical deployment: it is easy to implement, adds almost no overhead to the system, works with any classifier that outputs confidence scores, and requires no a priori knowledge about how the data distribution could change.
format Online
Article
Text
id pubmed-7152589
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-71525892020-04-18 KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications Sun, Rémy Lampert, Christoph H. Int J Comput Vis Article We study the problem of automatically detecting if a given multi-class classifier operates outside of its specifications (out-of-specs), i.e. on input data from a different distribution than what it was trained for. This is an important problem to solve on the road towards creating reliable computer vision systems for real-world applications, because the quality of a classifier’s predictions cannot be guaranteed if it operates out-of-specs. Previously proposed methods for out-of-specs detection make decisions on the level of single inputs. This, however, is insufficient to achieve low false positive rate and high false negative rates at the same time. In this work, we describe a new procedure named KS(conf), based on statistical reasoning. Its main component is a classical Kolmogorov–Smirnov test that is applied to the set of predicted confidence values for batches of samples. Working with batches instead of single samples allows increasing the true positive rate without negatively affecting the false positive rate, thereby overcoming a crucial limitation of single sample tests. We show by extensive experiments using a variety of convolutional network architectures and datasets that KS(conf) reliably detects out-of-specs situations even under conditions where other tests fail. It furthermore has a number of properties that make it an excellent candidate for practical deployment: it is easy to implement, adds almost no overhead to the system, works with any classifier that outputs confidence scores, and requires no a priori knowledge about how the data distribution could change. Springer US 2019-10-10 2020 /pmc/articles/PMC7152589/ /pubmed/32313381 http://dx.doi.org/10.1007/s11263-019-01232-x Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Sun, Rémy
Lampert, Christoph H.
KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications
title KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications
title_full KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications
title_fullStr KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications
title_full_unstemmed KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications
title_short KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications
title_sort ks(conf): a light-weight test if a multiclass classifier operates outside of its specifications
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7152589/
https://www.ncbi.nlm.nih.gov/pubmed/32313381
http://dx.doi.org/10.1007/s11263-019-01232-x
work_keys_str_mv AT sunremy ksconfalightweighttestifamulticlassclassifieroperatesoutsideofitsspecifications
AT lampertchristophh ksconfalightweighttestifamulticlassclassifieroperatesoutsideofitsspecifications