Cargando…
KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications
We study the problem of automatically detecting if a given multi-class classifier operates outside of its specifications (out-of-specs), i.e. on input data from a different distribution than what it was trained for. This is an important problem to solve on the road towards creating reliable computer...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7152589/ https://www.ncbi.nlm.nih.gov/pubmed/32313381 http://dx.doi.org/10.1007/s11263-019-01232-x |
_version_ | 1783521510138314752 |
---|---|
author | Sun, Rémy Lampert, Christoph H. |
author_facet | Sun, Rémy Lampert, Christoph H. |
author_sort | Sun, Rémy |
collection | PubMed |
description | We study the problem of automatically detecting if a given multi-class classifier operates outside of its specifications (out-of-specs), i.e. on input data from a different distribution than what it was trained for. This is an important problem to solve on the road towards creating reliable computer vision systems for real-world applications, because the quality of a classifier’s predictions cannot be guaranteed if it operates out-of-specs. Previously proposed methods for out-of-specs detection make decisions on the level of single inputs. This, however, is insufficient to achieve low false positive rate and high false negative rates at the same time. In this work, we describe a new procedure named KS(conf), based on statistical reasoning. Its main component is a classical Kolmogorov–Smirnov test that is applied to the set of predicted confidence values for batches of samples. Working with batches instead of single samples allows increasing the true positive rate without negatively affecting the false positive rate, thereby overcoming a crucial limitation of single sample tests. We show by extensive experiments using a variety of convolutional network architectures and datasets that KS(conf) reliably detects out-of-specs situations even under conditions where other tests fail. It furthermore has a number of properties that make it an excellent candidate for practical deployment: it is easy to implement, adds almost no overhead to the system, works with any classifier that outputs confidence scores, and requires no a priori knowledge about how the data distribution could change. |
format | Online Article Text |
id | pubmed-7152589 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-71525892020-04-18 KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications Sun, Rémy Lampert, Christoph H. Int J Comput Vis Article We study the problem of automatically detecting if a given multi-class classifier operates outside of its specifications (out-of-specs), i.e. on input data from a different distribution than what it was trained for. This is an important problem to solve on the road towards creating reliable computer vision systems for real-world applications, because the quality of a classifier’s predictions cannot be guaranteed if it operates out-of-specs. Previously proposed methods for out-of-specs detection make decisions on the level of single inputs. This, however, is insufficient to achieve low false positive rate and high false negative rates at the same time. In this work, we describe a new procedure named KS(conf), based on statistical reasoning. Its main component is a classical Kolmogorov–Smirnov test that is applied to the set of predicted confidence values for batches of samples. Working with batches instead of single samples allows increasing the true positive rate without negatively affecting the false positive rate, thereby overcoming a crucial limitation of single sample tests. We show by extensive experiments using a variety of convolutional network architectures and datasets that KS(conf) reliably detects out-of-specs situations even under conditions where other tests fail. It furthermore has a number of properties that make it an excellent candidate for practical deployment: it is easy to implement, adds almost no overhead to the system, works with any classifier that outputs confidence scores, and requires no a priori knowledge about how the data distribution could change. Springer US 2019-10-10 2020 /pmc/articles/PMC7152589/ /pubmed/32313381 http://dx.doi.org/10.1007/s11263-019-01232-x Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Sun, Rémy Lampert, Christoph H. KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications |
title | KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications |
title_full | KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications |
title_fullStr | KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications |
title_full_unstemmed | KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications |
title_short | KS(conf): A Light-Weight Test if a Multiclass Classifier Operates Outside of Its Specifications |
title_sort | ks(conf): a light-weight test if a multiclass classifier operates outside of its specifications |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7152589/ https://www.ncbi.nlm.nih.gov/pubmed/32313381 http://dx.doi.org/10.1007/s11263-019-01232-x |
work_keys_str_mv | AT sunremy ksconfalightweighttestifamulticlassclassifieroperatesoutsideofitsspecifications AT lampertchristophh ksconfalightweighttestifamulticlassclassifieroperatesoutsideofitsspecifications |