Cargando…

Efficiency of different measures for defining the applicability domain of classification models

The goal of defining an applicability domain for a predictive classification model is to identify the region in chemical space where the model’s predictions are reliable. The boundary of the applicability domain is defined with the help of a measure that shall reflect the reliability of an individua...

Descripción completa

Detalles Bibliográficos
Autores principales:	Klingspohn, Waldemar, Mathea, Miriam, ter Laak, Antonius, Heinrich, Nikolaus, Baumann, Knut
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2017
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5543028/ https://www.ncbi.nlm.nih.gov/pubmed/29086213 http://dx.doi.org/10.1186/s13321-017-0230-2

_version_	1783255072474398720
author	Klingspohn, Waldemar Mathea, Miriam ter Laak, Antonius Heinrich, Nikolaus Baumann, Knut
author_facet	Klingspohn, Waldemar Mathea, Miriam ter Laak, Antonius Heinrich, Nikolaus Baumann, Knut
author_sort	Klingspohn, Waldemar
collection	PubMed
description	The goal of defining an applicability domain for a predictive classification model is to identify the region in chemical space where the model’s predictions are reliable. The boundary of the applicability domain is defined with the help of a measure that shall reflect the reliability of an individual prediction. Here, the available measures are differentiated into those that flag unusual objects and which are independent of the original classifier and those that use information of the trained classifier. The former set of techniques is referred to as novelty detection while the latter is designated as confidence estimation. A review of the available confidence estimators shows that most of these measures estimate the probability of class membership of the predicted objects which is inversely related to the error probability. Thus, class probability estimates are natural candidates for defining the applicability domain but were not comprehensively included in previous benchmark studies. The focus of the present study is to find the best measure for defining the applicability domain for a given binary classification technique and to determine the performance of novelty detection versus confidence estimation. Six different binary classification techniques in combination with ten data sets were studied to benchmark the various measures. The area under the receiver operating characteristic curve (AUC ROC) was employed as main benchmark criterion. It is shown that class probability estimates constantly perform best to differentiate between reliable and unreliable predictions. Previously proposed alternatives to class probability estimates do not perform better than the latter and are inferior in most cases. Interestingly, the impact of defining an applicability domain depends on the observed area under the receiver operator characteristic curve. That means that it depends on the level of difficulty of the classification problem (expressed as AUC ROC) and will be largest for intermediately difficult problems (range AUC ROC 0.7–0.9). In the ranking of classifiers, classification random forests performed best on average. Hence, classification random forests in combination with the respective class probability estimate are a good starting point for predictive binary chemoinformatic classifiers with applicability domain. [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-017-0230-2) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-5543028
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-55430282017-08-18 Efficiency of different measures for defining the applicability domain of classification models Klingspohn, Waldemar Mathea, Miriam ter Laak, Antonius Heinrich, Nikolaus Baumann, Knut J Cheminform Research Article The goal of defining an applicability domain for a predictive classification model is to identify the region in chemical space where the model’s predictions are reliable. The boundary of the applicability domain is defined with the help of a measure that shall reflect the reliability of an individual prediction. Here, the available measures are differentiated into those that flag unusual objects and which are independent of the original classifier and those that use information of the trained classifier. The former set of techniques is referred to as novelty detection while the latter is designated as confidence estimation. A review of the available confidence estimators shows that most of these measures estimate the probability of class membership of the predicted objects which is inversely related to the error probability. Thus, class probability estimates are natural candidates for defining the applicability domain but were not comprehensively included in previous benchmark studies. The focus of the present study is to find the best measure for defining the applicability domain for a given binary classification technique and to determine the performance of novelty detection versus confidence estimation. Six different binary classification techniques in combination with ten data sets were studied to benchmark the various measures. The area under the receiver operating characteristic curve (AUC ROC) was employed as main benchmark criterion. It is shown that class probability estimates constantly perform best to differentiate between reliable and unreliable predictions. Previously proposed alternatives to class probability estimates do not perform better than the latter and are inferior in most cases. Interestingly, the impact of defining an applicability domain depends on the observed area under the receiver operator characteristic curve. That means that it depends on the level of difficulty of the classification problem (expressed as AUC ROC) and will be largest for intermediately difficult problems (range AUC ROC 0.7–0.9). In the ranking of classifiers, classification random forests performed best on average. Hence, classification random forests in combination with the respective class probability estimate are a good starting point for predictive binary chemoinformatic classifiers with applicability domain. [Figure: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-017-0230-2) contains supplementary material, which is available to authorized users. Springer International Publishing 2017-08-03 /pmc/articles/PMC5543028/ /pubmed/29086213 http://dx.doi.org/10.1186/s13321-017-0230-2 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Klingspohn, Waldemar Mathea, Miriam ter Laak, Antonius Heinrich, Nikolaus Baumann, Knut Efficiency of different measures for defining the applicability domain of classification models
title	Efficiency of different measures for defining the applicability domain of classification models
title_full	Efficiency of different measures for defining the applicability domain of classification models
title_fullStr	Efficiency of different measures for defining the applicability domain of classification models
title_full_unstemmed	Efficiency of different measures for defining the applicability domain of classification models
title_short	Efficiency of different measures for defining the applicability domain of classification models
title_sort	efficiency of different measures for defining the applicability domain of classification models
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5543028/ https://www.ncbi.nlm.nih.gov/pubmed/29086213 http://dx.doi.org/10.1186/s13321-017-0230-2
work_keys_str_mv	AT klingspohnwaldemar efficiencyofdifferentmeasuresfordefiningtheapplicabilitydomainofclassificationmodels AT matheamiriam efficiencyofdifferentmeasuresfordefiningtheapplicabilitydomainofclassificationmodels AT terlaakantonius efficiencyofdifferentmeasuresfordefiningtheapplicabilitydomainofclassificationmodels AT heinrichnikolaus efficiencyofdifferentmeasuresfordefiningtheapplicabilitydomainofclassificationmodels AT baumannknut efficiencyofdifferentmeasuresfordefiningtheapplicabilitydomainofclassificationmodels

Efficiency of different measures for defining the applicability domain of classification models

Ejemplares similares