Cargando…

Preference-Driven Classification Measure

Classification is one of the main problems of machine learning, and assessing the quality of classification is one of the most topical tasks, all the more difficult as it depends on many factors. Many different measures have been proposed to assess the quality of the classification, often depending...

Descripción completa

Detalles Bibliográficos
Autores principales: Kozak, Jan, Probierz, Barbara, Kania, Krzysztof, Juszczuk, Przemysław
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9032202/
https://www.ncbi.nlm.nih.gov/pubmed/35455193
http://dx.doi.org/10.3390/e24040531
_version_ 1784692583418560512
author Kozak, Jan
Probierz, Barbara
Kania, Krzysztof
Juszczuk, Przemysław
author_facet Kozak, Jan
Probierz, Barbara
Kania, Krzysztof
Juszczuk, Przemysław
author_sort Kozak, Jan
collection PubMed
description Classification is one of the main problems of machine learning, and assessing the quality of classification is one of the most topical tasks, all the more difficult as it depends on many factors. Many different measures have been proposed to assess the quality of the classification, often depending on the application of a specific classifier. However, in most cases, these measures are focused on binary classification, and for the problem of many decision classes, they are significantly simplified. Due to the increasing scope of classification applications, there is a growing need to select a classifier appropriate to the situation, including more complex data sets with multiple decision classes. This paper aims to propose a new measure of classifier quality assessment (called the preference-driven measure, abbreviated p-d), regardless of the number of classes, with the possibility of establishing the relative importance of each class. Furthermore, we propose a solution in which the classifier’s assessment can be adapted to the analyzed problem using a vector of preferences. To visualize the operation of the proposed measure, we present it first on an example involving two decision classes and then test its operation on real, multi-class data sets. Additionally, in this case, we demonstrate how to adjust the assessment to the user’s preferences. The results obtained allow us to confirm that the use of a preference-driven measure indicates that other classifiers are better to use according to preferences, particularly as opposed to the classical measures of classification quality assessment.
format Online
Article
Text
id pubmed-9032202
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-90322022022-04-23 Preference-Driven Classification Measure Kozak, Jan Probierz, Barbara Kania, Krzysztof Juszczuk, Przemysław Entropy (Basel) Article Classification is one of the main problems of machine learning, and assessing the quality of classification is one of the most topical tasks, all the more difficult as it depends on many factors. Many different measures have been proposed to assess the quality of the classification, often depending on the application of a specific classifier. However, in most cases, these measures are focused on binary classification, and for the problem of many decision classes, they are significantly simplified. Due to the increasing scope of classification applications, there is a growing need to select a classifier appropriate to the situation, including more complex data sets with multiple decision classes. This paper aims to propose a new measure of classifier quality assessment (called the preference-driven measure, abbreviated p-d), regardless of the number of classes, with the possibility of establishing the relative importance of each class. Furthermore, we propose a solution in which the classifier’s assessment can be adapted to the analyzed problem using a vector of preferences. To visualize the operation of the proposed measure, we present it first on an example involving two decision classes and then test its operation on real, multi-class data sets. Additionally, in this case, we demonstrate how to adjust the assessment to the user’s preferences. The results obtained allow us to confirm that the use of a preference-driven measure indicates that other classifiers are better to use according to preferences, particularly as opposed to the classical measures of classification quality assessment. MDPI 2022-04-10 /pmc/articles/PMC9032202/ /pubmed/35455193 http://dx.doi.org/10.3390/e24040531 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kozak, Jan
Probierz, Barbara
Kania, Krzysztof
Juszczuk, Przemysław
Preference-Driven Classification Measure
title Preference-Driven Classification Measure
title_full Preference-Driven Classification Measure
title_fullStr Preference-Driven Classification Measure
title_full_unstemmed Preference-Driven Classification Measure
title_short Preference-Driven Classification Measure
title_sort preference-driven classification measure
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9032202/
https://www.ncbi.nlm.nih.gov/pubmed/35455193
http://dx.doi.org/10.3390/e24040531
work_keys_str_mv AT kozakjan preferencedrivenclassificationmeasure
AT probierzbarbara preferencedrivenclassificationmeasure
AT kaniakrzysztof preferencedrivenclassificationmeasure
AT juszczukprzemysław preferencedrivenclassificationmeasure