Cargando…
Preference-Driven Classification Measure
Classification is one of the main problems of machine learning, and assessing the quality of classification is one of the most topical tasks, all the more difficult as it depends on many factors. Many different measures have been proposed to assess the quality of the classification, often depending...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9032202/ https://www.ncbi.nlm.nih.gov/pubmed/35455193 http://dx.doi.org/10.3390/e24040531 |
_version_ | 1784692583418560512 |
---|---|
author | Kozak, Jan Probierz, Barbara Kania, Krzysztof Juszczuk, Przemysław |
author_facet | Kozak, Jan Probierz, Barbara Kania, Krzysztof Juszczuk, Przemysław |
author_sort | Kozak, Jan |
collection | PubMed |
description | Classification is one of the main problems of machine learning, and assessing the quality of classification is one of the most topical tasks, all the more difficult as it depends on many factors. Many different measures have been proposed to assess the quality of the classification, often depending on the application of a specific classifier. However, in most cases, these measures are focused on binary classification, and for the problem of many decision classes, they are significantly simplified. Due to the increasing scope of classification applications, there is a growing need to select a classifier appropriate to the situation, including more complex data sets with multiple decision classes. This paper aims to propose a new measure of classifier quality assessment (called the preference-driven measure, abbreviated p-d), regardless of the number of classes, with the possibility of establishing the relative importance of each class. Furthermore, we propose a solution in which the classifier’s assessment can be adapted to the analyzed problem using a vector of preferences. To visualize the operation of the proposed measure, we present it first on an example involving two decision classes and then test its operation on real, multi-class data sets. Additionally, in this case, we demonstrate how to adjust the assessment to the user’s preferences. The results obtained allow us to confirm that the use of a preference-driven measure indicates that other classifiers are better to use according to preferences, particularly as opposed to the classical measures of classification quality assessment. |
format | Online Article Text |
id | pubmed-9032202 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-90322022022-04-23 Preference-Driven Classification Measure Kozak, Jan Probierz, Barbara Kania, Krzysztof Juszczuk, Przemysław Entropy (Basel) Article Classification is one of the main problems of machine learning, and assessing the quality of classification is one of the most topical tasks, all the more difficult as it depends on many factors. Many different measures have been proposed to assess the quality of the classification, often depending on the application of a specific classifier. However, in most cases, these measures are focused on binary classification, and for the problem of many decision classes, they are significantly simplified. Due to the increasing scope of classification applications, there is a growing need to select a classifier appropriate to the situation, including more complex data sets with multiple decision classes. This paper aims to propose a new measure of classifier quality assessment (called the preference-driven measure, abbreviated p-d), regardless of the number of classes, with the possibility of establishing the relative importance of each class. Furthermore, we propose a solution in which the classifier’s assessment can be adapted to the analyzed problem using a vector of preferences. To visualize the operation of the proposed measure, we present it first on an example involving two decision classes and then test its operation on real, multi-class data sets. Additionally, in this case, we demonstrate how to adjust the assessment to the user’s preferences. The results obtained allow us to confirm that the use of a preference-driven measure indicates that other classifiers are better to use according to preferences, particularly as opposed to the classical measures of classification quality assessment. MDPI 2022-04-10 /pmc/articles/PMC9032202/ /pubmed/35455193 http://dx.doi.org/10.3390/e24040531 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Kozak, Jan Probierz, Barbara Kania, Krzysztof Juszczuk, Przemysław Preference-Driven Classification Measure |
title | Preference-Driven Classification Measure |
title_full | Preference-Driven Classification Measure |
title_fullStr | Preference-Driven Classification Measure |
title_full_unstemmed | Preference-Driven Classification Measure |
title_short | Preference-Driven Classification Measure |
title_sort | preference-driven classification measure |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9032202/ https://www.ncbi.nlm.nih.gov/pubmed/35455193 http://dx.doi.org/10.3390/e24040531 |
work_keys_str_mv | AT kozakjan preferencedrivenclassificationmeasure AT probierzbarbara preferencedrivenclassificationmeasure AT kaniakrzysztof preferencedrivenclassificationmeasure AT juszczukprzemysław preferencedrivenclassificationmeasure |