Cargando…
Statistical independence for the evaluation of classifier-based diagnosis
Machine learning techniques are increasingly adopted in computer-aided diagnosis. Evaluation methods for classification results that are based on the study of one or more metrics can be unable to distinguish between cases in which the classifier is discriminating the classes from cases in which it i...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4883157/ https://www.ncbi.nlm.nih.gov/pubmed/27747500 http://dx.doi.org/10.1007/s40708-014-0007-6 |
_version_ | 1782434222329298944 |
---|---|
author | Olivetti, Emanuele Greiner, Susanne Avesani, Paolo |
author_facet | Olivetti, Emanuele Greiner, Susanne Avesani, Paolo |
author_sort | Olivetti, Emanuele |
collection | PubMed |
description | Machine learning techniques are increasingly adopted in computer-aided diagnosis. Evaluation methods for classification results that are based on the study of one or more metrics can be unable to distinguish between cases in which the classifier is discriminating the classes from cases in which it is not. In the binary setting, such circumstances can be encountered when data are unbalanced with respect to the diagnostic groups. Having more healthy controls than pathological subjects, datasets meant for diagnosis frequently show a certain degree of unbalancedness. In this work, we propose to recast the evaluation of classification results as a test of statistical independence between the predicted and the actual diagnostic groups. We address the problem within the Bayesian hypothesis testing framework. Different from the standard metrics, the proposed method is able to handle unbalanced data and takes into account the size of the available data. We show experimental evidence of the efficacy of the approach both on simulated data and on real data about the diagnosis of the Attention Deficit Hyperactivity Disorder (ADHD). |
format | Online Article Text |
id | pubmed-4883157 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-48831572016-08-19 Statistical independence for the evaluation of classifier-based diagnosis Olivetti, Emanuele Greiner, Susanne Avesani, Paolo Brain Inform Article Machine learning techniques are increasingly adopted in computer-aided diagnosis. Evaluation methods for classification results that are based on the study of one or more metrics can be unable to distinguish between cases in which the classifier is discriminating the classes from cases in which it is not. In the binary setting, such circumstances can be encountered when data are unbalanced with respect to the diagnostic groups. Having more healthy controls than pathological subjects, datasets meant for diagnosis frequently show a certain degree of unbalancedness. In this work, we propose to recast the evaluation of classification results as a test of statistical independence between the predicted and the actual diagnostic groups. We address the problem within the Bayesian hypothesis testing framework. Different from the standard metrics, the proposed method is able to handle unbalanced data and takes into account the size of the available data. We show experimental evidence of the efficacy of the approach both on simulated data and on real data about the diagnosis of the Attention Deficit Hyperactivity Disorder (ADHD). Springer Berlin Heidelberg 2014-12-11 /pmc/articles/PMC4883157/ /pubmed/27747500 http://dx.doi.org/10.1007/s40708-014-0007-6 Text en © The Author(s) 2014 https://creativecommons.org/licenses/by/4.0/ Open AccessThis article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited. |
spellingShingle | Article Olivetti, Emanuele Greiner, Susanne Avesani, Paolo Statistical independence for the evaluation of classifier-based diagnosis |
title | Statistical independence for the evaluation of classifier-based diagnosis |
title_full | Statistical independence for the evaluation of classifier-based diagnosis |
title_fullStr | Statistical independence for the evaluation of classifier-based diagnosis |
title_full_unstemmed | Statistical independence for the evaluation of classifier-based diagnosis |
title_short | Statistical independence for the evaluation of classifier-based diagnosis |
title_sort | statistical independence for the evaluation of classifier-based diagnosis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4883157/ https://www.ncbi.nlm.nih.gov/pubmed/27747500 http://dx.doi.org/10.1007/s40708-014-0007-6 |
work_keys_str_mv | AT olivettiemanuele statisticalindependencefortheevaluationofclassifierbaseddiagnosis AT greinersusanne statisticalindependencefortheevaluationofclassifierbaseddiagnosis AT avesanipaolo statisticalindependencefortheevaluationofclassifierbaseddiagnosis |