Cargando…

Statistical independence for the evaluation of classifier-based diagnosis

Machine learning techniques are increasingly adopted in computer-aided diagnosis. Evaluation methods for classification results that are based on the study of one or more metrics can be unable to distinguish between cases in which the classifier is discriminating the classes from cases in which it i...

Descripción completa

Detalles Bibliográficos
Autores principales: Olivetti, Emanuele, Greiner, Susanne, Avesani, Paolo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4883157/
https://www.ncbi.nlm.nih.gov/pubmed/27747500
http://dx.doi.org/10.1007/s40708-014-0007-6
_version_ 1782434222329298944
author Olivetti, Emanuele
Greiner, Susanne
Avesani, Paolo
author_facet Olivetti, Emanuele
Greiner, Susanne
Avesani, Paolo
author_sort Olivetti, Emanuele
collection PubMed
description Machine learning techniques are increasingly adopted in computer-aided diagnosis. Evaluation methods for classification results that are based on the study of one or more metrics can be unable to distinguish between cases in which the classifier is discriminating the classes from cases in which it is not. In the binary setting, such circumstances can be encountered when data are unbalanced with respect to the diagnostic groups. Having more healthy controls than pathological subjects, datasets meant for diagnosis frequently show a certain degree of unbalancedness. In this work, we propose to recast the evaluation of classification results as a test of statistical independence between the predicted and the actual diagnostic groups. We address the problem within the Bayesian hypothesis testing framework. Different from the standard metrics, the proposed method is able to handle unbalanced data and takes into account the size of the available data. We show experimental evidence of the efficacy of the approach both on simulated data and on real data about the diagnosis of the Attention Deficit Hyperactivity Disorder (ADHD).
format Online
Article
Text
id pubmed-4883157
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-48831572016-08-19 Statistical independence for the evaluation of classifier-based diagnosis Olivetti, Emanuele Greiner, Susanne Avesani, Paolo Brain Inform Article Machine learning techniques are increasingly adopted in computer-aided diagnosis. Evaluation methods for classification results that are based on the study of one or more metrics can be unable to distinguish between cases in which the classifier is discriminating the classes from cases in which it is not. In the binary setting, such circumstances can be encountered when data are unbalanced with respect to the diagnostic groups. Having more healthy controls than pathological subjects, datasets meant for diagnosis frequently show a certain degree of unbalancedness. In this work, we propose to recast the evaluation of classification results as a test of statistical independence between the predicted and the actual diagnostic groups. We address the problem within the Bayesian hypothesis testing framework. Different from the standard metrics, the proposed method is able to handle unbalanced data and takes into account the size of the available data. We show experimental evidence of the efficacy of the approach both on simulated data and on real data about the diagnosis of the Attention Deficit Hyperactivity Disorder (ADHD). Springer Berlin Heidelberg 2014-12-11 /pmc/articles/PMC4883157/ /pubmed/27747500 http://dx.doi.org/10.1007/s40708-014-0007-6 Text en © The Author(s) 2014 https://creativecommons.org/licenses/by/4.0/ Open AccessThis article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.
spellingShingle Article
Olivetti, Emanuele
Greiner, Susanne
Avesani, Paolo
Statistical independence for the evaluation of classifier-based diagnosis
title Statistical independence for the evaluation of classifier-based diagnosis
title_full Statistical independence for the evaluation of classifier-based diagnosis
title_fullStr Statistical independence for the evaluation of classifier-based diagnosis
title_full_unstemmed Statistical independence for the evaluation of classifier-based diagnosis
title_short Statistical independence for the evaluation of classifier-based diagnosis
title_sort statistical independence for the evaluation of classifier-based diagnosis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4883157/
https://www.ncbi.nlm.nih.gov/pubmed/27747500
http://dx.doi.org/10.1007/s40708-014-0007-6
work_keys_str_mv AT olivettiemanuele statisticalindependencefortheevaluationofclassifierbaseddiagnosis
AT greinersusanne statisticalindependencefortheevaluationofclassifierbaseddiagnosis
AT avesanipaolo statisticalindependencefortheevaluationofclassifierbaseddiagnosis