Cargando…

The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets

Binary classifiers are routinely evaluated with performance measures such as sensitivity and specificity, and performance is frequently illustrated with Receiver Operating Characteristics (ROC) plots. Alternative measures such as positive predictive value (PPV) and the associated Precision/Recall (P...

Descripción completa

Detalles Bibliográficos
Autores principales: Saito, Takaya, Rehmsmeier, Marc
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4349800/
https://www.ncbi.nlm.nih.gov/pubmed/25738806
http://dx.doi.org/10.1371/journal.pone.0118432
_version_ 1782360088706547712
author Saito, Takaya
Rehmsmeier, Marc
author_facet Saito, Takaya
Rehmsmeier, Marc
author_sort Saito, Takaya
collection PubMed
description Binary classifiers are routinely evaluated with performance measures such as sensitivity and specificity, and performance is frequently illustrated with Receiver Operating Characteristics (ROC) plots. Alternative measures such as positive predictive value (PPV) and the associated Precision/Recall (PRC) plots are used less frequently. Many bioinformatics studies develop and evaluate classifiers that are to be applied to strongly imbalanced datasets in which the number of negatives outweighs the number of positives significantly. While ROC plots are visually appealing and provide an overview of a classifier's performance across a wide range of specificities, one can ask whether ROC plots could be misleading when applied in imbalanced classification scenarios. We show here that the visual interpretability of ROC plots in the context of imbalanced datasets can be deceptive with respect to conclusions about the reliability of classification performance, owing to an intuitive but wrong interpretation of specificity. PRC plots, on the other hand, can provide the viewer with an accurate prediction of future classification performance due to the fact that they evaluate the fraction of true positives among positive predictions. Our findings have potential implications for the interpretation of a large number of studies that use ROC plots on imbalanced datasets.
format Online
Article
Text
id pubmed-4349800
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-43498002015-03-17 The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets Saito, Takaya Rehmsmeier, Marc PLoS One Research Article Binary classifiers are routinely evaluated with performance measures such as sensitivity and specificity, and performance is frequently illustrated with Receiver Operating Characteristics (ROC) plots. Alternative measures such as positive predictive value (PPV) and the associated Precision/Recall (PRC) plots are used less frequently. Many bioinformatics studies develop and evaluate classifiers that are to be applied to strongly imbalanced datasets in which the number of negatives outweighs the number of positives significantly. While ROC plots are visually appealing and provide an overview of a classifier's performance across a wide range of specificities, one can ask whether ROC plots could be misleading when applied in imbalanced classification scenarios. We show here that the visual interpretability of ROC plots in the context of imbalanced datasets can be deceptive with respect to conclusions about the reliability of classification performance, owing to an intuitive but wrong interpretation of specificity. PRC plots, on the other hand, can provide the viewer with an accurate prediction of future classification performance due to the fact that they evaluate the fraction of true positives among positive predictions. Our findings have potential implications for the interpretation of a large number of studies that use ROC plots on imbalanced datasets. Public Library of Science 2015-03-04 /pmc/articles/PMC4349800/ /pubmed/25738806 http://dx.doi.org/10.1371/journal.pone.0118432 Text en © 2015 Saito, Rehmsmeier http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Saito, Takaya
Rehmsmeier, Marc
The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets
title The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets
title_full The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets
title_fullStr The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets
title_full_unstemmed The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets
title_short The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets
title_sort precision-recall plot is more informative than the roc plot when evaluating binary classifiers on imbalanced datasets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4349800/
https://www.ncbi.nlm.nih.gov/pubmed/25738806
http://dx.doi.org/10.1371/journal.pone.0118432
work_keys_str_mv AT saitotakaya theprecisionrecallplotismoreinformativethantherocplotwhenevaluatingbinaryclassifiersonimbalanceddatasets
AT rehmsmeiermarc theprecisionrecallplotismoreinformativethantherocplotwhenevaluatingbinaryclassifiersonimbalanceddatasets
AT saitotakaya precisionrecallplotismoreinformativethantherocplotwhenevaluatingbinaryclassifiersonimbalanceddatasets
AT rehmsmeiermarc precisionrecallplotismoreinformativethantherocplotwhenevaluatingbinaryclassifiersonimbalanceddatasets