Cargando…
The STONE Curve: A ROC‐Derived Model Performance Assessment Tool
A new model validation and performance assessment tool is introduced, the sliding threshold of observation for numeric evaluation (STONE) curve. It is based on the relative operating characteristic (ROC) curve technique, but instead of sorting all observations in a categorical classification, the ST...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7507778/ https://www.ncbi.nlm.nih.gov/pubmed/32999898 http://dx.doi.org/10.1029/2020EA001106 |
_version_ | 1783585297795121152 |
---|---|
author | Liemohn, Michael W. Azari, Abigail R. Ganushkina, Natalia Y. Rastätter, Lutz |
author_facet | Liemohn, Michael W. Azari, Abigail R. Ganushkina, Natalia Y. Rastätter, Lutz |
author_sort | Liemohn, Michael W. |
collection | PubMed |
description | A new model validation and performance assessment tool is introduced, the sliding threshold of observation for numeric evaluation (STONE) curve. It is based on the relative operating characteristic (ROC) curve technique, but instead of sorting all observations in a categorical classification, the STONE tool uses the continuous nature of the observations. Rather than defining events in the observations and then sliding the threshold only in the classifier/model data set, the threshold is changed simultaneously for both the observational and model values, with the same threshold value for both data and model. This is only possible if the observations are continuous and the model output is in the same units and scale as the observations, that is, the model is trying to exactly reproduce the data. The STONE curve has several similarities with the ROC curve—plotting probability of detection against probability of false detection, ranging from the (1,1) corner for low thresholds to the (0,0) corner for high thresholds, and values above the zero‐intercept unity‐slope line indicating better than random predictive ability. The main difference is that the STONE curve can be nonmonotonic, doubling back in both the x and y directions. These ripples reveal asymmetries in the data‐model value pairs. This new technique is applied to modeling output of a common geomagnetic activity index as well as energetic electron fluxes in the Earth's inner magnetosphere. It is not limited to space physics applications but can be used for any scientific or engineering field where numerical models are used to reproduce observations. |
format | Online Article Text |
id | pubmed-7507778 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-75077782020-09-28 The STONE Curve: A ROC‐Derived Model Performance Assessment Tool Liemohn, Michael W. Azari, Abigail R. Ganushkina, Natalia Y. Rastätter, Lutz Earth Space Sci Technical Reports: Methods A new model validation and performance assessment tool is introduced, the sliding threshold of observation for numeric evaluation (STONE) curve. It is based on the relative operating characteristic (ROC) curve technique, but instead of sorting all observations in a categorical classification, the STONE tool uses the continuous nature of the observations. Rather than defining events in the observations and then sliding the threshold only in the classifier/model data set, the threshold is changed simultaneously for both the observational and model values, with the same threshold value for both data and model. This is only possible if the observations are continuous and the model output is in the same units and scale as the observations, that is, the model is trying to exactly reproduce the data. The STONE curve has several similarities with the ROC curve—plotting probability of detection against probability of false detection, ranging from the (1,1) corner for low thresholds to the (0,0) corner for high thresholds, and values above the zero‐intercept unity‐slope line indicating better than random predictive ability. The main difference is that the STONE curve can be nonmonotonic, doubling back in both the x and y directions. These ripples reveal asymmetries in the data‐model value pairs. This new technique is applied to modeling output of a common geomagnetic activity index as well as energetic electron fluxes in the Earth's inner magnetosphere. It is not limited to space physics applications but can be used for any scientific or engineering field where numerical models are used to reproduce observations. John Wiley and Sons Inc. 2020-08-20 2020-08 /pmc/articles/PMC7507778/ /pubmed/32999898 http://dx.doi.org/10.1029/2020EA001106 Text en ©2020 The Authors. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Technical Reports: Methods Liemohn, Michael W. Azari, Abigail R. Ganushkina, Natalia Y. Rastätter, Lutz The STONE Curve: A ROC‐Derived Model Performance Assessment Tool |
title | The STONE Curve: A ROC‐Derived Model Performance Assessment Tool |
title_full | The STONE Curve: A ROC‐Derived Model Performance Assessment Tool |
title_fullStr | The STONE Curve: A ROC‐Derived Model Performance Assessment Tool |
title_full_unstemmed | The STONE Curve: A ROC‐Derived Model Performance Assessment Tool |
title_short | The STONE Curve: A ROC‐Derived Model Performance Assessment Tool |
title_sort | stone curve: a roc‐derived model performance assessment tool |
topic | Technical Reports: Methods |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7507778/ https://www.ncbi.nlm.nih.gov/pubmed/32999898 http://dx.doi.org/10.1029/2020EA001106 |
work_keys_str_mv | AT liemohnmichaelw thestonecurvearocderivedmodelperformanceassessmenttool AT azariabigailr thestonecurvearocderivedmodelperformanceassessmenttool AT ganushkinanataliay thestonecurvearocderivedmodelperformanceassessmenttool AT rastatterlutz thestonecurvearocderivedmodelperformanceassessmenttool AT liemohnmichaelw stonecurvearocderivedmodelperformanceassessmenttool AT azariabigailr stonecurvearocderivedmodelperformanceassessmenttool AT ganushkinanataliay stonecurvearocderivedmodelperformanceassessmenttool AT rastatterlutz stonecurvearocderivedmodelperformanceassessmenttool |