Cargando…

Computation of the distribution of model accuracy statistics in machine learning: Comparison between analytically derived distributions and simulation‐based methods

BACKGROUND AND AIMS: All fields have seen an increase in machine‐learning techniques. To accurately evaluate the efficacy of novel modeling methods, it is necessary to conduct a critical evaluation of the utilized model metrics, such as sensitivity, specificity, and area under the receiver operator...

Descripción completa

Detalles Bibliográficos
Autores principales:	Huang, Alexander A., Huang, Samuel Y.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	John Wiley and Sons Inc. 2023
Materias:	Original Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10119581/ https://www.ncbi.nlm.nih.gov/pubmed/37091362 http://dx.doi.org/10.1002/hsr2.1214

_version_	1785029046375022592
author	Huang, Alexander A. Huang, Samuel Y.
author_facet	Huang, Alexander A. Huang, Samuel Y.
author_sort	Huang, Alexander A.
collection	PubMed
description	BACKGROUND AND AIMS: All fields have seen an increase in machine‐learning techniques. To accurately evaluate the efficacy of novel modeling methods, it is necessary to conduct a critical evaluation of the utilized model metrics, such as sensitivity, specificity, and area under the receiver operator characteristic curve (AUROC). For commonly used model metrics, we proposed the use of analytically derived distributions (ADDs) and compared it with simulation‐based approaches. METHODS: A retrospective cohort study was conducted using the England National Health Services Heart Disease Prediction Cohort. Four machine learning models (XGBoost, Random Forest, Artificial Neural Network, and Adaptive Boost) were used. The distribution of the model metrics and covariate gain statistics were empirically derived using boot‐strap simulation (N = 10,000). The ADDs were created from analytic formulas from the covariates to describe the distribution of the model metrics and compared with those of bootstrap simulation. RESULTS: XGBoost had the most optimal model having the highest AUROC and the highest aggregate score considering six other model metrics. Based on the Anderson–Darling test, the distribution of the model metrics created from bootstrap did not significantly deviate from a normal distribution. The variance created from the ADD led to smaller SDs than those derived from bootstrap simulation, whereas the rest of the distribution remained not statistically significantly different. CONCLUSIONS: ADD allows for cross study comparison of model metrics, which is usually done with bootstrapping that rely on simulations, which cannot be replicated by the reader.
format	Online Article Text
id	pubmed-10119581
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	John Wiley and Sons Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-101195812023-04-22 Computation of the distribution of model accuracy statistics in machine learning: Comparison between analytically derived distributions and simulation‐based methods Huang, Alexander A. Huang, Samuel Y. Health Sci Rep Original Research BACKGROUND AND AIMS: All fields have seen an increase in machine‐learning techniques. To accurately evaluate the efficacy of novel modeling methods, it is necessary to conduct a critical evaluation of the utilized model metrics, such as sensitivity, specificity, and area under the receiver operator characteristic curve (AUROC). For commonly used model metrics, we proposed the use of analytically derived distributions (ADDs) and compared it with simulation‐based approaches. METHODS: A retrospective cohort study was conducted using the England National Health Services Heart Disease Prediction Cohort. Four machine learning models (XGBoost, Random Forest, Artificial Neural Network, and Adaptive Boost) were used. The distribution of the model metrics and covariate gain statistics were empirically derived using boot‐strap simulation (N = 10,000). The ADDs were created from analytic formulas from the covariates to describe the distribution of the model metrics and compared with those of bootstrap simulation. RESULTS: XGBoost had the most optimal model having the highest AUROC and the highest aggregate score considering six other model metrics. Based on the Anderson–Darling test, the distribution of the model metrics created from bootstrap did not significantly deviate from a normal distribution. The variance created from the ADD led to smaller SDs than those derived from bootstrap simulation, whereas the rest of the distribution remained not statistically significantly different. CONCLUSIONS: ADD allows for cross study comparison of model metrics, which is usually done with bootstrapping that rely on simulations, which cannot be replicated by the reader. John Wiley and Sons Inc. 2023-04-20 /pmc/articles/PMC10119581/ /pubmed/37091362 http://dx.doi.org/10.1002/hsr2.1214 Text en © 2023 The Authors. Health Science Reports published by Wiley Periodicals LLC. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Research Huang, Alexander A. Huang, Samuel Y. Computation of the distribution of model accuracy statistics in machine learning: Comparison between analytically derived distributions and simulation‐based methods
title	Computation of the distribution of model accuracy statistics in machine learning: Comparison between analytically derived distributions and simulation‐based methods
title_full	Computation of the distribution of model accuracy statistics in machine learning: Comparison between analytically derived distributions and simulation‐based methods
title_fullStr	Computation of the distribution of model accuracy statistics in machine learning: Comparison between analytically derived distributions and simulation‐based methods
title_full_unstemmed	Computation of the distribution of model accuracy statistics in machine learning: Comparison between analytically derived distributions and simulation‐based methods
title_short	Computation of the distribution of model accuracy statistics in machine learning: Comparison between analytically derived distributions and simulation‐based methods
title_sort	computation of the distribution of model accuracy statistics in machine learning: comparison between analytically derived distributions and simulation‐based methods
topic	Original Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10119581/ https://www.ncbi.nlm.nih.gov/pubmed/37091362 http://dx.doi.org/10.1002/hsr2.1214
work_keys_str_mv	AT huangalexandera computationofthedistributionofmodelaccuracystatisticsinmachinelearningcomparisonbetweenanalyticallyderiveddistributionsandsimulationbasedmethods AT huangsamuely computationofthedistributionofmodelaccuracystatisticsinmachinelearningcomparisonbetweenanalyticallyderiveddistributionsandsimulationbasedmethods

Computation of the distribution of model accuracy statistics in machine learning: Comparison between analytically derived distributions and simulation‐based methods

Ejemplares similares