Cargando…

Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers

Understanding the strengths and weaknesses of machine learning (ML) algorithms is crucial to determine their scope of application. Here, we introduce the Diverse and Generative ML Benchmark (DIGEN), a collection of synthetic datasets for comprehensive, reproducible, and interpretable benchmarking of...

Descripción completa

Detalles Bibliográficos
Autores principales: Orzechowski, Patryk, Moore, Jason H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Association for the Advancement of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9683726/
https://www.ncbi.nlm.nih.gov/pubmed/36417520
http://dx.doi.org/10.1126/sciadv.abl4747
_version_ 1784835117215121408
author Orzechowski, Patryk
Moore, Jason H.
author_facet Orzechowski, Patryk
Moore, Jason H.
author_sort Orzechowski, Patryk
collection PubMed
description Understanding the strengths and weaknesses of machine learning (ML) algorithms is crucial to determine their scope of application. Here, we introduce the Diverse and Generative ML Benchmark (DIGEN), a collection of synthetic datasets for comprehensive, reproducible, and interpretable benchmarking of ML algorithms for classification of binary outcomes. The DIGEN resource consists of 40 mathematical functions that map continuous features to binary targets for creating synthetic datasets. These 40 functions were found using a heuristic algorithm designed to maximize the diversity of performance among multiple popular ML algorithms, thus providing a useful test suite for evaluating and comparing new methods. Access to the generative functions facilitates understanding of why a method performs poorly compared to other algorithms, thus providing ideas for improvement.
format Online
Article
Text
id pubmed-9683726
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Association for the Advancement of Science
record_format MEDLINE/PubMed
spelling pubmed-96837262022-12-05 Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers Orzechowski, Patryk Moore, Jason H. Sci Adv Social and Interdisciplinary Sciences Understanding the strengths and weaknesses of machine learning (ML) algorithms is crucial to determine their scope of application. Here, we introduce the Diverse and Generative ML Benchmark (DIGEN), a collection of synthetic datasets for comprehensive, reproducible, and interpretable benchmarking of ML algorithms for classification of binary outcomes. The DIGEN resource consists of 40 mathematical functions that map continuous features to binary targets for creating synthetic datasets. These 40 functions were found using a heuristic algorithm designed to maximize the diversity of performance among multiple popular ML algorithms, thus providing a useful test suite for evaluating and comparing new methods. Access to the generative functions facilitates understanding of why a method performs poorly compared to other algorithms, thus providing ideas for improvement. American Association for the Advancement of Science 2022-11-23 /pmc/articles/PMC9683726/ /pubmed/36417520 http://dx.doi.org/10.1126/sciadv.abl4747 Text en Copyright © 2022 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC). https://creativecommons.org/licenses/by-nc/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license (https://creativecommons.org/licenses/by-nc/4.0/) , which permits use, distribution, and reproduction in any medium, so long as the resultant use is not for commercial advantage and provided the original work is properly cited.
spellingShingle Social and Interdisciplinary Sciences
Orzechowski, Patryk
Moore, Jason H.
Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers
title Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers
title_full Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers
title_fullStr Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers
title_full_unstemmed Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers
title_short Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers
title_sort generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers
topic Social and Interdisciplinary Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9683726/
https://www.ncbi.nlm.nih.gov/pubmed/36417520
http://dx.doi.org/10.1126/sciadv.abl4747
work_keys_str_mv AT orzechowskipatryk generativeandreproduciblebenchmarksorcomprehensiveevaluationmachinelearningclassifiers
AT moorejasonh generativeandreproduciblebenchmarksorcomprehensiveevaluationmachinelearningclassifiers