Cargando…
Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers
Understanding the strengths and weaknesses of machine learning (ML) algorithms is crucial to determine their scope of application. Here, we introduce the Diverse and Generative ML Benchmark (DIGEN), a collection of synthetic datasets for comprehensive, reproducible, and interpretable benchmarking of...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Association for the Advancement of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9683726/ https://www.ncbi.nlm.nih.gov/pubmed/36417520 http://dx.doi.org/10.1126/sciadv.abl4747 |
_version_ | 1784835117215121408 |
---|---|
author | Orzechowski, Patryk Moore, Jason H. |
author_facet | Orzechowski, Patryk Moore, Jason H. |
author_sort | Orzechowski, Patryk |
collection | PubMed |
description | Understanding the strengths and weaknesses of machine learning (ML) algorithms is crucial to determine their scope of application. Here, we introduce the Diverse and Generative ML Benchmark (DIGEN), a collection of synthetic datasets for comprehensive, reproducible, and interpretable benchmarking of ML algorithms for classification of binary outcomes. The DIGEN resource consists of 40 mathematical functions that map continuous features to binary targets for creating synthetic datasets. These 40 functions were found using a heuristic algorithm designed to maximize the diversity of performance among multiple popular ML algorithms, thus providing a useful test suite for evaluating and comparing new methods. Access to the generative functions facilitates understanding of why a method performs poorly compared to other algorithms, thus providing ideas for improvement. |
format | Online Article Text |
id | pubmed-9683726 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Association for the Advancement of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-96837262022-12-05 Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers Orzechowski, Patryk Moore, Jason H. Sci Adv Social and Interdisciplinary Sciences Understanding the strengths and weaknesses of machine learning (ML) algorithms is crucial to determine their scope of application. Here, we introduce the Diverse and Generative ML Benchmark (DIGEN), a collection of synthetic datasets for comprehensive, reproducible, and interpretable benchmarking of ML algorithms for classification of binary outcomes. The DIGEN resource consists of 40 mathematical functions that map continuous features to binary targets for creating synthetic datasets. These 40 functions were found using a heuristic algorithm designed to maximize the diversity of performance among multiple popular ML algorithms, thus providing a useful test suite for evaluating and comparing new methods. Access to the generative functions facilitates understanding of why a method performs poorly compared to other algorithms, thus providing ideas for improvement. American Association for the Advancement of Science 2022-11-23 /pmc/articles/PMC9683726/ /pubmed/36417520 http://dx.doi.org/10.1126/sciadv.abl4747 Text en Copyright © 2022 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC). https://creativecommons.org/licenses/by-nc/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license (https://creativecommons.org/licenses/by-nc/4.0/) , which permits use, distribution, and reproduction in any medium, so long as the resultant use is not for commercial advantage and provided the original work is properly cited. |
spellingShingle | Social and Interdisciplinary Sciences Orzechowski, Patryk Moore, Jason H. Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers |
title | Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers |
title_full | Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers |
title_fullStr | Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers |
title_full_unstemmed | Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers |
title_short | Generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers |
title_sort | generative and reproducible benchmarks or comprehensive evaluation machine learning classifiers |
topic | Social and Interdisciplinary Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9683726/ https://www.ncbi.nlm.nih.gov/pubmed/36417520 http://dx.doi.org/10.1126/sciadv.abl4747 |
work_keys_str_mv | AT orzechowskipatryk generativeandreproduciblebenchmarksorcomprehensiveevaluationmachinelearningclassifiers AT moorejasonh generativeandreproduciblebenchmarksorcomprehensiveevaluationmachinelearningclassifiers |