Cargando…

Structured Verification of Machine Learning Models in Industrial Settings

The use of machine learning (ML) allows us to automate and scale the decision-making processes. The key to this automation is the development of ML models that generalize training data toward unseen data. Such models can become extremely versatile and powerful, which makes democratization of artific...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kaminwar, Sai Rahul, Goschenhofer, Jann, Thomas, Janek, Thon, Ingo, Bischl, Bernd
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Mary Ann Liebert, Inc., publishers 2023
Materias:	Original Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280203/ https://www.ncbi.nlm.nih.gov/pubmed/34978896 http://dx.doi.org/10.1089/big.2021.0112

_version_	1785060748869763072
author	Kaminwar, Sai Rahul Goschenhofer, Jann Thomas, Janek Thon, Ingo Bischl, Bernd
author_facet	Kaminwar, Sai Rahul Goschenhofer, Jann Thomas, Janek Thon, Ingo Bischl, Bernd
author_sort	Kaminwar, Sai Rahul
collection	PubMed
description	The use of machine learning (ML) allows us to automate and scale the decision-making processes. The key to this automation is the development of ML models that generalize training data toward unseen data. Such models can become extremely versatile and powerful, which makes democratization of artificial intelligence (AI) possible, that is, providing ML to non-ML experts such as software engineers or domain experts. Typically, automated ML (AutoML) is being referred to as a key step toward it. However, from our perspective, we believe that democratization of the verification process of ML systems is a larger and even more crucial challenge to achieve the democratization of AI. Currently, the process of ensuring that an ML model works as intended is unstructured. It is largely based on experience and domain knowledge that cannot be automated. The current approaches such as cross-validation or explainable AI are not enough to overcome the real challenges and are discussed extensively in this article. Arguing toward structured verification approaches, we discuss a set of guidelines to verify models, code, and data in each step of the ML lifecycle. These guidelines can help to reliably measure and select an optimal solution, besides minimizing the risk of bugs and undesired behavior in edge-cases.
format	Online Article Text
id	pubmed-10280203
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Mary Ann Liebert, Inc., publishers
record_format	MEDLINE/PubMed
spelling	pubmed-102802032023-06-21 Structured Verification of Machine Learning Models in Industrial Settings Kaminwar, Sai Rahul Goschenhofer, Jann Thomas, Janek Thon, Ingo Bischl, Bernd Big Data Original Articles The use of machine learning (ML) allows us to automate and scale the decision-making processes. The key to this automation is the development of ML models that generalize training data toward unseen data. Such models can become extremely versatile and powerful, which makes democratization of artificial intelligence (AI) possible, that is, providing ML to non-ML experts such as software engineers or domain experts. Typically, automated ML (AutoML) is being referred to as a key step toward it. However, from our perspective, we believe that democratization of the verification process of ML systems is a larger and even more crucial challenge to achieve the democratization of AI. Currently, the process of ensuring that an ML model works as intended is unstructured. It is largely based on experience and domain knowledge that cannot be automated. The current approaches such as cross-validation or explainable AI are not enough to overcome the real challenges and are discussed extensively in this article. Arguing toward structured verification approaches, we discuss a set of guidelines to verify models, code, and data in each step of the ML lifecycle. These guidelines can help to reliably measure and select an optimal solution, besides minimizing the risk of bugs and undesired behavior in edge-cases. Mary Ann Liebert, Inc., publishers 2023-06-01 2023-06-12 /pmc/articles/PMC10280203/ /pubmed/34978896 http://dx.doi.org/10.1089/big.2021.0112 Text en © Sai Rahul Kaminwar et al. 2023; Published by Mary Ann Liebert, Inc. https://creativecommons.org/licenses/by-nc/4.0/This Open Access article is distributed under the terms of the Creative Commons Attribution Noncommercial License [CC-BY-NC] (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ) which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and the source are cited.
spellingShingle	Original Articles Kaminwar, Sai Rahul Goschenhofer, Jann Thomas, Janek Thon, Ingo Bischl, Bernd Structured Verification of Machine Learning Models in Industrial Settings
title	Structured Verification of Machine Learning Models in Industrial Settings
title_full	Structured Verification of Machine Learning Models in Industrial Settings
title_fullStr	Structured Verification of Machine Learning Models in Industrial Settings
title_full_unstemmed	Structured Verification of Machine Learning Models in Industrial Settings
title_short	Structured Verification of Machine Learning Models in Industrial Settings
title_sort	structured verification of machine learning models in industrial settings
topic	Original Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280203/ https://www.ncbi.nlm.nih.gov/pubmed/34978896 http://dx.doi.org/10.1089/big.2021.0112
work_keys_str_mv	AT kaminwarsairahul structuredverificationofmachinelearningmodelsinindustrialsettings AT goschenhoferjann structuredverificationofmachinelearningmodelsinindustrialsettings AT thomasjanek structuredverificationofmachinelearningmodelsinindustrialsettings AT thoningo structuredverificationofmachinelearningmodelsinindustrialsettings AT bischlbernd structuredverificationofmachinelearningmodelsinindustrialsettings

Structured Verification of Machine Learning Models in Industrial Settings

Ejemplares similares