Cargando…

A training strategy for hybrid models to break the curse of dimensionality

Mechanistic/data-driven hybrid modeling is a key approach when the mechanistic details of the processes at hand are not sufficiently well understood, but also inferring a model purely from data is too complex. By the integration of first principles into a data-driven approach, hybrid modeling promis...

Descripción completa

Detalles Bibliográficos
Autores principales:	E. Samadi, Moein, Kiefer, Sandra, Fritsch, Sebastian Johaness, Bickenbach, Johannes, Schuppert, Andreas
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9477345/ https://www.ncbi.nlm.nih.gov/pubmed/36107916 http://dx.doi.org/10.1371/journal.pone.0274569

_version_	1784790340338712576
author	E. Samadi, Moein Kiefer, Sandra Fritsch, Sebastian Johaness Bickenbach, Johannes Schuppert, Andreas
author_facet	E. Samadi, Moein Kiefer, Sandra Fritsch, Sebastian Johaness Bickenbach, Johannes Schuppert, Andreas
author_sort	E. Samadi, Moein
collection	PubMed
description	Mechanistic/data-driven hybrid modeling is a key approach when the mechanistic details of the processes at hand are not sufficiently well understood, but also inferring a model purely from data is too complex. By the integration of first principles into a data-driven approach, hybrid modeling promises a feasible data demand alongside extrapolation. In this work, we introduce a learning strategy for tree-structured hybrid models to perform a binary classification task. Given a set of binary labeled data, the challenge is to use them to develop a model that accurately assesses labels of new unlabeled data. Our strategy employs graph-theoretic methods to analyze the data and deduce a function that maps input features to output labels. Our focus here is on data sets represented by binary features in which the label assessment of unlabeled data points is always extrapolation. Our strategy shows the existence of small sets of data points within given binary data for which knowing the labels allows for extrapolation to the entire valid input space. An implementation of our strategy yields a notable reduction of training-data demand in a binary classification task compared with different supervised machine learning algorithms. As an application, we have fitted a tree-structured hybrid model to the vital status of a cohort of COVID-19 patients requiring intensive-care unit treatment and mechanical ventilation. Our learning strategy yields the existence of patient cohorts for whom knowing the vital status enables extrapolation to the entire valid input space of the developed hybrid model.
format	Online Article Text
id	pubmed-9477345
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-94773452022-09-16 A training strategy for hybrid models to break the curse of dimensionality E. Samadi, Moein Kiefer, Sandra Fritsch, Sebastian Johaness Bickenbach, Johannes Schuppert, Andreas PLoS One Research Article Mechanistic/data-driven hybrid modeling is a key approach when the mechanistic details of the processes at hand are not sufficiently well understood, but also inferring a model purely from data is too complex. By the integration of first principles into a data-driven approach, hybrid modeling promises a feasible data demand alongside extrapolation. In this work, we introduce a learning strategy for tree-structured hybrid models to perform a binary classification task. Given a set of binary labeled data, the challenge is to use them to develop a model that accurately assesses labels of new unlabeled data. Our strategy employs graph-theoretic methods to analyze the data and deduce a function that maps input features to output labels. Our focus here is on data sets represented by binary features in which the label assessment of unlabeled data points is always extrapolation. Our strategy shows the existence of small sets of data points within given binary data for which knowing the labels allows for extrapolation to the entire valid input space. An implementation of our strategy yields a notable reduction of training-data demand in a binary classification task compared with different supervised machine learning algorithms. As an application, we have fitted a tree-structured hybrid model to the vital status of a cohort of COVID-19 patients requiring intensive-care unit treatment and mechanical ventilation. Our learning strategy yields the existence of patient cohorts for whom knowing the vital status enables extrapolation to the entire valid input space of the developed hybrid model. Public Library of Science 2022-09-15 /pmc/articles/PMC9477345/ /pubmed/36107916 http://dx.doi.org/10.1371/journal.pone.0274569 Text en © 2022 E. Samadi et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article E. Samadi, Moein Kiefer, Sandra Fritsch, Sebastian Johaness Bickenbach, Johannes Schuppert, Andreas A training strategy for hybrid models to break the curse of dimensionality
title	A training strategy for hybrid models to break the curse of dimensionality
title_full	A training strategy for hybrid models to break the curse of dimensionality
title_fullStr	A training strategy for hybrid models to break the curse of dimensionality
title_full_unstemmed	A training strategy for hybrid models to break the curse of dimensionality
title_short	A training strategy for hybrid models to break the curse of dimensionality
title_sort	training strategy for hybrid models to break the curse of dimensionality
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9477345/ https://www.ncbi.nlm.nih.gov/pubmed/36107916 http://dx.doi.org/10.1371/journal.pone.0274569
work_keys_str_mv	AT esamadimoein atrainingstrategyforhybridmodelstobreakthecurseofdimensionality AT kiefersandra atrainingstrategyforhybridmodelstobreakthecurseofdimensionality AT fritschsebastianjohaness atrainingstrategyforhybridmodelstobreakthecurseofdimensionality AT bickenbachjohannes atrainingstrategyforhybridmodelstobreakthecurseofdimensionality AT schuppertandreas atrainingstrategyforhybridmodelstobreakthecurseofdimensionality AT esamadimoein trainingstrategyforhybridmodelstobreakthecurseofdimensionality AT kiefersandra trainingstrategyforhybridmodelstobreakthecurseofdimensionality AT fritschsebastianjohaness trainingstrategyforhybridmodelstobreakthecurseofdimensionality AT bickenbachjohannes trainingstrategyforhybridmodelstobreakthecurseofdimensionality AT schuppertandreas trainingstrategyforhybridmodelstobreakthecurseofdimensionality

A training strategy for hybrid models to break the curse of dimensionality

Ejemplares similares