Cargando…

A Model Falsification Approach to Learning in Non-Stationary Environments for Experimental Design

The application of data driven machine learning and advanced statistical tools to complex physics experiments, such as Magnetic Confinement Nuclear Fusion, can be problematic, due the varying conditions of the systems to be studied. In particular, new experiments have to be planned in unexplored reg...

Descripción completa

Detalles Bibliográficos
Autores principales:	Murari, Andrea, Lungaroni, Michele, Peluso, Emmanuele, Craciunescu, Teddy, Gelfusa, Michela
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2019
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6884580/ https://www.ncbi.nlm.nih.gov/pubmed/31784604 http://dx.doi.org/10.1038/s41598-019-54145-7

_version_	1783474579949223936
author	Murari, Andrea Lungaroni, Michele Peluso, Emmanuele Craciunescu, Teddy Gelfusa, Michela
author_facet	Murari, Andrea Lungaroni, Michele Peluso, Emmanuele Craciunescu, Teddy Gelfusa, Michela
author_sort	Murari, Andrea
collection	PubMed
description	The application of data driven machine learning and advanced statistical tools to complex physics experiments, such as Magnetic Confinement Nuclear Fusion, can be problematic, due the varying conditions of the systems to be studied. In particular, new experiments have to be planned in unexplored regions of the operational space. As a consequence, care must be taken because the input quantities used to train and test the performance of the analysis tools are not necessarily sampled by the same probability distribution as in the final applications. The regressors and dependent variables cannot therefore be assumed to verify the i.i.d. (independent and identical distribution) hypothesis and learning has therefore to take place under non stationary conditions. In the present paper, a new data driven methodology is proposed to guide planning of experiments, to explore the operational space and to optimise performance. The approach is based on the falsification of existing models. The deployment of Symbolic Regression via Genetic Programming to the available data is used to identify a set of candidate models, using the method of the Pareto Frontier. The confidence intervals for the predictions of such models are then used to find the best region of the parameter space for their falsification, where the next set of experiments can be most profitably carried out. Extensive numerical tests and applications to the scaling laws in Tokamaks prove the viability of the proposed methodology.
format	Online Article Text
id	pubmed-6884580
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-68845802019-12-06 A Model Falsification Approach to Learning in Non-Stationary Environments for Experimental Design Murari, Andrea Lungaroni, Michele Peluso, Emmanuele Craciunescu, Teddy Gelfusa, Michela Sci Rep Article The application of data driven machine learning and advanced statistical tools to complex physics experiments, such as Magnetic Confinement Nuclear Fusion, can be problematic, due the varying conditions of the systems to be studied. In particular, new experiments have to be planned in unexplored regions of the operational space. As a consequence, care must be taken because the input quantities used to train and test the performance of the analysis tools are not necessarily sampled by the same probability distribution as in the final applications. The regressors and dependent variables cannot therefore be assumed to verify the i.i.d. (independent and identical distribution) hypothesis and learning has therefore to take place under non stationary conditions. In the present paper, a new data driven methodology is proposed to guide planning of experiments, to explore the operational space and to optimise performance. The approach is based on the falsification of existing models. The deployment of Symbolic Regression via Genetic Programming to the available data is used to identify a set of candidate models, using the method of the Pareto Frontier. The confidence intervals for the predictions of such models are then used to find the best region of the parameter space for their falsification, where the next set of experiments can be most profitably carried out. Extensive numerical tests and applications to the scaling laws in Tokamaks prove the viability of the proposed methodology. Nature Publishing Group UK 2019-11-29 /pmc/articles/PMC6884580/ /pubmed/31784604 http://dx.doi.org/10.1038/s41598-019-54145-7 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle	Article Murari, Andrea Lungaroni, Michele Peluso, Emmanuele Craciunescu, Teddy Gelfusa, Michela A Model Falsification Approach to Learning in Non-Stationary Environments for Experimental Design
title	A Model Falsification Approach to Learning in Non-Stationary Environments for Experimental Design
title_full	A Model Falsification Approach to Learning in Non-Stationary Environments for Experimental Design
title_fullStr	A Model Falsification Approach to Learning in Non-Stationary Environments for Experimental Design
title_full_unstemmed	A Model Falsification Approach to Learning in Non-Stationary Environments for Experimental Design
title_short	A Model Falsification Approach to Learning in Non-Stationary Environments for Experimental Design
title_sort	model falsification approach to learning in non-stationary environments for experimental design
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6884580/ https://www.ncbi.nlm.nih.gov/pubmed/31784604 http://dx.doi.org/10.1038/s41598-019-54145-7
work_keys_str_mv	AT murariandrea amodelfalsificationapproachtolearninginnonstationaryenvironmentsforexperimentaldesign AT lungaronimichele amodelfalsificationapproachtolearninginnonstationaryenvironmentsforexperimentaldesign AT pelusoemmanuele amodelfalsificationapproachtolearninginnonstationaryenvironmentsforexperimentaldesign AT craciunescuteddy amodelfalsificationapproachtolearninginnonstationaryenvironmentsforexperimentaldesign AT gelfusamichela amodelfalsificationapproachtolearninginnonstationaryenvironmentsforexperimentaldesign AT murariandrea modelfalsificationapproachtolearninginnonstationaryenvironmentsforexperimentaldesign AT lungaronimichele modelfalsificationapproachtolearninginnonstationaryenvironmentsforexperimentaldesign AT pelusoemmanuele modelfalsificationapproachtolearninginnonstationaryenvironmentsforexperimentaldesign AT craciunescuteddy modelfalsificationapproachtolearninginnonstationaryenvironmentsforexperimentaldesign AT gelfusamichela modelfalsificationapproachtolearninginnonstationaryenvironmentsforexperimentaldesign

A Model Falsification Approach to Learning in Non-Stationary Environments for Experimental Design

Ejemplares similares