Cargando…

Explaining the physics of transfer learning in data-driven turbulence modeling

Transfer learning (TL), which enables neural networks (NNs) to generalize out-of-distribution via targeted re-training, is becoming a powerful tool in scientific machine learning (ML) applications such as weather/climate prediction and turbulence modeling. Effective TL requires knowing (1) how to re...

Descripción completa

Detalles Bibliográficos
Autores principales:	Subel, Adam, Guan, Yifei, Chattopadhyay, Ashesh, Hassanzadeh, Pedram
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2023
Materias:	Physical Sciences and Engineering
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9991455/ https://www.ncbi.nlm.nih.gov/pubmed/36896127 http://dx.doi.org/10.1093/pnasnexus/pgad015

_version_	1784902158828699648
author	Subel, Adam Guan, Yifei Chattopadhyay, Ashesh Hassanzadeh, Pedram
author_facet	Subel, Adam Guan, Yifei Chattopadhyay, Ashesh Hassanzadeh, Pedram
author_sort	Subel, Adam
collection	PubMed
description	Transfer learning (TL), which enables neural networks (NNs) to generalize out-of-distribution via targeted re-training, is becoming a powerful tool in scientific machine learning (ML) applications such as weather/climate prediction and turbulence modeling. Effective TL requires knowing (1) how to re-train NNs? and (2) what physics are learned during TL? Here, we present novel analyses and a framework addressing (1)–(2) for a broad range of multi-scale, nonlinear, dynamical systems. Our approach combines spectral (e.g. Fourier) analyses of such systems with spectral analyses of convolutional NNs, revealing physical connections between the systems and what the NN learns (a combination of low-, high-, band-pass filters and Gabor filters). Integrating these analyses, we introduce a general framework that identifies the best re-training procedure for a given problem based on physics and NN theory. As test case, we explain the physics of TL in subgrid-scale modeling of several setups of 2D turbulence. Furthermore, these analyses show that in these cases, the shallowest convolution layers are the best to re-train, which is consistent with our physics-guided framework but is against the common wisdom guiding TL in the ML literature. Our work provides a new avenue for optimal and explainable TL, and a step toward fully explainable NNs, for wide-ranging applications in science and engineering, such as climate change modeling.
format	Online Article Text
id	pubmed-9991455
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-99914552023-03-08 Explaining the physics of transfer learning in data-driven turbulence modeling Subel, Adam Guan, Yifei Chattopadhyay, Ashesh Hassanzadeh, Pedram PNAS Nexus Physical Sciences and Engineering Transfer learning (TL), which enables neural networks (NNs) to generalize out-of-distribution via targeted re-training, is becoming a powerful tool in scientific machine learning (ML) applications such as weather/climate prediction and turbulence modeling. Effective TL requires knowing (1) how to re-train NNs? and (2) what physics are learned during TL? Here, we present novel analyses and a framework addressing (1)–(2) for a broad range of multi-scale, nonlinear, dynamical systems. Our approach combines spectral (e.g. Fourier) analyses of such systems with spectral analyses of convolutional NNs, revealing physical connections between the systems and what the NN learns (a combination of low-, high-, band-pass filters and Gabor filters). Integrating these analyses, we introduce a general framework that identifies the best re-training procedure for a given problem based on physics and NN theory. As test case, we explain the physics of TL in subgrid-scale modeling of several setups of 2D turbulence. Furthermore, these analyses show that in these cases, the shallowest convolution layers are the best to re-train, which is consistent with our physics-guided framework but is against the common wisdom guiding TL in the ML literature. Our work provides a new avenue for optimal and explainable TL, and a step toward fully explainable NNs, for wide-ranging applications in science and engineering, such as climate change modeling. Oxford University Press 2023-01-23 /pmc/articles/PMC9991455/ /pubmed/36896127 http://dx.doi.org/10.1093/pnasnexus/pgad015 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of National Academy of Sciences. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Physical Sciences and Engineering Subel, Adam Guan, Yifei Chattopadhyay, Ashesh Hassanzadeh, Pedram Explaining the physics of transfer learning in data-driven turbulence modeling
title	Explaining the physics of transfer learning in data-driven turbulence modeling
title_full	Explaining the physics of transfer learning in data-driven turbulence modeling
title_fullStr	Explaining the physics of transfer learning in data-driven turbulence modeling
title_full_unstemmed	Explaining the physics of transfer learning in data-driven turbulence modeling
title_short	Explaining the physics of transfer learning in data-driven turbulence modeling
title_sort	explaining the physics of transfer learning in data-driven turbulence modeling
topic	Physical Sciences and Engineering
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9991455/ https://www.ncbi.nlm.nih.gov/pubmed/36896127 http://dx.doi.org/10.1093/pnasnexus/pgad015
work_keys_str_mv	AT subeladam explainingthephysicsoftransferlearningindatadriventurbulencemodeling AT guanyifei explainingthephysicsoftransferlearningindatadriventurbulencemodeling AT chattopadhyayashesh explainingthephysicsoftransferlearningindatadriventurbulencemodeling AT hassanzadehpedram explainingthephysicsoftransferlearningindatadriventurbulencemodeling

Explaining the physics of transfer learning in data-driven turbulence modeling

Ejemplares similares