Cargando…

Explaining the physics of transfer learning in data-driven turbulence modeling

Transfer learning (TL), which enables neural networks (NNs) to generalize out-of-distribution via targeted re-training, is becoming a powerful tool in scientific machine learning (ML) applications such as weather/climate prediction and turbulence modeling. Effective TL requires knowing (1) how to re...

Descripción completa

Detalles Bibliográficos
Autores principales: Subel, Adam, Guan, Yifei, Chattopadhyay, Ashesh, Hassanzadeh, Pedram
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9991455/
https://www.ncbi.nlm.nih.gov/pubmed/36896127
http://dx.doi.org/10.1093/pnasnexus/pgad015
_version_ 1784902158828699648
author Subel, Adam
Guan, Yifei
Chattopadhyay, Ashesh
Hassanzadeh, Pedram
author_facet Subel, Adam
Guan, Yifei
Chattopadhyay, Ashesh
Hassanzadeh, Pedram
author_sort Subel, Adam
collection PubMed
description Transfer learning (TL), which enables neural networks (NNs) to generalize out-of-distribution via targeted re-training, is becoming a powerful tool in scientific machine learning (ML) applications such as weather/climate prediction and turbulence modeling. Effective TL requires knowing (1) how to re-train NNs? and (2) what physics are learned during TL? Here, we present novel analyses and a framework addressing (1)–(2) for a broad range of multi-scale, nonlinear, dynamical systems. Our approach combines spectral (e.g. Fourier) analyses of such systems with spectral analyses of convolutional NNs, revealing physical connections between the systems and what the NN learns (a combination of low-, high-, band-pass filters and Gabor filters). Integrating these analyses, we introduce a general framework that identifies the best re-training procedure for a given problem based on physics and NN theory. As test case, we explain the physics of TL in subgrid-scale modeling of several setups of 2D turbulence. Furthermore, these analyses show that in these cases, the shallowest convolution layers are the best to re-train, which is consistent with our physics-guided framework but is against the common wisdom guiding TL in the ML literature. Our work provides a new avenue for optimal and explainable TL, and a step toward fully explainable NNs, for wide-ranging applications in science and engineering, such as climate change modeling.
format Online
Article
Text
id pubmed-9991455
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-99914552023-03-08 Explaining the physics of transfer learning in data-driven turbulence modeling Subel, Adam Guan, Yifei Chattopadhyay, Ashesh Hassanzadeh, Pedram PNAS Nexus Physical Sciences and Engineering Transfer learning (TL), which enables neural networks (NNs) to generalize out-of-distribution via targeted re-training, is becoming a powerful tool in scientific machine learning (ML) applications such as weather/climate prediction and turbulence modeling. Effective TL requires knowing (1) how to re-train NNs? and (2) what physics are learned during TL? Here, we present novel analyses and a framework addressing (1)–(2) for a broad range of multi-scale, nonlinear, dynamical systems. Our approach combines spectral (e.g. Fourier) analyses of such systems with spectral analyses of convolutional NNs, revealing physical connections between the systems and what the NN learns (a combination of low-, high-, band-pass filters and Gabor filters). Integrating these analyses, we introduce a general framework that identifies the best re-training procedure for a given problem based on physics and NN theory. As test case, we explain the physics of TL in subgrid-scale modeling of several setups of 2D turbulence. Furthermore, these analyses show that in these cases, the shallowest convolution layers are the best to re-train, which is consistent with our physics-guided framework but is against the common wisdom guiding TL in the ML literature. Our work provides a new avenue for optimal and explainable TL, and a step toward fully explainable NNs, for wide-ranging applications in science and engineering, such as climate change modeling. Oxford University Press 2023-01-23 /pmc/articles/PMC9991455/ /pubmed/36896127 http://dx.doi.org/10.1093/pnasnexus/pgad015 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of National Academy of Sciences. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Physical Sciences and Engineering
Subel, Adam
Guan, Yifei
Chattopadhyay, Ashesh
Hassanzadeh, Pedram
Explaining the physics of transfer learning in data-driven turbulence modeling
title Explaining the physics of transfer learning in data-driven turbulence modeling
title_full Explaining the physics of transfer learning in data-driven turbulence modeling
title_fullStr Explaining the physics of transfer learning in data-driven turbulence modeling
title_full_unstemmed Explaining the physics of transfer learning in data-driven turbulence modeling
title_short Explaining the physics of transfer learning in data-driven turbulence modeling
title_sort explaining the physics of transfer learning in data-driven turbulence modeling
topic Physical Sciences and Engineering
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9991455/
https://www.ncbi.nlm.nih.gov/pubmed/36896127
http://dx.doi.org/10.1093/pnasnexus/pgad015
work_keys_str_mv AT subeladam explainingthephysicsoftransferlearningindatadriventurbulencemodeling
AT guanyifei explainingthephysicsoftransferlearningindatadriventurbulencemodeling
AT chattopadhyayashesh explainingthephysicsoftransferlearningindatadriventurbulencemodeling
AT hassanzadehpedram explainingthephysicsoftransferlearningindatadriventurbulencemodeling