Cargando…

Domain-driven models yield better predictions at lower cost than reservoir computers in Lorenz systems

Recent advances in computing algorithms and hardware have rekindled interest in developing high-accuracy, low-cost surrogate models for simulating physical systems. The idea is to replace expensive numerical integration of complex coupled partial differential equations at fine time scales performed...

Descripción completa

Detalles Bibliográficos
Autores principales: Pyle, Ryan, Jovanovic, Nikola, Subramanian, Devika, Palem, Krishna V., Patel, Ankit B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7898131/
https://www.ncbi.nlm.nih.gov/pubmed/33583272
http://dx.doi.org/10.1098/rsta.2020.0246
_version_ 1783653807888007168
author Pyle, Ryan
Jovanovic, Nikola
Subramanian, Devika
Palem, Krishna V.
Patel, Ankit B.
author_facet Pyle, Ryan
Jovanovic, Nikola
Subramanian, Devika
Palem, Krishna V.
Patel, Ankit B.
author_sort Pyle, Ryan
collection PubMed
description Recent advances in computing algorithms and hardware have rekindled interest in developing high-accuracy, low-cost surrogate models for simulating physical systems. The idea is to replace expensive numerical integration of complex coupled partial differential equations at fine time scales performed on supercomputers, with machine-learned surrogates that efficiently and accurately forecast future system states using data sampled from the underlying system. One particularly popular technique being explored within the weather and climate modelling community is the echo state network (ESN), an attractive alternative to other well-known deep learning architectures. Using the classical Lorenz 63 system, and the three tier multi-scale Lorenz 96 system (Thornes T, Duben P, Palmer T. 2017 Q. J. R. Meteorol. Soc. 143, 897–908. (doi:10.1002/qj.2974)) as benchmarks, we realize that previously studied state-of-the-art ESNs operate in two distinct regimes, corresponding to low and high spectral radius (LSR/HSR) for the sparse, randomly generated, reservoir recurrence matrix. Using knowledge of the mathematical structure of the Lorenz systems along with systematic ablation and hyperparameter sensitivity analyses, we show that state-of-the-art LSR-ESNs reduce to a polynomial regression model which we call Domain-Driven Regularized Regression (D2R2). Interestingly, D2R2 is a generalization of the well-known SINDy algorithm (Brunton SL, Proctor JL, Kutz JN. 2016 Proc. Natl Acad. Sci. USA 113, 3932–3937. (doi:10.1073/pnas.1517384113)). We also show experimentally that LSR-ESNs (Chattopadhyay A, Hassanzadeh P, Subramanian D. 2019 (http://arxiv.org/abs/1906.08829)) outperform HSR ESNs (Pathak J, Hunt B, Girvan M, Lu Z, Ott E. 2018 Phys. Rev. Lett. 120, 024102. (doi:10.1103/PhysRevLett.120.024102)) while D2R2 dominates both approaches. A significant goal in constructing surrogates is to cope with barriers to scaling in weather prediction and simulation of dynamical systems that are imposed by time and energy consumption in supercomputers. Inexact computing has emerged as a novel approach to helping with scaling. In this paper, we evaluate the performance of three models (LSR-ESN, HSR-ESN and D2R2) by varying the precision or word size of the computation as our inexactness-controlling parameter. For precisions of 64, 32 and 16 bits, we show that, surprisingly, the least expensive D2R2 method yields the most robust results and the greatest savings compared to ESNs. Specifically, D2R2 achieves 68 × in computational savings, with an additional 2 × if precision reductions are also employed, outperforming ESN variants by a large margin. This article is part of the theme issue ‘Machine learning for weather and climate modelling’.
format Online
Article
Text
id pubmed-7898131
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Royal Society Publishing
record_format MEDLINE/PubMed
spelling pubmed-78981312021-03-04 Domain-driven models yield better predictions at lower cost than reservoir computers in Lorenz systems Pyle, Ryan Jovanovic, Nikola Subramanian, Devika Palem, Krishna V. Patel, Ankit B. Philos Trans A Math Phys Eng Sci Articles Recent advances in computing algorithms and hardware have rekindled interest in developing high-accuracy, low-cost surrogate models for simulating physical systems. The idea is to replace expensive numerical integration of complex coupled partial differential equations at fine time scales performed on supercomputers, with machine-learned surrogates that efficiently and accurately forecast future system states using data sampled from the underlying system. One particularly popular technique being explored within the weather and climate modelling community is the echo state network (ESN), an attractive alternative to other well-known deep learning architectures. Using the classical Lorenz 63 system, and the three tier multi-scale Lorenz 96 system (Thornes T, Duben P, Palmer T. 2017 Q. J. R. Meteorol. Soc. 143, 897–908. (doi:10.1002/qj.2974)) as benchmarks, we realize that previously studied state-of-the-art ESNs operate in two distinct regimes, corresponding to low and high spectral radius (LSR/HSR) for the sparse, randomly generated, reservoir recurrence matrix. Using knowledge of the mathematical structure of the Lorenz systems along with systematic ablation and hyperparameter sensitivity analyses, we show that state-of-the-art LSR-ESNs reduce to a polynomial regression model which we call Domain-Driven Regularized Regression (D2R2). Interestingly, D2R2 is a generalization of the well-known SINDy algorithm (Brunton SL, Proctor JL, Kutz JN. 2016 Proc. Natl Acad. Sci. USA 113, 3932–3937. (doi:10.1073/pnas.1517384113)). We also show experimentally that LSR-ESNs (Chattopadhyay A, Hassanzadeh P, Subramanian D. 2019 (http://arxiv.org/abs/1906.08829)) outperform HSR ESNs (Pathak J, Hunt B, Girvan M, Lu Z, Ott E. 2018 Phys. Rev. Lett. 120, 024102. (doi:10.1103/PhysRevLett.120.024102)) while D2R2 dominates both approaches. A significant goal in constructing surrogates is to cope with barriers to scaling in weather prediction and simulation of dynamical systems that are imposed by time and energy consumption in supercomputers. Inexact computing has emerged as a novel approach to helping with scaling. In this paper, we evaluate the performance of three models (LSR-ESN, HSR-ESN and D2R2) by varying the precision or word size of the computation as our inexactness-controlling parameter. For precisions of 64, 32 and 16 bits, we show that, surprisingly, the least expensive D2R2 method yields the most robust results and the greatest savings compared to ESNs. Specifically, D2R2 achieves 68 × in computational savings, with an additional 2 × if precision reductions are also employed, outperforming ESN variants by a large margin. This article is part of the theme issue ‘Machine learning for weather and climate modelling’. The Royal Society Publishing 2021-04-05 2021-02-15 /pmc/articles/PMC7898131/ /pubmed/33583272 http://dx.doi.org/10.1098/rsta.2020.0246 Text en © 2021 The Authors. http://creativecommons.org/licenses/by/4.0/ http://creativecommons.org/licenses/by/4.0/http://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.
spellingShingle Articles
Pyle, Ryan
Jovanovic, Nikola
Subramanian, Devika
Palem, Krishna V.
Patel, Ankit B.
Domain-driven models yield better predictions at lower cost than reservoir computers in Lorenz systems
title Domain-driven models yield better predictions at lower cost than reservoir computers in Lorenz systems
title_full Domain-driven models yield better predictions at lower cost than reservoir computers in Lorenz systems
title_fullStr Domain-driven models yield better predictions at lower cost than reservoir computers in Lorenz systems
title_full_unstemmed Domain-driven models yield better predictions at lower cost than reservoir computers in Lorenz systems
title_short Domain-driven models yield better predictions at lower cost than reservoir computers in Lorenz systems
title_sort domain-driven models yield better predictions at lower cost than reservoir computers in lorenz systems
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7898131/
https://www.ncbi.nlm.nih.gov/pubmed/33583272
http://dx.doi.org/10.1098/rsta.2020.0246
work_keys_str_mv AT pyleryan domaindrivenmodelsyieldbetterpredictionsatlowercostthanreservoircomputersinlorenzsystems
AT jovanovicnikola domaindrivenmodelsyieldbetterpredictionsatlowercostthanreservoircomputersinlorenzsystems
AT subramaniandevika domaindrivenmodelsyieldbetterpredictionsatlowercostthanreservoircomputersinlorenzsystems
AT palemkrishnav domaindrivenmodelsyieldbetterpredictionsatlowercostthanreservoircomputersinlorenzsystems
AT patelankitb domaindrivenmodelsyieldbetterpredictionsatlowercostthanreservoircomputersinlorenzsystems