Data-driven Equations Discovery

2024
Chekroun, Mickaël D., H. Liu, K. Srinivasan, and James C. McWilliams. 2024. “The High-Frequency and Rare Events Barriers to Neural Closures of Atmospheric Dynamics.” Journal of Physics: Complexity 5: 025004. Publisher's version Abstract
Recent years have seen a surge in interest for leveraging neural networks to parameterize small-scale or fast processes in climate and turbulence models. In this short paper, we point out two fundamental issues in this endeavor. The first concerns the difficulties neural networks may experience in capturing rare events due to limitations in how data is sampled. The second arises from the inherent multiscale nature of these systems. They combine high-frequency components (like inertia-gravity waves) with slower, evolving processes (geostrophic motion). This multiscale nature creates a significant hurdle for neural network closures. To illustrate these challenges, we focus on the atmospheric 1980 Lorenz model, a simplified version of the Primitive Equations that drive climate models. This model serves as a compelling example because it captures the essence of these difficulties.
Srinivasan, Kaushik, Mickaël D. Chekroun, and James C. McWilliams. 2024. “Turbulence closure with small, local neural networks: Forced two-dimensional and β-plane flows.” Journal of Advances in Modeling Earth Systems 16 (4): e2023MS003795. Publisher's link Abstract

We parameterize sub-grid scale (SGS) fluxes in sinusoidally forced two-dimensional turbulence on the β-plane at high Reynolds numbers (Re ∼25,000) using simple 2-layer convolutional neural networks (CNN) having only O(1000) parameters, two orders of magnitude smaller than recent studies employing deeper CNNs with 8–10 layers; we obtain stable, accurate, and long-term online or a posteriori solutions at 16× downscaling factors. Our methodology significantly improves training efficiency and speed of online large eddy simulations runs, while offering insights into the physics of closure in such turbulent flows. Our approach benefits from extensive hyperparameter searching in learning rate and weight decay coefficient space, as well as the use of cyclical learning rate annealing, which leads to more robust and accurate online solutions compared to fixed learning rates. Our CNNs use either the coarse velocity or the vorticity and strain fields as inputs, and output the two components of the deviatoric stress tensor, Sd. We minimize a loss between the SGS vorticity flux divergence (computed from the high-resolution solver) and that obtained from the CNN-modeled Sd, without requiring energy or enstrophy preserving constraints. The success of shallow CNNs in accurately parameterizing this class of turbulent flows implies that the SGS stresses have a weak non-local dependence on coarse fields; it also aligns with our physical conception that small-scales are locally controlled by larger scales such as vortices and their strained filaments. Furthermore, 2-layer CNN-parameterizations are more likely to be interpretable.

2023
Lucarini, Valerio, and Mickaël D. Chekroun. 2023. “Theoretical tools for understanding the climate crisis from Hasselmann’s programme and beyond.” Nature Review Physics 5: 744-765 . Publisher's link Abstract
Klaus Hasselmann’s revolutionary intuition in climate science was to use the stochasticity associated with fast weather processes to probe the slow dynamics of the climate system. Doing so led to fundamentally new ways to study the response of climate models to perturbations, and to perform detection and attribution for climate change signals. Hasselmann’s programme has been extremely influential in climate science and beyond. In this Perspective, we first summarize the main aspects of such a programme using modern concepts and tools of statistical physics and applied mathematics. We then provide an overview of some promising scientific perspectives that might clarify the science behind the climate crisis and that stem from Hasselmann’s ideas. We show how to perform rigorous and data-driven model reduction by constructing parameterizations in systems that do not necessarily feature a timescale separation between unresolved and resolved processes. We outline a general theoretical framework for explaining the relationship between climate variability and climate change, and for performing climate change projections. This framework enables us seamlessly to explain some key general aspects of climatic tipping points. Finally, we show that response theory provides a solid framework supporting optimal fingerprinting methods for detection and attribution.
2021
Chekroun, Mickaël D., Honghu Liu, and James C. McWilliams. 2021. “Stochastic rectification of fast oscillations on slow manifold closures.” Proc. Natl. Acad. Sci. 118 (48): E2113650118. Publisher's Version Abstract
The problems of identifying the slow component (e.g., for weather forecast initialization) and of characterizing slow–fast interactions are central to geophysical fluid dynamics. In this study, the related rectification problem of slow manifold closures is addressed when breakdown of slow-to-fast scales deterministic parameterizations occurs due to explosive emergence of fast oscillations on the slow, geostrophic motion. For such regimes, it is shown on the Lorenz 80 model that if 1) the underlying manifold provides a good approximation of the optimal nonlinear parameterization that averages out the fast variables and 2) the residual dynamics off this manifold is mainly orthogonal to it, then no memory terms are required in the Mori–Zwanzig full closure. Instead, the noise term is key to resolve, and is shown to be, in this case, well modeled by a state-independent noise, obtained by means of networks of stochastic nonlinear oscillators. This stochastic parameterization allows, in turn, for rectifying the momentum-balanced slow manifold, and for accurate recovery of the multiscale dynamics. The approach is promising to be further applied to the closure of other more complex slow–fast systems, in strongly coupled regimes.
Santos Gutiérrez, Manuel, Valerio Lucarini, Mickaël D. Chekroun, and Michael Ghil. 2021. “Reduced-order models for coupled dynamical systems: Data-driven methods and the Koopman operator.” Chaos 31: 053116. Publisher's Version Abstract
Providing efficient and accurate parameterizations for model reduction is a key goal in many areas of science and technology. Here, we present a strong link between data-driven and theoretical approaches to achieving this goal. Formal perturbation expansions of the Koopman operator allow us to derive general stochastic parameterizations of weakly coupled dynamical systems. Such parameterizations yield a set of stochastic integrodifferential equations with explicit noise and memory kernel formulas to describe the effects of unresolved variables. We show that the perturbation expansions involved need not be truncated when the coupling is additive. The unwieldy integrodifferential equations can be recast as a simpler multilevel Markovian model, and we establish an intuitive connection with a generalized Langevin equation. This connection helps setting up a parallelism between the top-down, equation-based methodology herein and the well-established empirical model reduction (EMR) methodology that has been shown to provide efficient dynamical closures to partially observed systems. Hence, our findings, on the one hand, support the physical basis and robustness of the EMR methodology and, on the other hand, illustrate the practical relevance of the perturbative expansion used for deriving the parameterizations.
Parameterizations aim to reduce the complexity of high-dimensional dynamical systems. Here, a theory-based and a data-driven approach for the parameterization of coupled systems are compared, showing that both yield the same stochastic multilevel structure. The results provide very strong support to the use of empirical methods in model reduction and clarify the practical relevance of the proposed theoretical framework.
2018
Kondrashov, Dmitri, Mickaël D. Chekroun, and Michael Ghil. 2018. “Data-adaptive harmonic decomposition and prediction of Arctic sea ice extent.” Dynamics and Statistics of the Climate System 3 (1): 1. Publisher's Version Abstract

Decline in the Arctic sea ice extent (SIE) is an area of active scientific research with profound socio-economic implications. Of particular interest are reliable methods for SIE forecasting on subseasonal time scales, in particular from early summer into fall, when sea ice coverage in the Arctic reaches its minimum. Here, we apply the recent data-adaptive harmonic (DAH) technique of Chekroun and Kondrashov, (2017), Chaos, 27 for the description, modeling and prediction of the Multisensor Analyzed Sea Ice Extent (MASIE, 2006–2016) data set. The DAH decomposition of MASIE identifies narrowband, spatio-temporal data-adaptive modes over four key Arctic regions. The time evolution of the DAH coefficients of these modes can be modelled and predicted by using a set of coupled Stuart–Landau stochastic differential equations that capture the modes’ frequencies and amplitude modulation in time. Retrospective forecasts show that our resulting multilayer Stuart–Landau model (MSLM) is quite skilful in predicting September SIE compared to year-to-year persistence; moreover, the DAH–MSLM approach provided accurate real-time prediction that was highly competitive for the 2016–2017 Sea Ice Outlook.

Kondrashov, Dmitri, Mickaël D. Chekroun, and Pavel Berloff. 2018. “Multiscale Stuart-Landau Emulators: Application to Wind-Driven Ocean Gyres.” Fluids 3 (1): 21. Publisher's Version Abstract

The multiscale variability of the ocean circulation due to its nonlinear dynamics remains a big challenge for theoretical understanding and practical ocean modeling. This paper demonstrates how the data-adaptive harmonic (DAH) decomposition and inverse stochastic modeling techniques introduced in (Chekroun and Kondrashov, (2017), Chaos, 27), allow for reproducing with high fidelity the main statistical properties of multiscale variability in a coarse-grained eddy-resolving ocean flow. This fully-data-driven approach relies on extraction of frequency-ranked time-dependent coefficients describing the evolution of spatio-temporal DAH modes (DAHMs) in the oceanic flow data. In turn, the time series of these coefficients are efficiently modeled by a family of low-order stochastic differential equations (SDEs) stacked per frequency, involving a fixed set of predictor functions and a small number of model coefficients. These SDEs take the form of stochastic oscillators, identified as multilayer Stuart–Landau models (MSLMs), and their use is justified by relying on the theory of Ruelle–Pollicott resonances. The good modeling skills shown by the resulting DAH-MSLM emulators demonstrates the feasibility of using a network of stochastic oscillators for the modeling of geophysical turbulence. In a certain sense, the original quasiperiodic Landau view of turbulence, with the amendment of the inclusion of stochasticity, may be well suited to describe turbulence.

Decadal DAH mode

 

Kondrashov, Dmitri, and Mickaël D. Chekroun. 2018. “Data-adaptive harmonic analysis and modeling of solar wind-magnetosphere coupling.” Journal of Atmospheric and Solar-Terrestrial Physics 177: 179-189. Publisher's Version Abstract

The solar wind-magnetosphere coupling is studied by new data-adaptive harmonic (DAH) decomposition approach for the spectral analysis and inverse modeling of multivariate time observations of complex nonlinear dynamical systems. DAH identifies frequency-based modes of interactions in the combined dataset of Auroral Electrojet (AE) index and solar wind forcing. The time evolution of these modes can be very effi- ciently simulated by using systems of stochastic differential equations (SDEs) that are stacked per frequency and formed by coupled Stuart-Landau oscillators. These systems of SDEs capture the modes’ frequencies as well as their amplitude modulations, and yield, in turn, an accurate modeling of the AE index’ statistical properties.

 

Kondrashov, Dmitri, Mickaël D. Chekroun, Xiaojun Yuan, and Michael Ghil. 2018. “Data-adaptive harmonic decomposition and stochastic modeling of Arctic sea ice.” Advances in Nonlinear Geosciences, A. Tsonis, 179-205. Springer. Publisher's Version Abstract

We present and apply a novel method of describing and modeling complex multivariate datasets in the geosciences and elsewhere. Data-adaptive harmonic (DAH) decomposition identifies narrow-banded, spatio-temporal modes (DAHMs) whose frequencies are not necessarily integer multiples of each other. The evolution in time of the DAH coefficients (DAHCs) of these modes can be modeled using a set of coupled Stuart-Landau stochastic differential equations that capture the modes’ frequencies and amplitude modulation in time and space. This methodology is applied first to a challenging synthetic dataset and then to Arctic sea ice concentration (SIC) data from the US National Snow and Ice Data Center (NSIDC). The 36-year (1979–2014) dataset is parsimoniously and accurately described by our DAHMs. Preliminary results indicate that simulations using our multilayer Stuart-Landau model (MSLM) of SICs are stable for much longer time intervals, beyond the end of the twenty-first century, and exhibit interdecadal variability consistent with past historical records. Preliminary results indicate that this MSLM is quite skillful in predicting September sea ice extent.

2017
Chekroun, Mickaël D., and Dmitri Kondrashov. 2017. “Data-adaptive harmonic spectra and multilayer Stuart-Landau models.” Chaos: An Interdisciplinary Journal of Nonlinear Science 27 (9): 093110. Publisher's version Abstract

Harmonic decompositions of multivariate time series are considered for which we adopt an integral operator approach with  
periodic semigroup kernels. Spectral decomposition theorems are derived that cover the important cases of two-time statistics drawn from a mixing invariant measure. 

The corresponding eigenvalues can be grouped per Fourier frequency, and are actually given, at each frequency, as the singular values of a cross-spectral matrix depending on the data. These eigenvalues obey furthermore a variational principle that allows us to define naturally a multidimensional power spectrum.  The eigenmodes, as far as they are concerned, exhibit a data-adaptive character manifested in their phase which allows us in turn to define a multidimensional phase spectrum.

The resulting data-adaptive harmonic (DAH) modes allow for reducing the data-driven modeling effort to elemental models stacked per frequency, only coupled at different frequencies by the same noise realization. In particular, the DAH decomposition extracts time-dependent coefficients stacked by Fourier frequency which can be efficiently modeled---provided the decay of temporal correlations is sufficiently well-resolved---within a class of multilayer stochastic models (MSMs) tailored here on stochastic Stuart-Landau oscillators.

Applications to the Lorenz 96 model and to a stochastic heat equation driven by a space-time white noise, are considered. In both cases, the DAH decomposition allows for an extraction of spatio-temporal modes revealing key features of the dynamics in the embedded phase space. The multilayer Stuart-Landau models (MSLMs) are shown to successfully model the typical patterns of the corresponding time-evolving fields, as well as their statistics of occurrence.

Boers, N., M. D. Chekroun, H. Liu, D. Kondrashov, D.-D. Rousseau, A. Svensson, M. Bigler, and M. Ghil. 2017. “Inverse stochastic-dynamic models for high-resolution Greenland ice-core records.” Earth System Dynamics 8: 1171–1190. Publisher's Version Abstract

Proxy records from Greenland ice cores have been studied for several decades, yet many open questions remain regarding the climate variability encoded therein. Here, we use a Bayesian framework for inferring inverse, stochastic-dynamic models from δ18O and dust records of unprecedented, subdecadal temporal resolution. The records stem from the North Greenland Ice Core Project (NGRIP) and we focus on the time interval 59 ka–22 ka b2k. Our model reproduces the dynamical characteristics of both the δ18O and dust proxy records, including the millennial-scale Dansgaard–Oeschger variability, as well as statistical properties such as probability density functions, waiting times and power spectra, with no need for any external forcing. The crucial ingredients for capturing these properties are (i) high-resolution training data; (ii) cubic drift terms; (iii) nonlinear coupling terms between the δ18O and dust time series; and (iv) non-Markovian contributions that represent short-term memory effects.

2016
Kondrashov, Dmitri, Mickaël D. Chekroun, and Michael Ghil. 2016. “Comment on ``Nonparametric forecasting of low-dimensional dynamical systems''.” Phys. Rev. E 93. American Physical Society: 036201. Publisher's Version Abstract

The comparison performed in Berry et al. [Phys. Rev. E 91, 032915 (2015)] between the skill in predicting the El Niño-Southern Oscillation climate phenomenon by the prediction method of Berry et al. and the “past-noise” forecasting method of Chekroun et al. [Proc. Natl. Acad. Sci. USA 108, 11766 (2011)] is flawed. Three specific misunderstandings in Berry et al. are pointed out and corrected.

Chen, C., M. Cane, N. Henderson, D. Lee, D. Chapman, D. Kondrashov, and M. D. Chekroun. 2016. “Diversity, nonlinearity, seasonality and memory effect in ENSO simulation and prediction using empirical model reduction.” Journal of Climate 29 (5): 1809–1830. Publisher's Version Abstract

A suite of empirical model experiments under the empirical model reduction framework are conducted to advance the understanding of ENSO diversity, nonlinearity, seasonality, and the memory effect in the simulation and prediction of tropical Pacific sea surface temperature (SST) anomalies. The model training and evaluation are carried out using 4000-yr preindustrial control simulation data from the coupled model GFDL CM2.1. The results show that multivariate models with tropical Pacific subsurface information and multilevel models with SST history information both improve the prediction skill dramatically. These two types of models represent the ENSO memory effect based on either the recharge oscillator or the time-delayed oscillator viewpoint. Multilevel SST models are a bit more efficient, requiring fewer model coefficients. Nonlinearity is found necessary to reproduce the ENSO diversity feature for extreme events. The nonlinear models reconstruct the skewed probability density function of SST anomalies and improve the prediction of the skewed amplitude, though the role of nonlinearity may be slightly overestimated given the strong nonlinear ENSO in GFDL CM2.1. The models with periodic terms reproduce the SST seasonal phase locking but do not improve the prediction appreciably. The models with multiple ingredients capture several ENSO characteristics simultaneously and exhibit overall better prediction skill for more diverse target patterns. In particular, they alleviate the spring/autumn prediction barrier and reduce the tendency for predicted values to lag the target month value.

2015
Kondrashov, Dmitri, Mickaël D. Chekroun, and Michael Ghil. 2015. “Data-driven non-Markovian closure models.” Physica D: Nonlinear Phenomena 297: 33 - 55. Publisher's Version Abstract
Abstract This paper has two interrelated foci: (i) obtaining stable and efficient data-driven closure models by using a multivariate time series of partial observations from a large-dimensional system; and (ii) comparing these closure models with the optimal closures predicted by the Mori–Zwanzig (MZ) formalism of statistical physics. Multilayer stochastic models (MSMs) are introduced as both a generalization and a time-continuous limit of existing multilevel, regression-based approaches to closure in a data-driven setting; these approaches include empirical model reduction (EMR), as well as more recent multi-layer modeling. It is shown that the multilayer structure of \MSMs\ can provide a natural Markov approximation to the generalized Langevin equation (GLE) of the \MZ\ formalism. A simple correlation-based stopping criterion for an EMR–MSM model is derived to assess how well it approximates the \GLE\ solution. Sufficient conditions are derived on the structure of the nonlinear cross-interactions between the constitutive layers of a given \MSM\ to guarantee the existence of a global random attractor. This existence ensures that no blow-up can occur for a broad class of \MSM\ applications, a class that includes non-polynomial predictors and nonlinearities that do not necessarily preserve quadratic energy invariants. The EMR–MSM methodology is first applied to a conceptual, nonlinear, stochastic climate model of coupled slow and fast variables, in which only slow variables are observed. It is shown that the resulting closure model with energy-conserving nonlinearities efficiently captures the main statistical features of the slow variables, even when there is no formal scale separation and the fast variables are quite energetic. Second, an \MSM\ is shown to successfully reproduce the statistics of a partially observed, generalized Lotka–Volterra model of population dynamics in its chaotic regime. The challenges here include the rarity of strange attractors in the model’s parameter space and the existence of multiple attractor basins with fractal boundaries. The positivity constraint on the solutions’ components replaces here the quadratic-energy–preserving constraint of fluid-flow problems and it successfully prevents blow-up.
2013
Kondrashov, K., M. D. Chekroun, A. W. Robertson, and M. Ghil. 2013. “Low-order stochastic model and “past-noise forecasting” of the Madden-Julian oscillation.” Geophysical Research Letters 40 (19): 5303—5310. Publisher's Version Abstract

This paper presents a predictability study of the Madden-Julian Oscillation (MJO) that relies on combining empirical model reduction (EMR) with the “past-noise forecasting” (PNF) method. EMR is a data-driven methodology for constructing stochastic low-dimensional models that account for nonlinearity, seasonality and serial correlation in the estimated noise, while PNF constructs an ensemble of forecasts that accounts for interactions between (i) high-frequency variability (noise), estimated here by EMR, and (ii) the low-frequency mode of MJO, as captured by singular spectrum analysis (SSA). A key result is that—compared to an EMR ensemble driven by generic white noise—PNF is able to considerably improve prediction of MJO phase. When forecasts are initiated from weak MJO conditions, the useful skill is of up to 30 days. PNF also significantly improves MJO prediction skill for forecasts that start over the Indian Ocean.

2011
Chekroun, M. D., D. Kondrashov, and M. Ghil. 2011. “Predicting stochastic systems by noise sampling, and application to the El Niño-Southern Oscillation.” Proceeding of the National Academy of Sciences 108 (29): 11766—11771. Publisher's Version Abstract

Interannual and interdecadal prediction are major challenges of climate dynamics. In this article we develop a prediction method for climate processes that exhibit low-frequency variability (LFV). The method constructs a nonlinear stochastic model from past observations and estimates a path of the “weather” noise that drives this model over previous finite-time windows. The method has two steps: (i) select noise samples—or “snippets”—from the past noise, which have forced the system during short-time intervals that resemble the LFV phase just preceding the currently observed state; and (ii) use these snippets to drive the system from the current state into the future. The method is placed in the framework of pathwise linear-response theory and is then applied to an El Niño–Southern Oscillation (ENSO) model derived by the empirical model reduction (EMR) methodology; this nonlinear model has 40 coupled, slow, and fast variables. The domain of validity of this forecasting procedure depends on the nature of the system’s pathwise response; it is shown numerically that the ENSO model’s response is linear on interannual time scales. As a result, the method’s skill at a 6- to 16-month lead is highly competitive when compared with currently used dynamic and statistic prediction methods for the Niño-3 index and the global sea surface temperature field.