Statistical methods

2018
Kondrashov, Dmitri, Mickaël D. Chekroun, Xiaojun Yuan, and Michael Ghil. 2018. “Data-Adaptive Harmonic Decomposition and Stochastic Modeling of Arctic Sea Ice.” Advances in Nonlinear Geosciences, edited by Anastasios Tsonis. Springer. Publisher's Version Abstract

We present and apply a novel method of describing and modeling complex multivariate datasets in the geosciences and elsewhere. Data-adaptive harmonic (DAH) decomposition identifies narrow-banded, spatio-temporal modes (DAHMs) whose frequencies are not necessarily integer multiples of each other. The evolution in time of the DAH coefficients (DAHCs) of these modes can be modeled using a set of coupled Stuart-Landau stochastic differential equations that capture the modes’ frequencies and amplitude modulation in time and space. This methodology is applied first to a challenging synthetic dataset and then to Arctic sea ice concentration (SIC) data from the US National Snow and Ice Data Center (NSIDC). The 36-year (1979–2014) dataset is parsimoniously and accurately described by our DAHMs. Preliminary results indicate that simulations using our multilayer Stuart-Landau model (MSLM) of SICs are stable for much longer time intervals, beyond the end of the twenty-first century, and exhibit interdecadal variability consistent with past historical records. Preliminary results indicate that this MSLM is quite skillful in predicting September sea ice extent. 

Kondrashov, Dmitri, and Mickaël D Chekroun. 2018. “Data-adaptive harmonic analysis and modeling of solar wind-magnetosphere coupling.” Journal of Atmospheric and Solar-Terrestrial Physics, 177: 179-189. Publisher's Version Abstract
The solar wind-magnetosphere coupling is studied by new data-adaptive harmonic decomposition (DAHD) approach for the spectral analysis and inverse modeling of multivariate time observations of complex nonlinear dynamical systems. DAHD identifies frequency-based modes of interactions in the combined dataset of Auroral Electrojet (AE) index and solar wind forcing. The time evolution of these modes can be very efficiently simulated by using systems of stochastic differential equations (SDEs) that are stacked per frequency and formed by coupled Stuart-Landau oscillators. These systems of SDEs capture the modes' frequencies as well as their amplitude modulations, and yield, in turn, an accurate modeling of the AE index' statistical properties.
Kondrashov, Dmitri, Mickaël D. Chekroun, and Michael Ghil. 2018. “Data-adaptive harmonic decomposition and prediction of Arctic sea ice extent.” Dynamics and Statistics of the Climate System 3 (1): dzy001. Publisher's Version Abstract
Decline in the Arctic sea ice extent (SIE) is an area of active scientific research with profound socio-economic implications. Of particular interest are reliable methods for SIE forecasting on subseasonal time scales, in particular from early summer into fall, when sea ice coverage in the Arctic reaches its minimum. Here, we apply the recent data-adaptive harmonic (DAH) technique of Chekroun and Kondrashov, (2017), Chaos, 27 for the description, modeling and prediction of the Multisensor Analyzed Sea Ice Extent (MASIE, 2006–2016) data set. The DAH decomposition of MASIE identifies narrowband, spatio-temporal data-adaptive modes over four key Arctic regions. The time evolution of the DAH coefficients of these modes can be modelled and predicted by using a set of coupled Stuart–Landau stochastic differential equations that capture the modes’ frequencies and amplitude modulation in time. Retrospective forecasts show that our resulting multilayer Stuart–Landau model (MSLM) is quite skilful in predicting September SIE compared to year-to-year persistence; moreover, the DAH–MSLM approach provided accurate real-time prediction that was highly competitive for the 2016–2017 Sea Ice Outlook.
Kondrashov, Dmitri, Mickaël D. Chekroun, and Pavel Berloff. 2018. “Multiscale Stuart-Landau Emulators: Application to Wind-Driven Ocean Gyres.” Fluids 3 (1): 21. Publisher's Version Abstract

The multiscale variability of the ocean circulation due to its nonlinear dynamics remains a big challenge for theoretical understanding and practical ocean modeling. This paper demonstrates how the data-adaptive harmonic (DAH) decomposition and inverse stochastic modeling techniques introduced in (Chekroun and Kondrashov, (2017), Chaos, 27), allow for reproducing with high fidelity the main statistical properties of multiscale variability in a coarse-grained eddy-resolving ocean flow. This fully-data-driven approach relies on extraction of frequency-ranked time-dependent coefficients describing the evolution of spatio-temporal DAH modes (DAHMs) in the oceanic flow data. In turn, the time series of these coefficients are efficiently modeled by a family of low-order stochastic differential equations (SDEs) stacked per frequency, involving a fixed set of predictor functions and a small number of model coefficients. These SDEs take the form of stochastic oscillators, identified as multilayer Stuart–Landau models (MSLMs), and their use is justified by relying on the theory of Ruelle–Pollicott resonances. The good modeling skills shown by the resulting DAH-MSLM emulators demonstrates the feasibility of using a network of stochastic oscillators for the modeling of geophysical turbulence. In a certain sense, the original quasiperiodic Landau view of turbulence, with the amendment of the inclusion of stochasticity, may be well suited to describe turbulence. 

Sainte Fare Garnot, Vivien, Andreas Groth, and Michael Ghil. 2018. “Coupled Climate-Economic Modes in the Sahel's Interannual Variability.” Ecological Economics 153. Elsevier BV: 111–123. Abstract
We study the influence of interannual climate variability on the economy of several countries in the Sahel region. In the agricultural sector, we are able to identify coupled climate-economic modes that are statistically significant on interannual time scales. In particular, precipitation is a key climatic factor for agriculture in this semi-arid region. Locality and diversity characterize the Sahel's climatic and economic system, with the coupled climate-economic patterns exhibiting substantial differences from country to country. Large-scale atmospheric patterns — like the El Niño–Southern Oscillation and its quasi-biennial and quasi-quadrennial oscillatory modes — have quite limited influence on the economies, while more location-specific rainfall patterns play an important role.
PDF
2017
Chekroun, Mickaël D., and Dmitri Kondrashov. 2017. “Data-adaptive harmonic spectra and multilayer Stuart-Landau models.” Chaos 27: 093110. Publisher's Version Abstract

Harmonic decompositions of multivariate time series are considered for which we adopt an integral operator approach with periodic semigroup kernels. Spectral decomposition theorems are derived that cover the important cases of two-time statistics drawn from a mixing invariant measure.

The corresponding eigenvalues can be grouped per Fourier frequency, and are actually given, at each frequency, as the singular values of a cross-spectral matrix depending on the data. These eigenvalues obey furthermore a variational principle that allows us to define naturally a multidimensional power spectrum. The eigenmodes, as far as they are concerned, exhibit a data-adaptive character manifested in their phase which allows us in turn to define a multidimensional phase spectrum.

The resulting data-adaptive harmonic (DAH) modes allow for reducing the data-driven modeling effort to elemental models stacked per frequency, only coupled at different frequencies by the same noise realization. In particular, the DAH decomposition extracts time-dependent coe cients stacked by Fourier frequency which can be e ciently modeled—provided the decay of temporal correlations is su ciently well-resolved—within a class of multilayer stochastic models (MSMs) tailored here on stochastic Stuart-Landau oscillators.

Applications to the Lorenz 96 model and to a stochastic heat equation driven by a space-time white noise, are considered. In both cases, the DAH decomposition allows for an extraction of spatio-temporal modes revealing key features of the dynamics in the embedded phase space. The multilayer Stuart-Landau models (MSLMs) are shown to successfully model the typical patterns of the corresponding time-evolving fields, as well as their statistics of occurrence. 

Groth, Andreas, and Michael Ghil. 2017. “Synchronization of world economic activity.” Chaos 27 (12): 127002. Abstract

Common dynamical properties of business cycle fluctuations are studied in a sample of more than 100 countries that represent economic regions from all around the world. We apply the methodology of multivariate singular spectrum analysis (M-SSA) to identify oscillatory modes and to detect whether these modes are shared by clusters of phase- and frequency-locked oscillators. An extension of the M-SSA approach is introduced to help analyze structural changes in the cluster configuration of synchronization. With this novel technique, we are able to identify a common mode of business cycle activity across our sample, and thus point to the existence of a world business cycle. Superimposed on this mode, we further identify several major events that have markedly influenced the landscape of world economic activity in the postwar era.

PDF
Groth, Andreas, and Michael Ghil. 2017. Synchronization of world economic activity. Paris: Chair Energy & Prosperity. Publisher's version Abstract

Common dynamical properties of business cycle fluctuations are studied in a sample of more than 100 countries that represent economic regions from all around the world. We apply the methodology of multivariate singular spectrum analysis (M-SSA) to identify oscillatory modes and to detect whether these modes are shared by clusters of phase- and frequency-locked oscillators. An extension of the M-SSA approach is introduced to help analyze structural changes in the cluster configuration of synchronization. With this novel technique, we are able to identify a common mode of business cycle activity across our sample, and thus point to the existence of a world business cycle. Superimposed on this mode, we further identify several major events that have markedly influenced the landscape of world economic activity in the postwar era. These findings raise therefore questions about assessments of climate change impacts that are based purely on long-term economic growth models. A key conclusion is the importance of endogenous-dynamics e?ects at the interface between natural climate variability and economic fluctuations.

Groth, Andreas, Yizhak Feliks, Dmitri Kondrashov, and Michael Ghil. 2017. “Interannual variability in the North Atlantic ocean’s temperature field and its association with the wind stress forcing.” Journal of Climate 30 (7): 2655-2678. Abstract

Spectral analyses of the North Atlantic temperature field in the Simple Ocean Data Analysis (SODA) reanalysis identify prominent and statistically significant interannual oscillations along the Gulf Stream front and in large regions of the North Atlantic. A 7–8-yr oscillatory mode is characterized by a basin-wide southwest-to-northeast–oriented propagation pattern in the sea surface temperature (SST) field. This pattern is found to be linked to a seesaw in the meridional-dipole structure of the zonal wind stress forcing (TAUX). In the subpolar gyre, the SST and TAUX fields of this mode are shown to be in phase opposition, which suggests a cooling effect of the wind stress on the upper ocean layer. Over all, this mode’s temperature field is characterized by a strong equivalent-barotropic component, as shown by covariations in SSTs and sea surface heights, and by phase-coherent behavior of temperature layers at depth with the SST field. Recent improvements of multivariate singular spectrum analysis (M-SSA) help separate spatio-temporal patterns. This methodology is developed further and applied to studying the ocean’s response to variability in the atmospheric forcing. Statistical evidence is shown to exist for other mechanisms generating oceanic variability of similar 7–8-yr periodicity in the Gulf Stream region; the latter variability is likewise characterized by a strongly equivalent-barotropic component. Two other modes of biennial variability in the Gulf Stream region are also identified, and it is shown that interannual variability in this region cannot be explained by the ocean’s response to similar variability in the atmospheric forcing alone.

PDF North Atlantic SST 7.7-yr mode
2016

Spectral analyses of the North Atlantic temperature field in the Simple Ocean Data Analysis (SODA) reanalysis identify prominent and statistically significant interannual oscillations along the Gulf Stream front and in large regions of the North Atlantic. A 7--8-yr oscillatory mode is characterized by a basin-wide southwest-to-northeast--oriented propagation pattern in the sea-surface temperature (SST) field. This pattern is found to be linked to a seesaw in the meridional-dipole structure of the zonal wind-stress forcing (TAUX). In the subpolar gyre, the SST and TAUX fields of this mode are shown to be in phase opposition, which suggests a cooling effect of the wind stress on the upper ocean layer. Over all, this mode's temperature field is characterized by a strong equivalent-barotropic component, as shown by covariations in SST and sea-surface height (SSH), and by phase-coherent behavior of temperature layers at depth with the SST field. On the other hand, this mode shares many features of the gyre mode and raises the possibilty for the existence of an intrinsic oceanic mode of similar 7--8-yr period in the Gulf Stream region.

PDF North Atlantic SST 7.7-yr mode
Greco, G, Dmitri Kondrashov, S Kobayashi, Michael Ghil, M Branchesi, C Guidorzi, G Stratta, M Ciszak, F Marino, and A Ortolan. 2016. “Singular Spectrum Analysis for astronomical time series: constructing a parsimonious hypothesis test.” The Universe of Digital Sky Surveys, 105–107. Springer. Publisher's Version
Sella, Lisa, Gianna Vivaldo, Andreas Groth, and Michael Ghil. 2016. “Economic Cycles and Their Synchronization: A Comparison of Cyclic Modes in Three European Countries.” Journal of Business Cycle Research 12 (1). Springer: 25-48. Publisher's Version Abstract

The present work applies singular spectrum analysis (SSA) to the study of macroeconomic fluctuations in three European countries: Italy, The Netherlands, and the United Kingdom. This advanced spectral method provides valuable spatial and frequency information for multivariate data sets and goes far beyond the classical forms of time domain analysis. In particular, SSA enables us to identify dominant cycles that characterize the deterministic behavior of each time series separately, as well as their shared behavior. We demonstrate its usefulness by analyzing several fundamental indicators of the three countries' real aggregate economy in a univariate, as well as a multivariate setting. Since business cycles are international phenomena, which show common characteristics across countries, our aim is to uncover supranational behavior within the set of representative European economies selected herein. Finally, the analysis is extended to include several indicators from the U.S. economy, in order to examine its influence on the European economies under study and their interrelationships.

PDF
Walwer, Damian, Eric Calais, and Michael Ghil. 2016. “Data-Adaptive Detection of Transient Deformation in Geodetic Networks.” Journal of Geophysical Research: Solid Earth 121 (3). Wiley Online Library: 2129-2152 . Abstract

The recent development of dense and continuously operating Global Navigation Satellite System (GNSS) networks worldwide has led to a significant increase in geodetic data sets that sometimes capture transient-deformation signals. It is challenging, however, to extract such transients of geophysical origin from the background noise inherent to GNSS time series and, even more so, to separate them from other signals, such as seasonal redistributions of geophysical fluid mass loads. In addition, because of the very large number of continuously recording GNSS stations now available, it has become impossible to systematically inspect each time series and visually compare them at all neighboring sites. Here we show that Multichannel Singular Spectrum Analysis (M-SSA), a method derived from the analysis of dynamical systems, can be used to extract transient deformations, seasonal oscillations, and background noise present in GNSS time series. M-SSA is a multivariate, nonparametric, statistical method that simultaneously exploits the spatial and temporal correlations of geophysical fields. The method allows for the extraction of common modes of variability, such as trends with nonconstant slopes and oscillations shared across time series, without a priori hypotheses about their spatiotemporal structure or their noise characteristics. We illustrate this method using synthetic examples and show applications to actual GPS data from Alaska to detect seasonal signals and microdeformation at the Akutan active volcano. The geophysically coherent spatiotemporal patterns of uplift and subsidence thus detected are compared to the results of an idealized model of such processes in the presence of a magma chamber source.

Chen, C., M. A. Cane, N. Henderson, D. Eun Lee, D. Chapman, Dmitri Kondrashov, and Mickaël D. Chekroun. 2016. “Diversity, nonlinearity, seasonality and memory effect in ENSO simulation and prediction using empirical model reduction.” Journal of Climate 29 (5): 1809-1830. Abstract

A suite of empirical model experiments under the empirical model reduction framework are conducted to advance the understanding of ENSO diversity, nonlinearity, seasonality, and the memory effect in the simulation and prediction of tropical Pacific sea surface temperature (SST) anomalies. The model training and evaluation are carried out using 4000-yr preindustrial control simulation data from the coupled model GFDL CM2.1. The results show that multivariate models with tropical Pacific subsurface information and multilevel models with SST history information both improve the prediction skill dramatically. These two types of models represent the ENSO memory effect based on either the recharge oscillator or the time-delayed oscillator viewpoint. Multilevel SST models are a bit more efficient, requiring fewer model coefficients. Nonlinearity is found necessary to reproduce the ENSO diversity feature for extreme events. The nonlinear models reconstruct the skewed probability density function of SST anomalies and improve the prediction of the skewed amplitude, though the role of nonlinearity may be slightly overestimated given the strong nonlinear ENSO in GFDL CM2.1. The models with periodic terms reproduce the SST seasonal phase locking but do not improve the prediction appreciably. The models with multiple ingredients capture several ENSO characteristics simultaneously and exhibit overall better prediction skill for more diverse target patterns. In particular, they alleviate the spring/autumn prediction barrier and reduce the tendency for predicted values to lag the target month value.

Edeline, Eric, Andreas Groth, Bernard Cazelles, David Claessen, Ian J. Winfield, Jan Ohlberger, L. Asbjørn Vøllestad, Nils C. Stenseth, and Michael Ghil. 2016. “Pathogens trigger top-down climate forcing on ecosystem dynamics.” Oecologia, 1–14. Abstract

Evaluating the effects of climate variation on ecosystems is of paramount importance for our ability to forecast and mitigate the consequences of global change. However, the ways in which complex food webs respond to climate variations remain poorly understood. Here, we use long-term time series to investigate the effects of temperature variation on the intraguild-predation (IGP) system of Windermere (UK), a lake where pike (Esox lucius, top predator) feed on small-sized perch (Perca fluviatilis) but compete with large-sized perch for the same food sources. Spectral analyses of time series reveal that pike recruitment dynamics are temperature controlled. In 1976, expansion of a size-truncating perch pathogen into the lake severely impacted large perch and favoured pike as the IGP-dominant species. This pathogen-induced regime shift to a pike-dominated IGP apparently triggered a temperature-controlled trophic cascade passing through pike down to dissolved nutrients. In simple food chains, warming is predicted to strengthen top–down control by accelerating metabolic rates in ectothermic consumers, while pathogens of top consumers are predicted to dampen this top–down control. In contrast, the local IGP structure in Windermere made warming and pathogens synergistic in their top–down effects on ecosystem functioning. More generally, our results point to top predators as major mediators of community response to global change, and show that size-selective agents (e.g. pathogens, fishers or hunters) may change the topological architecture of food webs and alter whole ecosystem sensitivity to climate variation.

PDF PDF - Supplementary material
2015
Groth, Andreas. 2015. “Business cycle analysis and forecasting using advanced spectral methods and data-based low-order models.” 35th International Symposium on Forecasting Riverside, California, June 2015. Abstract

PDF
Kondrashov, Dmitri, Mickaël D. Chekroun, and Michael Ghil. 2015. “Data-driven non-Markovian closure models.” Physica D: Nonlinear Phenomena 297. Elsevier: 33–55. Abstract

This paper has two interrelated foci: (i) obtaining stable and efficient data-driven closure models by using a multivariate time series of partial observations from a large-dimensional system; and (ii) comparing these closure models with the optimal closures predicted by the Mori–Zwanzig (MZ) formalism of statistical physics. Multilayer stochastic models (MSMs) are introduced as both a generalization and a time-continuous limit of existing multilevel, regression-based approaches to closure in a data-driven setting; these approaches include empirical model reduction (EMR), as well as more recent multi-layer modeling. It is shown that the multilayer structure of MSMs can provide a natural Markov approximation to the generalized Langevin equation (GLE) of the MZ formalism. A simple correlation-based stopping criterion for an EMR–MSM model is derived to assess how well it approximates the GLE solution. Sufficient conditions are derived on the structure of the nonlinear cross-interactions between the constitutive layers of a given MSM to guarantee the existence of a global random attractor. This existence ensures that no blow-up can occur for a broad class of MSM applications, a class that includes non-polynomial predictors and nonlinearities that do not necessarily preserve quadratic energy invariants. The EMR–MSM methodology is first applied to a conceptual, nonlinear, stochastic climate model of coupled slow and fast variables, in which only slow variables are observed. It is shown that the resulting closure model with energy-conserving nonlinearities efficiently captures the main statistical features of the slow variables, even when there is no formal scale separation and the fast variables are quite energetic. Second, an MSM is shown to successfully reproduce the statistics of a partially observed, generalized Lotka–Volterra model of population dynamics in its chaotic regime. The challenges here include the rarity of strange attractors in the model’s parameter space and the existence of multiple attractor basins with fractal boundaries. The positivity constraint on the solutions’ components replaces here the quadratic-energy–preserving constraint of fluid-flow problems and it successfully prevents blow-up.

Groth, Andreas, Patrice Dumas, Michael Ghil, and Stéphane Hallegatte. 2015. “Impacts of natural disasters on a dynamic economy.” Extreme Events : Observations, Modeling, and Economics, edited by Eric Chavez, Michael Ghil, and Jaime Urrutia-Fucugauchi, 343–360. American Geophysical Union and Wiley-Blackwell. Abstract

This chapter presents a modeling framework for macroeconomic growth dynamics; it is motivated by recent attempts to formulate and study “integrated models” of the coupling between natural and socioeconomic phe­ nomena. The challenge is to describe the interfaces between human activities and the functioning of the earth system. We examine the way in which this interface works in the presence of endogenous business cycle dynam­ ics, based on a nonequilibrium dynamic model. Recent findings about the macroeconomic response to natural disasters in such a nonequilibrium setting have shown a more severe response to natural disasters during expan­ sions than during recessions. These findings raise questions about the assessment of climate change damages or natural disaster losses that are based purely on long-term growth models. In order to compare the theoretical findings with observational data, we analyze cyclic behavior in the U.S. economy, based on multivariate singular spectrum analysis. We analyze a total of nine aggregate indicators in a 52 year interval (1954–2005) and demon­ strate that the behavior of the U.S. economy changes significantly between intervals of growth and recession, with higher volatility during expansions.

PDF
Groth, Andreas, and Michael Ghil. 2015. “Monte Carlo Singular Spectrum Analysis (SSA) revisited: Detecting oscillator clusters in multivariate datasets.” Journal of Climate 28 (19): 7873–7893. Abstract

Singular spectrum analysis (SSA) along with its multivariate extension (M-SSA) provides an efficient way to identify weak oscillatory behavior in high-dimensional data. To prevent the misinterpretation of stochastic fluctuations in short time series as oscillations, Monte Carlo (MC)–type hypothesis tests provide objective criteria for the statistical significance of the oscillatory behavior. Procrustes target rotation is introduced here as a key method for refining previously available MC tests. The proposed modification helps reduce the risk of type-I errors, and it is shown to improve the test’s discriminating power. The reliability of the proposed methodology is examined in an idealized setting for a cluster of harmonic oscillators immersed in red noise. Furthermore, the common method of data compression into a few leading principal components, prior to M-SSA, is reexamined, and its possibly negative effects are discussed. Finally, the generalized Procrustes test is applied to the analysis of interannual variability in the North Atlantic’s sea surface temperature and sea level pressure fields. The results of this analysis provide further evidence for shared mechanisms of variability between the Gulf Stream and the North Atlantic Oscillation in the interannual frequency band.

PDF
Groth, Andreas, Michael Ghil, Stéphane Hallegatte, and Patrice Dumas. 2015. “The Role of Oscillatory Modes in U.S. Business Cycles.” OECD Journal: Journal of Business Cycle Measurement and Analysis, no. 2015/1: 63–81. Abstract

We apply multivariate singular spectrum analysis to the study of U.S. business cycle dynamics. This method provides a robust way to identify and reconstruct oscillations, whether intermittent or modulated. We show such oscillations to be associated with comovements across the entire economy. The problem of spurious cycles generated by the use of detrending filters is addressed and we present a Monte Carlo test to extract significant oscillations. The behavior of the U.S. economy is shown to change significantly from one phase of the business cycle to another: the recession phase is dominated by a five-year mode, while the expansion phase exhibits more complex dynamics, with higher-frequency modes coming into play. We show that the variations so identified cannot be generated by random shocks alone, as assumed in ‘real’ business-cycle models, and that endogenous, deterministically generated variability has to be involved.

PDF

Pages