Statistical methods

Sella L, Vivaldo G, Groth A, Ghil M. Economic Cycles and Their Synchronization: A Comparison of Cyclic Modes in Three European Countries. Journal of Business Cycle Research [Internet]. 2016;12 (1) :25-48. Publisher's VersionAbstract

The present work applies singular spectrum analysis (SSA) to the study of macroeconomic fluctuations in three European countries: Italy, The Netherlands, and the United Kingdom. This advanced spectral method provides valuable spatial and frequency information for multivariate data sets and goes far beyond the classical forms of time domain analysis. In particular, SSA enables us to identify dominant cycles that characterize the deterministic behavior of each time series separately, as well as their shared behavior. We demonstrate its usefulness by analyzing several fundamental indicators of the three countries' real aggregate economy in a univariate, as well as a multivariate setting. Since business cycles are international phenomena, which show common characteristics across countries, our aim is to uncover supranational behavior within the set of representative European economies selected herein. Finally, the analysis is extended to include several indicators from the U.S. economy, in order to examine its influence on the European economies under study and their interrelationships.

Vautard R, Yiou P, Ghil M. Singular-spectrum analysis: A toolkit for short, noisy chaotic signals. Physica D. 1992;58 (1–4) :95–126.Abstract
Singular-spectrum analysis (SSA) is developed further, based on experience with applications to geophysical time series. It is shown that SSA provides a crude but robust approximation of strange attractors by tori, in the presence of noise. The method works well for short, noisy time series. The lagged-covariance matrix of the processes studied is the basis of SSA. We select subsets of eigenelements and associated principal components (PCs) in order to provide (i) a noise-reduction algorithm, (ii) a detrending algorithm, and (iii) an algorithm for the identification of oscillatory components. Reconstructed components (RCs) are developed to provide optimal reconstruction of a dynamic process at precise epochs, rather than averaged over the window length of the analysis. SSA is combined with advanced spectral-analysis methods - the maximum entropy method (MEM) and the multi-taper method (MTM) - to refine the interpretation of oscillatory behavior. A combined SSA-MEM method is also used for the prediction of selected subsets of RCs. The entire toolkit is validated against a set of four prescribed time series generated by known processes, quasi-periodic or chaotic. It is also applied to a time series of global surface air temperatures, 130 years long, which has attracted considerable attention in the context of the global warming issue and provides a severe test for noise reduction and prediction.
Penland C, Ghil M. Forecasting Northern Hemisphere 700\mbox-mb geopotential height anomalies using empirical normal modes. Monthly Weather Review. 1993;121 (8) :2355–2372.Abstract
Multivariate linear prediction based on single-lag inverse modeling is developed further and critically examined. The method is applied to the National Meteorological Center analyses of Northern Hemisphere 700-mb geopotential height anomalies, which have been filtered to eliminate periods shorter than 10 days. Empirically derived normal modes of the randomly forced linear system are usually correlated, even at zero lag, suggesting that combinations of modes should be used in predictions. Due to nonlinearities in the dynamics and the neglect of interactions with other pressure levels, the lag at which the analysis is performed is crucial; best predictions obtain when the autocovariances involved in the analysis are calculated at a lag comparable to the exponential decay times of the modes. Errors in prediction have a significant seasonal dependence, indicating that the annual cycle affects the higher-order statistics of the field. Optimized linear predictions using this method are useful for about half a day longer than predictions made by persistence. Conditional probabilities are much more efficiently calculated using normal-mode parameters than from histograms, and yield similar results. Maps of the model's Fourier spectra—integrated over specified frequency intervals and consistent with the assumptions made in a linear analysis—agree with maps obtained from fast Fourier transforms of the data.
Kravtsov S, Kondrashov D, Ghil M. Empirical model reduction and the modelling hierarchy in climate dynamics and the geosciences. Stochastic physics and climate modelling. Cambridge University Press, Cambridge. 2009 :35–72.Abstract
Modern climate dynamics uses a two-fisted approach in attacking and solving the problems of atmospheric and oceanic flows. The two fists are: (i) observational analyses; and (ii) simulations of the geofluids, including the coupled atmosphere–ocean system, using a hierarchy of dynamical models. These models represent interactions between many processes that act on a broad range of spatial and time scales, from a few to tens of thousands of kilometers, and from diurnal to multidecadal, respectively. The evolution of virtual climates simulated by the most detailed and realistic models in the hierarchy is typically as difficult to interpret as that of the actual climate system, based on the available observations thereof. Highly simplified models of weather and climate, though, help gain a deeper understanding of a few isolated processes, as well as giving clues on how the interaction between these processes and the rest of the climate system may participate in shaping climate variability. Finally, models of intermediate complexity, which resolve well a subset of the climate system and parameterise the remainder of the processes or scales of motion, serve as a conduit between the models at the two ends of the hierarchy. We present here a methodology for constructing intermediate mod- els based almost entirely on the observed evolution of selected climate fields, without reference to dynamical equations that may govern this evolution; these models parameterise unresolved processes as multi- variate stochastic forcing. This methodology may be applied with equal success to actual observational data sets, as well as to data sets resulting from a high-end model simulation. We illustrate this methodology by its applications to: (i) observed and simulated low-frequency variability of atmospheric flows in the Northern Hemisphere; (ii) observed evo- lution of tropical sea-surface temperatures; and (iii) observed air–sea interaction in the Southern Ocean. Similar results have been obtained for (iv) radial-diffusion model simulations of Earth’s radiation belts, but are not included here because of space restrictions. In each case, the reduced stochastic model represents surprisingly well a variety of linear and nonlinear statistical properties of the resolved fields. Our methodology thus provides an efficient means of constructing reduced, numerically inexpensive climate models. These models can be thought of as stochastic–dynamic prototypes of more complex deterministic models, as in examples (i) and (iv), but work just as well in the situation when the actual governing equations are poorly known, as in (ii) and (iii). These models can serve as competitive prediction tools, as in (ii), or be included as stochastic parameterisations of certain processes within more complex climate models, as in (iii). Finally, the methodology can be applied, with some modifications, to geophysical problems outside climate dynamics, as illustrated by (iv).
Groth A, Dumas P, Ghil M, Hallegatte S. Impacts of natural disasters on a dynamic economy. In: Chavez E, Ghil M, Urrutia-Fucugauchi J Extreme Events : Observations, Modeling, and Economics. American Geophysical Union and Wiley-Blackwell ; 2015. pp. 343–360.Abstract

This chapter presents a modeling framework for macroeconomic growth dynamics; it is motivated by recent attempts to formulate and study “integrated models” of the coupling between natural and socioeconomic phe­ nomena. The challenge is to describe the interfaces between human activities and the functioning of the earth system. We examine the way in which this interface works in the presence of endogenous business cycle dynam­ ics, based on a nonequilibrium dynamic model. Recent findings about the macroeconomic response to natural disasters in such a nonequilibrium setting have shown a more severe response to natural disasters during expan­ sions than during recessions. These findings raise questions about the assessment of climate change damages or natural disaster losses that are based purely on long-term growth models. In order to compare the theoretical findings with observational data, we analyze cyclic behavior in the U.S. economy, based on multivariate singular spectrum analysis. We analyze a total of nine aggregate indicators in a 52 year interval (1954–2005) and demon­ strate that the behavior of the U.S. economy changes significantly between intervals of growth and recession, with higher volatility during expansions.

Moron V, Vautard R, Ghil M. Trends, interdecadal and interannual oscillations in global sea-surface temperatures. Climate Dynamics. 1998;14 (7) :545–569.Abstract

This study aims at a global description of climatic phenomena that exhibit some regularity during the twentieth century. Multi-channel singular spectrum analysis is used to extract long-term trends and quasi-regular oscillations of global sea-surface temperature (SST) fields since 1901. Regional analyses are also performed on the Pacific, (Northern and Southern) Atlantic, and Indian Ocean basins. The strongest climatic signal is the irregular long-term trend, characterized by overall warming during 1910–1940 and since 1975, with cooling (especially of the Northern Hemisphere) between these two warming intervals. Substantial cooling prevailed in the North Pacific between 1950 and 1980, and continues in the North Atlantic today. Both cooling and warming are preceded by SST anomalies of the same sign in the subpolar North Atlantic. Near-decadal oscillations are present primarily over the North Atlantic, but also over the South Atlantic and the Indian Ocean. A 13–15-y oscillation exhibits a seesaw pattern between the Gulf-Stream region and the North-Atlantic Drift and affects also the tropical Atlantic. Another 7–8-y oscillation involves the entire double-gyre circulation of the North Atlantic, being mostly of one sign across the basin, with a minor maximum of opposite sign in the subpolar gyre and the major maximum in the northwestern part of the subtropical gyre. Three distinct interannual signals are found, with periods of about 60–65, 45 and 24–30 months. All three are strongest in the tropical Eastern Pacific. The first two extend throughout the whole Pacific and still exhibit some consistent, albeit weak, patterns in other ocean basins. The latter is weaker overall and has no consistent signature outside the Pacific. The 60-month oscillation obtains primarily before the 1960s and the 45-month oscillation afterwards.

Yiou P, Sornette D, Ghil M. Data-adaptive wavelets and multi-scale singular-spectrum analysis. Physica D. 2000;142 (3-4) :254–290.Abstract

Using multi-scale ideas from wavelet analysis, we extend singular-spectrum analysis (SSA) to the study of nonstationary time series, including the case where intermittency gives rise to the divergence of their variance. The wavelet transform resembles a local Fourier transform within a finite moving window whose width W, proportional to the major period of interest, is varied to explore a broad range of such periods. SSA, on the other hand, relies on the construction of the lag-correlation matrix C on M lagged copies of the time series over a fixed window width W to detect the regular part of the variability in that window in terms of the minimal number of oscillatory components; here W=M[Delta]t with [Delta]t as the time step. The proposed multi-scale SSA is a local SSA analysis within a moving window of width M<=W<=N, where N is the length of the time series. Multi-scale SSA varies W, while keeping a fixed W/M ratio, and uses the eigenvectors of the corresponding lag-correlation matrix C(M) as data-adaptive wavelets; successive eigenvectors of C(M) correspond approximately to successive derivatives of the first mother wavelet in standard wavelet analysis. Multi-scale SSA thus solves objectively the delicate problem of optimizing the analyzing wavelet in the time-frequency domain by a suitable localization of the signal's correlation matrix. We present several examples of application to synthetic signals with fractal or power-law behavior which mimic selected features of certain climatic or geophysical time series. The method is applied next to the monthly values of the Southern Oscillation Index (SOI) for 1933-1996; the SOI time series is widely believed to capture major features of the El Niño/Southern Oscillation (ENSO) in the Tropical Pacific. Our methodology highlights an abrupt periodicity shift in the SOI near 1960. This abrupt shift between 5 and 3 years supports the Devil's staircase scenario for the ENSO phenomenon (preliminary results of this study were presented at the XXII General Assembly of the European Geophysical Society, Vienna, May 1997, and at the Fall Meeting of the American Geophysical Union, San Francisco, December 1997).