A time series is a sequence of data points indexed in time order. Formally, a time series is a stochastic process for , where is an index set, typically or for discrete-time time series. Analysis of time series involves understanding the underlying structure and function that produced the data, often for the purpose of forecasting future values.
The foundational assumption in many time series models is stationarity. A time series is strictly stationary if the joint distribution of is identical to that of for all .
In practice, strict stationarity is often too restrictive. Weak stationarity (or wide-sense stationarity) requires only that the first two moments are invariant with respect to time translation:
- for all .
- for all . The function is the autocovariance function at lag . The autocorrelation function (ACF) is defined as .
Foundational Time Series Processes
White Noise
A sequence of uncorrelated random variables with mean zero and finite, constant variance is termed a white noise process, denoted . The autocovariance function for white noise is given by if , and otherwise. When the process consists of independent and identically distributed (i.i.d.) random variables, it is termed strictly white noise. Gaussian white noise assumes .
Random Walk
A random walk is defined by the process , where . Expanding this equation yields (assuming ). The expected value is , but the variance is . Because the variance is strictly dependent on , a random walk is non-stationary. The covariance between and (where ) is .
Which of the following processes is strictly stationary?
Linear Models: AR, MA, and ARMA
Linear time series models capture the linear dependencies between observations.
Autoregressive (AR) Models
An autoregressive model of order , denoted AR(), models the current value as a linear combination of its previous values plus a white noise term: Using the backshift operator , where , the AR() model implies: where is the autoregressive polynomial. For an AR() process to be stationary, all roots of the characteristic equation must lie outside the unit circle in the complex plane (). For an AR(1) process , the condition simplifies to , yielding ACF .
Moving Average (MA) Models
A moving average model of order , denoted MA(), expresses as a linear combination of the current and previous white noise terms: Using the moving average polynomial , this is written as . Every finite-order MA process is stationary because it is a finite linear combination of stationary white noise processes. The autocovariance for , dictating that the ACF cuts off after lag .
Invertibility of an MA process ensures that it can be uniquely expressed as an infinite-order AR process. An MA() model is invertible if all roots of lie outside the unit circle.
ARMA and ARIMA Models
Combining AR and MA concepts forms the Autoregressive Moving Average model, ARMA():
Stationarity and invertibility of the ARMA process depend on the roots of and respectively. Time series exhibiting non-stationarity in the mean, such as trends, require differencing. First-order differencing removes linear trends; second-order removes quadratic trends. Applying differences produces an Autoregressive Integrated Moving Average model, ARIMA():
You are building a time series model for daily foreign exchange rates between USD and EUR. The log daily prices P_t exhibit a wandering behavior resembling a random walk. When you plot the differences X_t = log(P_t) - log(P_{t-1}), the resulting series mean-reverts to zero. The ACF of X_t shows significant spikes at lags 1 and 2, but vanishes to zero afterwards. The Partial Autocorrelation Function (PACF) gradually decays toward zero.
Based on the properties of X_t, what ARIMA model structure best represents the log price process P_t?
Partial Autocorrelation Function (PACF)
While the ACF measures the linear dependence between and inclusive of intermediate effects, the Partial Autocorrelation Function (PACF) isolates the direct correlation. The PACF at lag , denoted , represents the correlation between and after removing the linear dependence of both variables on the intermediate values .
For an AR() process, the PACF cuts off strictly after lag ( for ). Conversely, for an MA() process, the PACF tails off gradually. This dualistic behavior provides the foundation for the Box-Jenkins model identification methodology.
Spectral Analysis
Time domain analysis emphasizes serial correlations over time lags. Spectral analysis (frequency domain analysis) decomposes the variance of a time series over a continuous spectrum of angular frequencies . For a stationary process with autocovariance function , the spectral density function represents the Fourier transform of the autocovariance sequence: The total variance of the process corresponds to the integral over the frequency band: A peak at a specific frequency in the spectral density plot implies periodic behavior with cycle length . For Gaussian white noise, is absolute zero at all , rendering the spectral density perfectly flat: .
Filtering Operations in the frequency domain allow straightforward manipulation of time series signals. An LTI (Linear Time-Invariant) filter defined by sequence applies the convolution . The frequency response function of the filter is . The spectral density of the filtered output modifies according to:
Multivariate Time Series and Vector Autoregression (VAR)
When assessing joint dynamics of multiple interrelated time series , univariate ARIMA models are insufficient. The Vector Autoregressive model of order , VAR(), generalizes the AR structure to dimension : where are coefficient matrices and is a -dimensional multivariate white noise zero-mean vector strictly characterized by the covariance matrix .
Stationarity in a VAR system demands that roots of the determinant equation fall strictly outside the complex unit circle. VAR models naturally represent Granger causality: Granger-causes if the past observations of statistically improve the prediction horizon for compared to strict reliance on the isolated past of .
State-Space Models and the Kalman Filter
A more generalized analytic framework is provided by State-Space Modeling. A state-space model characterizes observation dynamics through an underlying, unobserved state variable sequence . The process divides into deterministic functional dependencies:
- Measurement Equation: Links observed data to the unobserved state.
- State Equation (Transition Equation): Governs Markovian state evolution over sequence steps.
Here, specifies observation measurement noise, and structural transition disturbance. Matrices configure the parameters of dynamic correlation.
The Kalman filter supplies a recursive mechanism for determining the optimal minimum mean-squared error (MMSE) estimator for the state vector given the accrued observation sequence up to time , . The calculation iterates between the prediction step and optimal update (correction) computation involving the Kalman gain component modifying the prediction based on observed innovation error.
In a generic linear state-space model evaluated using the Kalman filter framework, which sequence step incorporates information exclusively derived from novel observations y_t not previously included structurally?
Structural Breakpoints and Non-Linearities
Standard parametric assumptions often fail mapping prolonged macroeconomic sequences due to fundamental shifts in generating mechanisms. A structural breakpoint models definitive shifts within the parameter spaces governing stationary dynamics. Formally evaluating structural sequence integrity requires analyzing sequence partitions mapping varying ARMA polynomials strictly restricted within designated time indices corresponding to systemic shocks.
Alternatively, Arch/GARCH frameworks directly model phenomena demonstrating localized heteroskedasticity. The Generalized Autoregressive Conditional Heteroskedasticity framework models the distinct variance sequence dynamically: The GARCH formulation precisely quantifies volatility clustering characterizations fundamentally essential to contemporary financial risk modeling frameworks.
Advanced paradigms increasingly rely upon threshold autoregressive paradigms (TAR) addressing non-linear functional manifestations, or fractional integration models (ARFIMA) structurally designed for mapping processes exhibiting exceptionally protracted long-range dependency characterized by exceptionally slowed hyperbolic ACF exponential decay functions.