Macroeconomic model reference

Kalman filter Model

Recursive state estimator that updates latent macro states as each noisy observation arrives.

Empirical forecasting models · Model guide

Kalman filter: question, structure, and use cases

Recursive state estimator that updates latent macro states as each noisy observation arrives.

What is the best linear estimate of an unobservable state vector given noisy observations arriving sequentially?

Background

Rudolf Kalman published 'A New Approach to Linear Filtering and Prediction Problems' in 1960. The result was immediate and far-reaching: given a linear state-space model with Gaussian disturbances, the Kalman filter computes the exact conditional mean and covariance of the latent state at each point in time, using only the previous state estimate and the current observation. No other linear estimator achieves lower mean-squared error. The filter was adopted first by aerospace engineers (Apollo navigation systems), then by electrical engineers, and by the late 1980s it had become central to macroeconometrics through the work of Harvey (1989) and Hamilton (1994).

The state-space representation splits the world into what you observe (measurement equation) and what you model (transition equation). The transition equation describes how the latent state evolves over time - trend growth, cyclical dynamics, time-varying parameters, factor loadings. The measurement equation links the latent state to observable data, with measurement noise. The Kalman filter takes this two-equation system and produces the prediction step (project the state forward using the transition equation) and the update step (revise the projection when new data arrive). The update step is the key: it weights the model's prediction against the data observation using the Kalman gain, which is optimal in the MSE sense.

In macroeconomics, the Kalman filter serves three roles. First, it estimates unobservable states: the output gap, the natural rate of interest, underlying inflation trends. Second, it evaluates the likelihood function of any model that can be written in state-space form, enabling maximum likelihood estimation of structural parameters. Third, it handles missing data and mixed-frequency observations naturally - if an observation is missing at time t, the update step is skipped and the prediction step carries the state forward under increased uncertainty.

The filter's optimality is conditional on linearity and Gaussianity. When the state-space model is nonlinear (e.g., DSGE models with occasionally binding constraints), the extended Kalman filter (EKF) or unscented Kalman filter (UKF) provides approximations. When disturbances are non-Gaussian, the particle filter replaces the Kalman filter with sequential Monte Carlo. But for the large class of linear Gaussian models that dominate applied macroeconometrics - UCMs, dynamic factor models, TVP-VARs, nowcasting systems - the Kalman filter is exact and optimal.

How the Parts Fit Together

The Kalman filter requires two inputs: a fully specified state-space model (system matrices F, H, Q, R and initial conditions) and a sequence of observations $y_1$ , ..., $y_T$ . The system matrices can be fixed or time-varying. The state vector alpha_t has dimension m, which can be much larger than the observation dimension n. Deterministic inputs (intercepts, exogenous variables) enter through both equations.

The recursive algorithm has two phases per time step. The prediction step uses the transition equation to project the state and its covariance one step ahead: alpha_{t|t-1} = F * alpha_{t-1|t-1} and $P_{t|t-1} = F * P_{t-1|t-1} * F$ ' + Q. The update step incorporates the new observation $y_t$ : compute the innovation $v_t = y_t - H *$ alpha_{t|t-1}, the innovation variance $S_t = H * P_{t|t-1} * H$ ' + R, the Kalman gain $K_t = P_{t|t-1} * H$ ' * $S_t^{-1}$ , and then update: alpha_{t|t} = alpha_{t|t-1} + $K_t * v_t$ and $P_{t|t} =$ (I - $K_t * H) * P_{t|t-1}$ . The Kalman gain determines how much weight to give the data versus the model prediction.

The filter's outputs include the filtered state alpha_{t|t} (using data up to time t), the predicted state alpha_{t|t-1} (using data up to t-1), the innovation sequence $v_t$ (prediction errors), and the innovation variances $S_t$ . The log-likelihood is computed as a byproduct: sum of log-densities of $v_t$ evaluated under $N(0, S_t)$ . For smoothing (estimating alpha_t using the full sample 1..T), the Rauch-Tung-Striebel (RTS) backward smoother runs after the forward filter pass.

Applications

The Federal Reserve Board's FRB/US model, the European Central Bank's area-wide model, and virtually every central bank's nowcasting system use the Kalman filter to estimate latent states - the output gap, the natural rate of unemployment (NAIRU), and the neutral real interest rate (r*). Laubach and Williams (2003) estimated r* using a two-sided Kalman smoother that became a benchmark for monetary policy discussions.

Dynamic factor models for coincident economic indicators (Stock and Watson 1991, 2002) run the Kalman filter on a state-space representation where a small number of latent factors drive a large panel of observed series. The Federal Reserve Bank of New York's nowcasting model (Bok et al. 2018) is essentially a dynamic factor model estimated via the Kalman filter with news decomposition.

The Kalman filter requires a correctly specified linear Gaussian model. When the system is nonlinear (DSGE models with zero-lower-bound constraints), the EKF or particle filter is needed. When disturbances have fat tails (financial returns during crises), the Kalman filter underweights extreme observations and the filtered state lags reality. With very high-dimensional state vectors (m > 100), the Kalman filter's $O(m^3)$ matrix inversions become expensive per time step, motivating square-root filters or ensemble methods.

Extensions include the square-root filter (numerically stable for ill-conditioned P matrices), the information filter (propagates $P^{-1}$ instead of P, useful when some observations are very precise), the steady-state Kalman filter (pre-computes the converged gain K for computational speed), and the augmented Kalman filter for correlated state and measurement disturbances.

Components

\alpha_t

State vector

m-dimensional latent vector at time t (e.g., trend, cycle, time-varying coefficients). Not directly observed.

F_t

Transition matrix

m x m matrix governing the state's evolution: alpha_t = $F_t *$ alpha_{t-1} + eta_t. Encodes persistence, mean-reversion, and cross-state dynamics.

H_t

Measurement matrix

n x m matrix linking the state to observations: $y_t = H_t *$ alpha_t + epsilon_t. Selects which state components are observed.

Q_t

State disturbance covariance

m x m covariance of the state innovation eta_t. Controls how much the state drifts each period.

R_t

Measurement noise covariance

n x n covariance of the observation noise epsilon_t. Captures data noise, measurement error, and idiosyncratic variation.

K_t

Kalman gain

m x n weighting matrix: $K_t = P_{t|t-1} H$ ' $S_t^{-1}$ . Balances the model's prediction against the data innovation.

v_t = y_t - H \alpha_{t|t-1}

Innovation (prediction error)

Difference between the observed $y_t$ and the model's prediction. Under correct specification, $v_t$ is $N(0, S_t)$ and serially uncorrelated.

Assumptions

Linear state-space modelMaintained

Both the transition and measurement equations are linear in the state and disturbances.

If violated: Nonlinearity invalidates the Kalman filter's optimality. Use the extended Kalman filter, unscented Kalman filter, or particle filter.

Gaussian disturbancesTestable

State innovations eta_t and measurement errors epsilon_t are jointly Gaussian.

If violated: Without Gaussianity, the Kalman filter remains the best linear unbiased estimator but is no longer the minimum-MSE estimator among all estimators. Innovation-based likelihood is an approximation.

Known system matricesMaintained

F, H, Q, R are either known or estimated in a prior step. The filter conditions on these matrices.

If violated: If system matrices are misspecified, the filter is suboptimal and the innovation diagnostics will flag model failure.

Independence of state and measurement disturbancesTestable

$E[eta_t epsilon_s'] = 0$ for all t, s.

If violated: Correlated disturbances require a modified Kalman filter (or augmenting the state to absorb the correlation). Standard formulas produce biased state estimates.

Correct initial conditionsMaintained

The initial state alpha_0 and covariance $P_0$ are correctly specified (exact or diffuse).

If violated: Poor initial conditions affect the first several filtered states. Diffuse initialization ( $P_0 =$ kappa * I with kappa large) mitigates this but requires careful treatment of the initial log-likelihood contributions.

No structural breaks in system matricesTestable

F, H, Q, R are constant (or their time variation is correctly parameterized).

If violated: Unmodeled structural breaks cause persistent innovation bias and filter divergence. The innovation sequence will show autocorrelation or level shifts.

Concepts, data, and nearby models

Open the concept, data series, policy setting, or neighboring model that anchors this page.

Kalman filter: question, structure, and use cases

Background

How the Parts Fit Together

Applications

Components

Assumptions

Concepts, data, and nearby models

Concepts

Indicators

Policy

Nearby models