The VAR's strength -- treating every variable as endogenous -- is also its weakness. A VAR(p) with n variables requires n^2*p slope coefficients. With n = 7 and p = 4 the system has 196 slopes plus 7 intercepts. Quarterly macro datasets rarely exceed 200 observations, so the parameter-to-observation ratio approaches the danger zone where OLS overfits noise. By the mid-1980s practitioners at the Federal Reserve Bank of Minneapolis had hit this wall repeatedly. Robert Litterman's 1986 Journal of Forecasting paper proposed the solution that became the standard: treat the VAR coefficients as random variables and impose a prior distribution that shrinks most of them toward zero. The resulting Bayesian VAR (BVAR) traded a small amount of bias for a large reduction in forecast variance.
The Minnesota prior -- named for the Minneapolis Fed where Litterman, Thomas Doan, and Christopher Sims developed it -- encodes a specific belief: each variable behaves roughly like a random walk with drift, and cross-variable lags are less important than own lags. Formally, the prior mean for a variable's own first lag is 1 (unit root belief); all other lag coefficients have prior mean 0. The prior variance on coefficient (j,k) at lag l shrinks at rate 1/l^2, penalizing distant lags, and cross-variable coefficients receive an additional shrinkage factor lambda_cross < 1. Two hyperparameters control the overall tightness (lambda) and the relative weight on cross-variable versus own-variable lags. When lambda approaches infinity the posterior collapses to the OLS VAR. When lambda approaches zero every variable becomes an independent random walk.
Giannone, Lenza, and Primiceri (2015, Review of Economics and Statistics) showed how to choose the Minnesota hyperparameters by marginal likelihood maximization, removing the last subjective element. Their approach treats lambda as a continuous hyperparameter and optimizes the marginal data density over a grid. The result: BVAR forecasts that are competitive with or better than large factor models, mixed-frequency models, and professional consensus forecasts for GDP, inflation, and unemployment at horizons of 1-8 quarters. The ECB's suite of BVARs follows this approach almost exactly.
Extensions since the original Minnesota prior include stochastic volatility (Cogley and Sargent, 2005), time-varying parameters (Primiceri, 2005), large BVARs with global-local shrinkage (Banbura, Giannone, and Reichlin, 2010), and hierarchical priors that learn the prior tightness from the data jointly with the coefficients. The BVAR has become the default forecasting tool at central banks worldwide -- not because it always wins point-forecast accuracy contests, but because it consistently avoids the catastrophic forecast failures that plague unrestricted VARs in moderate-to-large systems.