Risk Measures — Cross-Cutting Comparison

This note compares every risk measure used across the Finance library — variance / standard deviation, semi-variance, MAD, VaR (parametric / historical / Monte Carlo), CVaR / Expected Shortfall (ES), spectral risk, distortion risk, entropic risk, Sharpe, Sortino, Treynor, Jensen alpha, Calmar, MAR, Pain, Ulcer, Information ratio, K-ratio, max drawdown, time-under-water, MAE/MFE, beta, idiosyncratic vol — on the axes of coherence (Artzner-Delbaen-Eber-Heath 1999), elicitability (Gneiting 2011), regulatory acceptance (Basel III/IV FRTB, EU Solvency II/III, ICS), tail-sensitivity, and computational cost. Decision tree at the end picks by regulatory regime, reporting requirement, portfolio type, and investor type.

See also

1. The taxonomy

DISPERSION                                COHERENT (Artzner et al 1999)
  variance                                  CVaR / Expected Shortfall (ES)
  std dev (volatility)                      spectral risk (Acerbi 2002)
  semi-variance                             distortion (Wang 2000)
  mean absolute deviation (MAD)             entropic risk
                                            worst-case CVaR
QUANTILE                                  
  Value-at-Risk (VaR)                     CONVEX (Föllmer-Schied 2002 relax PH)
  conditional VaR (CVaR / ES)              convex risk measure
  range (max - min)                        OCE (Ben-Tal-Teboulle 2007)
                                           shortfall risk

DRAWDOWN                                  RATIO / RISK-ADJUSTED RETURN
  max drawdown (MDD)                        Sharpe (1966)
  conditional drawdown (Chekhlov 2005)      Sortino (Sortino-Price 1994)
  pain index, ulcer index                   Treynor (1965)
  Calmar = ann. return / MDD                Jensen alpha (1968)
  MAR ratio                                 M-squared (Modigliani-Miller-jr 1997)
  K-ratio (Kestner)                         Information ratio
  time-under-water (TUW)                    Calmar
                                            Omega (Keating-Shadwick 2002)
                                            Kappa (Kaplan-Knowles 2004)
TRADING                                   FACTOR
  MAE (Max Adverse Excursion)               beta (CAPM)
  MFE (Max Favorable Excursion)             idiosyncratic vol
  R-multiple                                downside beta
  expectancy                                Treynor-Mazuy / Henriksson-Merton

2. Coherence — the Artzner-Delbaen-Eber-Heath 1999 axioms

A risk measure is coherent iff it satisfies all four axioms:

  1. Monotonicity — if X ≤ Y pathwise then ρ(X) ≥ ρ(Y) (less return = more risk).
  2. Translation invariance — ρ(X + c) = ρ(X) - c (adding cash reduces risk by exactly c).
  3. Positive homogeneity — ρ(λX) = λρ(X) for λ ≥ 0 (scaling positions scales risk).
  4. Sub-additivity — ρ(X + Y) ≤ ρ(X) + ρ(Y) (diversification can only reduce risk).
MeasureCoherent?Why / why not
Variance / std devNOnot monotone, not translation invariant
Semi-varianceNOnot monotone
MADNOnot monotone
VaRNOfails sub-additivity (Artzner et al 1999 counter-example)
CVaR / Expected ShortfallYESthe canonical coherent measure
Spectral riskYESweighted average of quantiles w/ non-increasing weight function
Distortion riskYES (if distortion fn is concave)Wang 2000
Entropic riskNO (not positive homogeneous; convex)but convex
Worst-case CVaRYESover a family of measures
Max drawdownNOnot coherent in the ADEH sense (uses path)
Sharpe rationot a risk measure (ratio)uses dispersion

The single most important consequence: VaR is not coherent. It fails sub-additivity (Artzner et al 1999 counter-example: two long-out-of-the-money options on different underlyings have lower VaR sum than the portfolio VaR). This is why Basel III FRTB (Fundamental Review of the Trading Book) replaced VaR with ES.

3. Convex risk measures — Föllmer-Schied 2002 relaxation

Föllmer-Schied 2002 dropped positive homogeneity, keeping only:

  1. Monotonicity
  2. Translation invariance
  3. Convexity — ρ(λX + (1-λ)Y) ≤ λρ(X) + (1-λ)ρ(Y)

Convex risk measures admit a dual representation:

where is a penalty function. Coherent = convex + positive homogeneous = α takes only values 0 or +∞.

Entropic risk: is convex but not coherent; γ is the risk-aversion parameter. Used in optimal control + reinforcement learning (risk-sensitive RL).

4. Elicitability — Gneiting 2011 framing

A risk measure is elicitable if there exists a scoring function S such that ρ(X) = argmin E[S(X, x)]. Elicitability is what makes backtesting possible — a regulator can score forecasts.

MeasureElicitable?
Mean (expected value)YES (squared loss)
MedianYES (absolute loss)
Quantile / VaRYES (asymmetric piecewise linear, “pinball loss”)
VarianceYES jointly with mean
ES / CVaRNO (Gneiting 2011) — alone
(VaR, ES) jointlyYES (Fissler-Ziegel 2016) — jointly with VaR
Mean + varianceYES jointly

The Gneiting 2011 result was a shock: ES is not elicitable alone, which means you cannot backtest ES alone with a scoring rule. Fissler-Ziegel 2016 rescued this: (VaR, ES) is jointly elicitable. Acerbi-Szekely 2014 independently proposed three direct ES backtests (now in Basel III FRTB).

The Basel III FRTB ES backtesting framework uses these — banks must backtest both VaR (at 97.5%) and ES (at 97.5%) jointly.

5. The risk-measure cheat sheet

MeasureFormulaCoherentElicitableTail-sensitiveComputational cost
VarianceE[(X - μ)²]noyes (w/ mean)notrivial
Std dev (volatility)√variancenowith meannotrivial
Semi-varianceE[((X - μ)⁻)²]nounclearpartialtrivial
MADE[|X - μ|]nono (with median, yes)notrivial
VaR_α-inf{x : P(X ≤ x) ≥ 1-α}noyesyesmedium
CVaR_α / ES_αE[-X | X ≤ -VaR_α]yesjointly w/ VaRyesmedium
Spectral risk-∫_0^1 q_u(X) φ(u) du with φ ≥ 0 ↘yesdependstunablemedium
Distortion risk-∫_0^1 q_u(X) dg(u) with g concaveyesdependstunablemedium
Entropic risk(1/γ) log E[exp(-γX)]no (convex only)unclearyestrivial
Worst-case CVaRsup_Q CVaR^Qyesnoyeshigh (robust opt)
Maximum drawdownmax_t (cummax X_s - X_t)nono (path-dep)yestrivial in batch
Conditional drawdownmean drawdown beyond percentileyes (Chekhlov)unclearyesmedium
Pain indexmeandrawdown|nounclearyes
Ulcer index√mean(drawdown²)nounclearyestrivial

6. Performance-and-risk ratios

RatioNumeratorDenominatorSensitivity
Sharpeexcess returnstd devdispersion
Sortinoexcess returndownside std devdownside dispersion
Treynorexcess returnbetasystematic risk
Jensen alphaactual - CAPM expectedn/a (absolute)factor-mispricing
M-squaredleveraged Sharpe at market volstd devmarket-equivalent return
Information Ratio (IR)active returntracking erroractive management
Calmarannualized returnmax drawdowntail dispersion
MARcompound annual growthmax drawdownhedge-fund standard
Sterlingreturnaverage top-N drawdownstail (smoothed)
Burkereturnsqrt sum of drawdown²tail (smoothed)
Pain ratioreturnpain indexpath-dep
Ulcer Performance Index (UPI)excess returnulcer indexpath-dep
K-ratioslope of cumulative log-return vs time / std errorlinearity of equity curvevolatility of slope
Omega∫(1-F(x))dx above threshold / ∫F(x)dx belowdistributionalfull distribution
Kappa-nexcess return / lower partial momentdownsidedownside (general n)
Modified SharpeSharpe w/ Cornish-Fisher VaRparametric quantileskew + kurtosis
Probabilistic Sharpe Ratio (Bailey-López de Prado 2012)accounts for skew/kurt + sample sizedispersion w/ statsuncertainty in Sharpe
WhenUse
Long-only equity, sample period > 5ySharpe
Asymmetric return (options, hedge funds w/ short vol)Sortino, Calmar, MAR
Active manager vs benchmarkInformation Ratio
Tracking strategy w/ drawdown disciplineCalmar, MAR
Time-varying or vol-targeting fundUPI, Pain
Smoothness of equity curveK-ratio
Survivorship + small samplesProbabilistic Sharpe, Deflated Sharpe
Theoretical evaluation of distributionOmega, Kappa

7. VaR vs ES — the practical comparison

PropertyVaR_αES_α
Definitionquantile at level αaverage loss conditional on > VaR
CoherentNOYES
ElicitableYESNO (alone); YES jointly with VaR
Reports a numberyesyes
Captures tail beyond thresholdNOYES
Basel III status (post-FRTB)replaced by ES at 97.5% (in IMA)the new standard
Solvency II (insurance)VaR 99.5% over 1Yconsidered ES; III may adopt
Common quantile95%, 99%, 99.5%, 99.9%97.5% in FRTB
1-day vs 10-day VaRscale by √10 (Gaussian assumption — sometimes wrong)similar
Backtestingbinary breach test (Kupiec, Christoffersen)Acerbi-Szekely 2014, Fissler-Ziegel 2016

The 2017 Basel III FRTB switch from VaR to ES at 97.5% was the single biggest regulatory risk-measure change in 20 years. The implementation date was repeatedly delayed; phased adoption is happening 2024–2028 across jurisdictions.

8. Three ways to compute VaR / ES

MethodComputeAssumptionsStrengthsWeaknesses
Parametric (variance-covariance, Riskmetrics 1996)-μ + σ z_αNormal/Student-t returnstrivial, analytic Greekstail thin (Normal), fragile under regime shift
Historical simulationempirical quantile of N-day returnsdata IID + stationarymodel-free, captures fat tailswindow-dependent, doesn’t extrapolate, ages out
Monte Carlosimulate from model + take quantilemodel-dependentflexible, handles nonlinear (options)expensive, model risk
Filtered historical (Barone-Adesi-Engle 2008)historical residuals scaled by GARCH volGARCH + filteradapts to volatility regimeGARCH misspec
Extreme Value Theory (EVT)GPD / GEV fit to tailtail follows EVTrigorous tail extrapolationrequires careful threshold selection
Copula-based (Sklar + Gaussian/Student-t/Archimedean)marginals + dependence structurechoice of copulaflexible dependencecopula misspecification

9. Drawdown measures — path-dependent risk

MeasureDefinitionUse
Max drawdown (MDD)max_t (peak - X_t) / peakhedge fund reporting; CTA
Time under waterlongest duration X_t < cummaxinvestor-experience proxy
Conditional Drawdown at Risk (CDaR, Chekhlov-Uryasev-Zabarankin 2005)average of worst-α drawdownscoherent path measure
Pain indexmean ofdrawdown
Ulcer index (Martin 1987)√mean drawdown²strategy comparison
Sterlingn-period average drawdownhedge fund (legacy)
Burkesqrt sum drawdown²hedge fund (legacy)
Drawdown ratioreturn / MDDCalmar / MAR variants

Drawdown is the investor-experience measure — drawdowns trigger redemptions, fire sales, career-ending mistakes. Calmar / MAR ratios are standard in CTA / hedge-fund reporting; institutional allocators read drawdowns before Sharpe.

10. Factor + systematic risk

MeasureFormulaUse
Beta (CAPM)cov(R, R_m) / var(R_m)systematic risk per unit of market
Downside betabeta conditional on R_m < thresholdbear-market-only beta
Idiosyncratic volresidual std dev of CAPMactive-management diversifiable
Tracking error (TE)std dev of (R - R_benchmark)active deviation
Active share (Cremers-Petajisto 2009)half ofw - w_benchmark
Factor exposures (Fama-French / Carhart / HXZ)β to factorsfactor decomposition
Risk contributionw_i × ∂σ_p / ∂w_iper-position risk allocation
Marginal VaR / component VaR∂VaR_p / ∂w_i × w_iper-position VaR allocation
Conditional risk attribution (Tasche 2002, Litterman)similar to MVaR but for ESper-position ES allocation

11. Regulatory frameworks

RegimeStandard measureNotes
Basel III (banks, market risk pre-2024)VaR 99% 10-dayscaling factor of 3 (or 4 for poor backtesting)
Basel III FRTB (banks, market risk post-2024 phased)ES 97.5%Standardized Approach (SA) or Internal Models (IMA)
Basel III creditvarious; PD × LGD × EAD framework + stressRisk-Weighted Assets (RWA)
CCAR / DFAST (US Fed, banks)Severely-adverse scenario stresstop-down vs bottom-up
EU Solvency II (insurance, 2016+)VaR 99.5% 1-yearStandard Formula or Internal Model; ORSA
EU Solvency III (consultation 2024+)considering ESnot yet adopted
ICS (Insurance Capital Standard, IAIS)VaR 99.5% 1-yearglobal insurance
IFRS 17 (insurance reporting)risk adjustment for non-financial riskactuarial CoC + percentile + cost-of-capital
SEC Form PF (private funds)concentration + leverage + liquidityquarterly / annual report
AIFMD (EU alternatives)leverage + concentrationreporting + manager remuneration
MiFID II / MIFIRbest execution + reportingnot a risk-measure regime
EMIR (EU OTC)central clearing + reportingcounterparty risk
Dodd-Frank Title VII (US OTC)central clearing + reportingcounterparty risk
CECL / IFRS 9 (credit, accounting)lifetime expected lossreplaces incurred-loss model

The Basel III FRTB regime is the most material change to bank market-risk capital in 30 years. ES at 97.5% (one-tailed = approximately 99% one-tailed of VaR), liquidity horizons per risk factor, P&L attribution test (fail = mandatory standardized approach), backtesting of both VaR and ES via Acerbi-Szekely.

12. Insurance + actuarial — separate world

MeasureUse
Probable Maximum Loss (PML)catastrophe insurance, single-event
1-in-N return periodcatastrophe (1-in-200, 1-in-250, 1-in-1000)
ULAE / ALAEUnallocated / Allocated Loss Adjustment Expenses
Loss ratiolosses / earned premium
Combined ratiolosses + expenses / earned premium
Reserve riskrunoff uncertainty of claim reserves
Risk margin / risk adjustmentIFRS 17 CoC or quantile-based
Tail VaR / TVaRsame as ES; common in actuarial
Required capital (SCR)Solvency II at 99.5% 1Y
MCR (Minimum Capital Requirement)Solvency II floor

See insurance-and-actuarial for the full insurance stack.

13. Decision tree — pick a risk measure

What's the regime?
├─ Bank market risk (regulated)
│    → Basel III FRTB: ES 97.5% per liquidity horizon
│    → Backtest via Acerbi-Szekely (+ joint VaR-ES Fissler-Ziegel)
│    → SA fallback if IMA fails P&L attribution test
├─ Bank credit risk (regulated)
│    → Basel III: PD × LGD × EAD framework + stress
│    → IFRS 9 / CECL: lifetime expected loss
├─ Insurance / actuarial (regulated)
│    → Solvency II / ICS: VaR 99.5% over 1 year
│    → IFRS 17 risk adjustment: CoC or quantile
├─ Hedge fund / CTA reporting
│    → Max drawdown, Calmar, MAR (track record-friendly)
│    → Sharpe, Sortino, Information Ratio
│    → Pain / Ulcer for smoothness
├─ Long-only mutual fund
│    → Sharpe, IR vs benchmark, tracking error, active share
│    → Volatility for ICR / SDR labeling
├─ Pension fund / endowment
│    → ALM-aware: liability-relative; sometimes 1-in-20 / 1-in-100 stress
│    → Surplus volatility, funding-ratio drawdown
├─ Retail / private wealth
│    → Max drawdown + Calmar (intuitive)
│    → Sharpe (familiar)
│    → Capital protection probability (P(loss > 10%))
├─ Quant strategy backtest
│    → Sharpe + Calmar + MDD + TUW + DDR
│    → Probabilistic Sharpe (small sample bias-aware)
│    → Deflated Sharpe (multiple-testing-aware, Bailey-López de Prado 2014)
├─ Market making / dealer book
│    → VaR + ES + xVA aggregated
│    → Stressed VaR (Basel III stressed scenarios)
│    → Liquidity-adjusted VaR (LVaR)
├─ Options portfolio
│    → ES + Greek Greeks (delta-gamma-vega VaR)
│    → Stress scenarios (1987-like, 2020-like, 2022-like)
└─ Catastrophe / climate risk
     → PML at 1-in-N return period
     → Per-peril ELT (event loss table)

14. Anti-patterns

  1. Reporting VaR without ES or scenarios — VaR is not coherent and gives no tail info.
  2. Backtesting ES alone (without joint VaR) — Gneiting non-elicitability; use Acerbi-Szekely or Fissler-Ziegel joint tests.
  3. Scaling 1-day VaR to 10-day by √10 unconditionally — assumes Normal + IID; can be wrong by 50% in fat-tailed regimes.
  4. Parametric VaR for options book — non-linear payoffs need full revaluation, not delta-gamma approximation past mild moves.
  5. Sharpe ratio for short-vol or insurance strategy — left-skewed returns make Sharpe deceptive; use Calmar / Sortino.
  6. Maximum drawdown from monthly data when daily is available — underestimates MDD by ~30-50%.
  7. Reporting ES without specifying “above VaR” vs “above zero” — conventions vary.
  8. Comparing Sharpe across strategies without horizon adjustment — annualization assumption matters.
  9. CDaR / drawdown-based risk for high-frequency strategies — drawdowns occur within seconds and may be irrelevant to investor experience.
  10. Using historical VaR with 1-year window after regime change — window is too short to reflect new regime, too long to forget the old.

15. The 2024–2026 frontier

  • Basel III FRTB fully phased in by 2026 — banks must run ES daily; P&L attribution tests; standardized fallback. Material capital impact (10-30% RWA increase in many cases).
  • Solvency III consultation (EIOPA 2024) — possible ES adoption, climate risk extension.
  • EU Sustainable Finance Disclosure Regulation (SFDR) + EU Taxonomy — ESG + climate VaR-style reporting.
  • Climate VaR — physical + transition risk integrated into stress; PRA SS3/19 (UK) and ECB climate stress tests.
  • Liquidity-adjusted VaR / ES — Basel III FRTB liquidity horizons (10 days for FX, 60+ for credit illiquid).
  • Acerbi-Szekely 2014 ES backtests — three direct ES tests now standard in regulatory filings.
  • Differentiable VaR / ES (Hong-Hu-Liu 2014, Glasserman gradients) — JAX/PyTorch back-prop through MC; gradient-aware capital optimization.
  • Conformal prediction for VaR (Bates-Angelopoulos-Vovk frontier) — distribution-free finite-sample coverage for VaR estimates.
  • Risk parity + tail risk parity (Spitznagel-Universa, Roncalli AQR + Edhec) — TR parity allocates to tail-equivalent risk units.
  • Climate stress under NGFS scenarios — 2°C, 1.5°C, disorderly transition.
  • AI-driven risk attribution — transformer-based scenario generation, agent-based market simulation.

Adjacent

When to pick what

The fastest narrowing: regulated bank → Basel III FRTB ES at 97.5%; regulated insurer → Solvency II VaR 99.5% (or ES if III adopts); hedge fund / CTA → Sharpe + Calmar + MDD + MAR; long-only fund → Sharpe + IR + tracking error; endowment → ALM-aware surplus risk + drawdown; options book → ES + scenario stress; catastrophe → PML at 1-in-200 / 1-in-1000. The single biggest practical lesson from the 1998 LTCM, 2008 GFC, 2020 COVID, and 2022 yen-carry episodes is VaR is insufficient — backstop with stress scenarios + ES + liquidity-adjusted measures. Modern risk reporting layers: headline measure (ES or VaR per regime), stress scenarios (historical + hypothetical), liquidity horizon adjustment, path-dependent drawdown view, and factor decomposition showing which factors drive the risk.