ongoing research project · deterministic synthetic data

Sector-neutral pairs trading, end to end.

An interactive lab built from the literature: cointegration via Engle-Granger, hedge ratios from rolling OLS and a Kalman filter, walk-forward backtests with realistic frictions, and risk-parity sizing across pairs. Bring no data — six pairs come pre-loaded.

cointegration vector

y_t - \beta x_t \sim I(0)

z-score signal

z_t = (S_t - \mu_w) / \sigma_w

half-life (OU)

t_{1/2} = \ln 2 / \theta
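
A minimal sketch of how this half-life could be estimated from a sampled spread. It assumes the discretized OU process is an AR(1), S[t+1] = c + φ S[t] + noise, so that θ = −ln φ per sampling step. The function name `ou_half_life` is illustrative and is not taken from the Lab.

```python
import math

def ou_half_life(spread):
    """Half-life ln(2)/theta, in sampling steps, from an AR(1) fit.

    Assumes the spread is a discretely sampled OU process, so that
    S[t+1] = c + phi * S[t] + noise with phi = exp(-theta).
    """
    x, y = spread[:-1], spread[1:]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    phi = sxy / sxx                      # OLS slope of S[t+1] on S[t]
    if not 0.0 < phi < 1.0:
        return math.inf                  # no measurable mean reversion
    theta = -math.log(phi)
    return math.log(2) / theta

# A noiseless spread that halves every step has a half-life of one step.
half_life = ou_half_life([0.5 ** t for t in range(20)])
```

With daily data the result is in trading days; a non-finite value signals that the fitted φ falls outside (0, 1).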

live preview

MEGA/WHEN · Banks

z-score · spread

β 1.017 · ADF τ −7.96 · p ≈ 0.005

Live render of the MEGA-WHEN synthetic spread. Bands at ±2σ, hedge ratio from full-sample OLS, z-score on a 60-day rolling window.
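
The rolling z-score described here can be sketched as follows. This is an illustration under stated assumptions rather than the Lab's code: the window includes the current observation, σ_w is the population standard deviation, and the names `rolling_zscore` and `outside_bands` are hypothetical.

```python
from statistics import mean, pstdev

def rolling_zscore(spread, window=60):
    """z_t = (S_t - mu_w) / sigma_w over a trailing window ending at t.

    Entries before the first full window are None.
    """
    z = [None] * len(spread)
    for t in range(window - 1, len(spread)):
        win = spread[t - window + 1 : t + 1]
        mu, sigma = mean(win), pstdev(win)
        z[t] = (spread[t] - mu) / sigma if sigma > 0 else 0.0
    return z

def outside_bands(z, band=2.0):
    """Flag observations whose z-score lies beyond the +/- band."""
    return [zt is not None and abs(zt) > band for zt in z]

# Tiny example with a 3-observation window.
z = rolling_zscore([1.0, 2.0, 3.0, 4.0, 5.0], window=3)
```

Excluding the current observation from the window, or using the sample standard deviation, are equally defensible conventions and shift the signal slightly.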

01 / pipeline

From a price universe to a risk-managed book.

Six stages, each with its own page. The Lab walks the same pipeline a desk would: filter, fit, forecast, size, execute.

02 / what's inside

Six modules, one mental model.

interactive

Pair Lab

Pick a pair, watch OLS and Kalman fight over the hedge ratio, see the spread, z-score, rolling-ADF and Bertram-band overlay live.

diagnostics

Methods

ADF + Engle-Granger + Johansen, KPSS (opposite null), Variance Ratio, Hurst, CUSUM — every test a desk runs before trusting a pair, side-by-side.

comparison

Strategies

Cointegration vs Distance method (Gatev-Goetzmann-Rouwenhorst) vs OU s-score (Avellaneda-Lee). Same pair, three trading rules, one chart.

tunable

Backtest Studio

Walk-forward, every parameter tunable, costs baked into Sharpe, bootstrap CI, equity vs buy-and-hold, trade heatmap by entry-z × holding period.

risk

Risk Lab

VaR / CVaR, Ulcer, Pain, Sterling, information ratio, return-distribution moments, stationary-bootstrap Sharpe CI on every pair.

ensemble

Portfolio

Risk-parity sizing, β-to-market, Amihud liquidity + KPSS screens, capacity proxy and a correlation-of-pairs heatmap.

math

Theory

23 sections: Engle-Granger, Johansen/VECM, ADF, KPSS, VR, Hurst, CUSUM, Kalman, OU half-life, Bertram bands, distance, s-score, risk metrics, bootstrap.

reference

Glossary

Plain-English definitions for every term used elsewhere in the Lab.

deterministic

Demo data

Twelve synthetic pairs across twelve sectors — strong, moderate, weak, slow, volatile, structurally broken, independent — each with its own seeded RNG.

03 / built on the literature

The papers behind every module.

Cointegration

Engle & Granger (1987)

Two-step procedure: estimate the long-run relationship, then test residuals for a unit root. Foundation of the entire pairs framework.
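
The two steps can be sketched in a few lines. This is a simplified illustration, not the Lab's implementation: step 2 uses a plain Dickey-Fuller regression with no lag augmentation, the data are a made-up synthetic pair, and the function names are hypothetical.

```python
import math
import random

def ols(x, y):
    """Return (intercept, slope) of the regression y = a + b * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    b = (sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
         / sum((xi - mx) ** 2 for xi in x))
    return my - b * mx, b

def engle_granger(y, x):
    """Return (hedge ratio beta, Dickey-Fuller tau on the residuals)."""
    # Step 1: long-run relationship y = a + beta * x + u.
    a, beta = ols(x, y)
    resid = [yi - a - beta * xi for xi, yi in zip(x, y)]
    # Step 2: unit-root test on u via  du_t = rho * u_{t-1} + e_t.
    lag = resid[:-1]
    diff = [u1 - u0 for u0, u1 in zip(resid[:-1], resid[1:])]
    sxx = sum(u * u for u in lag)
    rho = sum(u * d for u, d in zip(lag, diff)) / sxx
    sse = sum((d - rho * u) ** 2 for u, d in zip(lag, diff))
    se = math.sqrt(sse / (len(diff) - 1) / sxx)
    return beta, rho / se

# Synthetic demo: x is a random walk, y = 2x + stationary noise.
rng = random.Random(0)
x, level = [], 0.0
for _ in range(500):
    level += rng.gauss(0.0, 1.0)
    x.append(level)
y = [2.0 * xi + rng.gauss(0.0, 1.0) for xi in x]
beta, tau = engle_granger(y, x)
```

Because the residuals come from a fitted regression, tau must be compared with cointegration critical values rather than standard ADF ones; for two variables with a constant the asymptotic 5% value is roughly −3.34, and finite-sample thresholds differ.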

Critical values

MacKinnon (1996, 2010)

Finite-sample response surfaces for ADF and cointegration test critical values. Used directly to compute T-adjusted thresholds in this app.

Distance + cointegration

Gatev, Goetzmann & Rouwenhorst (2006)

Empirical demonstration that distance-matched pairs earned annualized excess returns of up to about 11% over 1962–2002. The seminal benchmark.

Time-varying β

Elliott, van der Hoek & Malcolm (2005)

Mean-reverting Gaussian state-space model — the textbook motivation for using a Kalman filter to estimate hedge ratios.
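
As an illustration of the filtering idea, the sketch below tracks a time-varying hedge ratio with a scalar Kalman filter. It is a deliberately simpler model than the one in Elliott et al.: β is assumed to follow a random walk and is observed through y_t = β_t x_t + noise. The noise variances `q` and `r`, the priors, and the function name are illustrative assumptions.

```python
def kalman_beta(y, x, q=1e-5, r=1e-2, beta0=0.0, p0=1.0):
    """Filtered hedge-ratio path for y_t = beta_t * x_t + noise.

    q is the state (random-walk) variance, r the observation variance.
    """
    beta, p = beta0, p0
    path = []
    for yt, xt in zip(y, x):
        p += q                          # predict: beta drifts as a random walk
        innovation = yt - beta * xt     # one-step-ahead forecast error
        s = xt * xt * p + r             # innovation variance
        gain = p * xt / s               # Kalman gain
        beta += gain * innovation       # update the state estimate
        p *= 1.0 - gain * xt            # update the state variance
        path.append(beta)
    return path

# Noise-free check: y is exactly 1.5 * x, so the filter should settle at 1.5.
x = [float(t) for t in range(1, 51)]
y = [1.5 * xt for xt in x]
betas = kalman_beta(y, x)
```

A larger `q` relative to `r` lets β adapt faster at the cost of a noisier hedge ratio; rolling OLS makes the same trade-off through its window length.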

Statistical arbitrage

Avellaneda & Lee (2010)

PCA / sector-residual approach with the s-score signal. Maps neatly onto the sector-neutral framing used here.

Decline & costs

Do & Faff (2010, 2012)

Explicit decomposition showing how transaction costs gradually erode pairs profitability post-2002. Why the Backtest Studio takes costs seriously.

OU thresholds

Bertram (2010)

Analytic optimal entry/exit bands for an Ornstein-Uhlenbeck spread under fixed costs. Reference for the band-suggestion overlay.

Risk parity

Maillard, Roncalli & Teiletche (2010)

Equal-risk-contribution portfolio construction. Used in Portfolio to size pairs by risk rather than capital.
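
A hedged sketch of equal-risk-contribution weights, using cyclical coordinate descent on the log-barrier formulation of the problem. The solver choice, the function names, and the example covariance matrix are assumptions for illustration; the source does not say how the Portfolio module solves it.

```python
import math

def erc_weights(cov, iters=200):
    """Long-only weights whose risk contributions w_i * (cov @ w)_i are equal."""
    n = len(cov)
    w = [1.0 / n] * n
    c = 1.0 / n                         # target for each w_i * (cov @ w)_i
    for _ in range(iters):
        for i in range(n):
            b = sum(cov[i][j] * w[j] for j in range(n) if j != i)
            # Positive root of cov_ii * w_i^2 + b * w_i - c = 0.
            w[i] = (-b + math.sqrt(b * b + 4.0 * cov[i][i] * c)) / (2.0 * cov[i][i])
    total = sum(w)
    return [wi / total for wi in w]     # rescale to sum to one

def risk_contributions(cov, w):
    """Per-asset contribution w_i * (cov @ w)_i to portfolio variance."""
    n = len(w)
    return [w[i] * sum(cov[i][j] * w[j] for j in range(n)) for i in range(n)]

# Hypothetical covariance matrix of three pair-level return streams.
cov = [[0.04, 0.006, 0.0],
       [0.006, 0.09, 0.01],
       [0.0, 0.01, 0.01]]
weights = erc_weights(cov)
contribs = risk_contributions(cov, weights)
```

When all pairwise correlations are equal, the solution reduces to weights proportional to 1/σ_i, which is a convenient sanity check.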

Liquidity

Amihud (2002)

Daily price-impact-per-dollar-of-volume measure. Used here to filter out pairs whose worse leg is too thin to trade.
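
The measure itself is a one-liner: the average of |return| divided by dollar volume. The sketch below also shows a worst-leg screen in the spirit described above; the threshold value, the sample numbers, and the function names are made-up illustrations, not the Lab's settings.

```python
def amihud_illiq(returns, dollar_volume):
    """Mean of |r_t| / dollar volume_t over days with positive volume."""
    ratios = [abs(r) / v for r, v in zip(returns, dollar_volume) if v > 0]
    return sum(ratios) / len(ratios)

def pair_is_tradable(illiq_a, illiq_b, max_illiq):
    """Screen on the worse (less liquid) leg of the pair."""
    return max(illiq_a, illiq_b) <= max_illiq

# Hypothetical two-day samples for each leg.
leg_a = amihud_illiq([0.01, -0.02], [1e6, 2e6])
leg_b = amihud_illiq([0.03, -0.01], [1e5, 1e5])
ok = pair_is_tradable(leg_a, leg_b, max_illiq=5e-8)
```

Higher values mean a larger price move per dollar traded, so the screen rejects a pair when either leg exceeds the threshold.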

Multivariate cointegration

Johansen (1988, 1991)

Trace and max-eigenvalue tests estimate the rank of the cointegration space, that is, how many independent cointegrating relations exist. The Methods page runs the two-variable form against Osterwald-Lenum critical values.

Stationarity null

Kwiatkowski-Phillips-Schmidt-Shin (1992)

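Reverses the ADF null: KPSS takes stationarity as the null hypothesis, where ADF takes a unit root. A pair worth trading rejects the ADF unit-root null and fails to reject the KPSS stationarity null; the Lab requires both.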

Random walk null

Lo & MacKinlay (1988)

Variance ratio test with heteroskedasticity-robust z. The Methods page reports VR(q) at six horizons from 2 to 64.
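
A sketch of VR(q) with the heteroskedasticity-robust statistic, written from the Lo-MacKinlay formulas with overlapping q-period returns. The function name and the demo series are illustrative; the Lab's exact conventions (bias corrections, choice of horizons) are not stated in the source.

```python
import math
import random

def variance_ratio(log_prices, q):
    """Return (VR(q), robust z*) for a series of log prices."""
    n = len(log_prices) - 1                       # number of one-period returns
    r = [log_prices[t + 1] - log_prices[t] for t in range(n)]
    mu = sum(r) / n
    dev2 = [(ri - mu) ** 2 for ri in r]
    ss = sum(dev2)
    var_1 = ss / (n - 1)
    # Overlapping q-period returns with the small-sample bias correction.
    m = q * (n - q + 1) * (1.0 - q / n)
    var_q = sum((log_prices[t] - log_prices[t - q] - q * mu) ** 2
                for t in range(q, n + 1)) / m
    vr = var_q / var_1
    # Heteroskedasticity-consistent variance of sqrt(n) * (VR - 1).
    theta = 0.0
    for j in range(1, q):
        delta_j = n * sum(dev2[t] * dev2[t - j] for t in range(j, n)) / ss ** 2
        theta += (2.0 * (q - j) / q) ** 2 * delta_j
    z_star = math.sqrt(n) * (vr - 1.0) / math.sqrt(theta)
    return vr, z_star

# Demo: i.i.d. levels are strongly mean-reverting, so VR(2) should sit near 0.5.
rng = random.Random(1)
levels = [rng.gauss(0.0, 1.0) for _ in range(2001)]
vr2, z2 = variance_ratio(levels, 2)
```

Under the random-walk null VR(q) is close to 1 and z* is approximately standard normal; values well below 1 point to mean reversion.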

Self-similarity

Hurst (1951), Mandelbrot (1969)

Rescaled-range slope tells you whether a series is mean-reverting (H < 0.5), a random walk (H = 0.5), or trending (H > 0.5).

Structural breaks

Brown, Durbin & Evans (1975)

CUSUM of recursive residuals with Brownian-motion bands. The Lab's primary breakdown-warning system.

Tail risk

Rockafellar & Uryasev (2002)

Conditional VaR / Expected Shortfall — coherent risk measure used throughout the Risk Lab.
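
For a sample of returns, historical VaR and CVaR can be sketched as below. The tail-size convention (rounding (1 − α)·n, with at least one observation) is an assumption, as are the function name and the sample data; quantile conventions vary between implementations.

```python
def var_cvar(returns, alpha=0.95):
    """Historical (VaR, CVaR) at level alpha, both reported as positive losses."""
    losses = sorted((-r for r in returns), reverse=True)    # worst loss first
    n_tail = max(1, round((1.0 - alpha) * len(losses)))
    tail = losses[:n_tail]
    value_at_risk = tail[-1]            # smallest loss inside the tail
    cvar = sum(tail) / n_tail           # average loss inside the tail
    return value_at_risk, cvar

# Hypothetical ten-period return sample, evaluated at the 80% level.
sample = [-0.10, -0.05, 0.01, 0.02, 0.03, 0.00, 0.01, -0.01, 0.02, 0.04]
var_80, cvar_80 = var_cvar(sample, alpha=0.8)
```

CVaR is never smaller than VaR at the same level, because it averages the losses at or beyond the VaR cut-off.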

Bootstrap

Politis & Romano (1994)

Stationary bootstrap with geometric block lengths. Honest standard errors on Sharpe under serial correlation.
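
A minimal sketch of a stationary-bootstrap confidence interval for the Sharpe ratio. The mean block length, the percentile-interval construction, the per-period (non-annualized) Sharpe definition, and all names are assumptions chosen for illustration.

```python
import random
import statistics

def stationary_bootstrap_indices(n, mean_block, rng):
    """Resampled index path with geometrically distributed block lengths."""
    p = 1.0 / mean_block                # probability of starting a new block
    idx = [rng.randrange(n)]
    while len(idx) < n:
        if rng.random() < p:
            idx.append(rng.randrange(n))        # jump to a new random start
        else:
            idx.append((idx[-1] + 1) % n)       # continue the block, wrapping
    return idx

def sharpe(returns):
    """Per-period Sharpe ratio with a zero risk-free rate."""
    return statistics.mean(returns) / statistics.stdev(returns)

def sharpe_ci(returns, mean_block=10, n_boot=1000, level=0.95, seed=0):
    """Percentile confidence interval for the Sharpe ratio."""
    rng = random.Random(seed)
    n = len(returns)
    stats = sorted(
        sharpe([returns[i] for i in stationary_bootstrap_indices(n, mean_block, rng)])
        for _ in range(n_boot)
    )
    lo = stats[int((1.0 - level) / 2.0 * n_boot)]
    hi = stats[int((1.0 + level) / 2.0 * n_boot) - 1]
    return lo, hi

# Synthetic daily returns, roughly one year.
rng = random.Random(42)
daily = [rng.gauss(0.001, 0.01) for _ in range(250)]
lo, hi = sharpe_ci(daily, n_boot=500)
```

Resampling contiguous blocks preserves short-range serial correlation that an i.i.d. bootstrap would destroy, which is why the interval is usually wider and more honest for autocorrelated strategy returns.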

Implementations are educational and audited against textbook results; the synthetic universe is constructed so OLS, ADF, Engle-Granger, OU and Kalman behave as theory predicts.
