Crypto Microstructure Research

R001: Cross-Exchange Order Book Asymmetry (2017-2023)

Binance Coinbase Kraken OkEX Bitstamp OB Depth Ratio · 137 seconds · Monthly first-day snapshots

Key Finding: Binance and Coinbase show opposite asymmetry directions. Binance: bid/ask ratio = 0.944 (sell-side pressure, trending ↘), Coinbase: ratio = 1.465 (buy-side pressure, trending ↗). This may reflect different user bases: Asian retail (Binance) vs. US institutional (Coinbase).

R007: Order Book Resilience — Recovery Speed After Large Shocks

Binance Coinbase Kraken Shock Detection · 59 seconds

Key Finding: Kraken recovers fastest from liquidity shocks (avg 2 snapshots to 90% recovery). Coinbase shows highest depth volatility. Binance is the most stable in absolute terms due to massive depth.

R016: Market Maker Behavior — Quote Update Patterns

Binance Coinbase Kraken Update Frequency · 48 seconds

Key Finding: Coinbase has 2x the update frequency of Binance/Kraken (0.04 Hz vs 0.02 Hz). But Binance has highest spread stability (72% of snapshots unchanged) vs. Coinbase (2%) — suggesting Binance MMs use wider but more stable quotes, while Coinbase MMs actively adjust.

R010: Lead-Lag Analysis — Who Discovers Price First?

Binance Coinbase Kraken Bitstamp Hasbrouck InfoShare · Cross-correlation at multiple lags

Surprising Finding: Bitstamp leads all exchanges by 2-5 snapshots, despite being much smaller by volume. Score: Bitstamp (0.42) > Kraken (0.30) > Binance (0.13) > Coinbase (0.00). This suggests institutional/OTC flow may route through smaller, more regulated exchanges first.

R013: Order Flow Imbalance (OFI) — Predictive Power

OFI → Price Regression · Cont et al. (2014) methodology

Key Finding: Kraken's OFI has the strongest predictive power for next-period returns (β=0.00017, R²=1.5%, p<0.001). Binance significant but weaker. Coinbase marginally insignificant (p=0.058). Consistent with R010: Kraken/Bitstamp contain more informed order flow.

R011: Futures-Spot Basis — Perpetual Premium Dynamics

Binance Spot Binance Futures Basis Analysis · 190 perpetual pairs · 2021-2023

Key Finding: The futures-spot basis is a market sentiment thermometer. April 2021 peak: 823 bps (annualized 99%) — extreme bull euphoria. Early 2021 baseline: 5-10 bps. Futures depth/spot depth ratio swings from 0.37 to 6.4×.

R006: Crypto-Equity Correlation (WRDS CRSP)

WRDS Rolling 60-day Correlation · COIN, MSTR, MARA, RIOT vs SPY, QQQ, GLD, TLT · 2021-2023

Key Finding: Crypto stocks (COIN, MSTR, MARA, RIOT) have 0.49-0.55 correlation with QQQ but only 0.06-0.09 with GLD and ~0 with TLT. Crypto is a tech-beta asset, not a safe haven. MARA trades 23.5M shares/day — 2x COIN's volume despite 1/10th the market cap.

GPU02: Optimal Portfolio via Ledoit-Wolf Shrinkage

V100 GPU WRDS Covariance Shrinkage · 12 assets · 683 trading days

Key Finding: Shrinkage intensity = 0.007 (very low — sufficient data). Min-variance portfolio: Return 0.2%, Vol 10.6%, Sharpe 0.02. Critical correlation: QQQ↔SPY: 0.946, COIN↔MSTR: 0.746, MARA↔MSTR: 0.767. Crypto stocks are highly homogeneous — shorting one while holding another provides no diversification.

	SPY	QQQ	GLD	TLT
COIN	0.54	0.59	0.09	0.02
MSTR	0.55	0.60	0.10	0.01
MARA	0.48	0.52	0.05	0.02
RIOT	0.51	0.56	0.09	0.03

R009: Intraday Volume Periodicity — FFT Harmonic Decomposition

FFT Analysis · 4 exchanges · Multiple months

Key Finding: Dominant periods: 4.8h, 24h, 3h. Peak hours differ by exchange: Binance UTC 10 (Asia afternoon), Kraken/OkEX UTC 16 (EU-US overlap). The 4.8h cycle suggests ~5 trading sessions per day, likely aligned with global timezone handoffs.

R012: Multi-Scale Realized Volatility

Realized Volatility · 10/50/100/200-step windows

Key Finding: Volatility scales roughly as √T across all exchanges, consistent with geometric Brownian motion at short horizons. Binance 10-step RV: 1.4%, 200-step: 12.4%. Cross-exchange RV highly similar (<10% difference), confirming strong market integration.

GPU01: Price Pattern Autoencoder — Market Regime Discovery

V100 GPU 0 Conv1D Autoencoder + K-Means · 10,000 windows · 16-dim latent space · 10 seconds

Key Finding: 5 distinct market regimes discovered unsupervised: Bullish (44% of windows), Bearish (43%), Mean-reverting (13%). The autoencoder's latent space cleanly separates trend direction, suggesting price dynamics are dominated by momentum rather than mean-reversion at the 100-snapshot scale.

GPU03: Transformer Price Predictor — Cross-Exchange Attention

V100 GPU 0 Transformer Encoder · 156K params · 4 exchanges × 64 steps

Key Finding: Accuracy: 50.68% (0.68% edge over random). Attention ratio: recent/distant = 1.23 — the model learns mild recency bias but no strong pattern. Implication: Simple direction prediction from cross-exchange snapshots is near-random, consistent with weak-form efficiency. Need richer features (OFI, depth profiles) for alpha.

Data Sources & Infrastructure

Data: Kaiko consolidated order book (L2, 10-level snapshots, 2017-2023), WRDS CRSP daily stock file, Binance Futures perpetual OB.
Compute: Cornell research3 (112 cores, Kaiko data), Cornell research1 (88 cores, 2×V100 32GB, TAQ/WRDS), Cornell BioHPC (660 cores, GROMACS MD simulations).
Software: Python 3.12, PyTorch 2.5, pandas, scipy, matplotlib.
Methods: OB depth ratio, CCDF/power-law, Kyle's Lambda regression, Hasbrouck cross-correlation, OFI (Cont et al. 2014), FFT harmonic decomposition, Ledoit-Wolf shrinkage, Conv1D autoencoder, Transformer encoder.

🔬 BTC Market Microstructure

📊 Order Book Structure