What follows is an early effort to evaluate risk regimes within Numerai.
Every quantitative strategy shares a structural vulnerability: when a strategy works, participants converge on it. This convergence is rational at the individual level — you train on the target that pays, using the features that predict — but collectively it creates a crowded trade that eventually undermines itself.
In traditional equity markets, crowded factor trades unwind suddenly and violently. The August 2007 quant quake, momentum crashes, and carry trade reversals all follow the same pattern. Capital accumulates gradually on one side of a trade over months, then exits in days when the underlying factor relationships break. The buildup is slow; the unwind is fast.
Numerai’s tournament has the same dynamics but with a specific mechanism. The majority of modelers (as well as other quant strategy trading systems) optimize against a single target — Ender20 — using overlapping feature sets. When those features work well, CORR rises for everyone, new capital enters, and the stake-weighted meta-model becomes increasingly saturated with Ender20-like predictions. The benchmarks still predict accurately (high CORR), but their unique contribution shrinks (low MMC) because the crowd is expressing the same signal. This is the crowding building.
The correction comes when the crowded feature exposures stop working. Too much capital on the same factors causes those factors to mean-revert or lose effectiveness. CORR collapses — not just for the crowded models, but for all models, because the factor environment itself has shifted. Short-term optimizers, who chased the features precisely because they were working, are the most exposed. They entered late, sized up at the peak, and their positions are the most correlated with each other. When they exit, they exit together.
The research below attempts to detect where we are in this cycle using publicly available benchmark performance data. The core insight is that the relationship between CORR and MMC across Numerai’s 8 benchmark models reveals the crowding state. When non-Ender benchmarks earn significantly more MMC than Ender20 at the same CORR level, the crowd has piled into Ender20-like features. When both CORR and MMC trends weaken simultaneously, the unwind may be beginning.
This is not a predictive model — the dataset is too short and the independent observations too few to claim statistical significance. It is a risk management framework: a structured way to think about when to be fully invested and when to protect capital, based on the crowding cycle that is inherent to any tournament where participants can observe and imitate each other’s success.
**The system below classifies the benchmark regime into four phases using two trend signals:** the 5-round moving average minus the 15-round moving average of both CORR20 and MMC20 across all 8 Numerai benchmarks.
Phase is determined by which trends are positive. Both up = Phase 1 (early regime, full stake). CORR up but MMC down = Phase 2 (crowd catching up, reduce). Both down = Phase 3 (drawdown, minimum). CORR down but MMC up = Phase 4 (washout ending, moderate).
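As a rough illustration of the trend signal and phase rule, here is a minimal Python sketch. It assumes each input is a per-round series already averaged across the 8 benchmarks (the post does not say how the benchmarks are aggregated), and it treats a trend of exactly zero as "down". The function names `trend` and `classify_phase` are made up for this example.

```python
from statistics import mean


def trend(series, fast=5, slow=15):
    """Fast moving average minus slow moving average at the latest round."""
    if len(series) < slow:
        raise ValueError("need at least `slow` rounds of history")
    return mean(series[-fast:]) - mean(series[-slow:])


def classify_phase(corr_trend, mmc_trend):
    """Map the signs of the two trends to phases 1-4.

    Assumption: a trend of exactly zero counts as "down".
    """
    corr_up = corr_trend > 0
    mmc_up = mmc_trend > 0
    if corr_up and mmc_up:
        return 1  # both up: early regime, full stake
    if corr_up:
        return 2  # CORR up, MMC down: crowd catching up, reduce
    if not mmc_up:
        return 3  # both down: drawdown, minimum
    return 4      # CORR down, MMC up: washout ending, moderate


# Toy series, purely illustrative (not real benchmark data).
corr20 = [0.01] * 10 + [0.02] * 5   # recent rounds stronger than the 15-round average
mmc20 = [0.02] * 10 + [0.01] * 5    # recent rounds weaker than the 15-round average
phase = classify_phase(trend(corr20), trend(mmc20))
```

With these toy inputs the CORR trend is positive and the MMC trend is negative, so the rule lands in Phase 2.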
Stake tapers with duration. The longer you stay in Phase 1, the more you reduce: from 1,500 NMR down to 500 NMR as the run exceeds its historical median of 7 rounds. In Phase 3, you do the opposite: start at 250 NMR and scale up as the drawdown extends past its median of 12 rounds, positioning for recovery.
One reset trigger. If CORR makes a new 20-round high while in Phase 1, the duration clock resets and stake goes back to 1,500. The regime has genuinely renewed.
One veto. Unresolved round data from the maturity-filtered crossover (days 7-13 vs days 14-19) can block the reset. If the leading signal is weakening relative to the confirmed signal, the breakout may be stale — don’t size up.
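The Phase 1 taper, reset, and veto could be wired together roughly as follows. This is a sketch under several assumptions that are not in the post: the taper is linear, it starts at the median and reaches the floor at 1.5x the median (`taper_end_mult` is hypothetical), a "new 20-round high" means the latest CORR strictly exceeds the previous 19 values, and the veto fires whenever the leading signal (read here as the less mature days 7-13 rounds) is below the confirmed signal (days 14-19). The exact schedule used for the reported results may differ.

```python
def linear_ramp(x, x0, x1, y0, y1):
    """Interpolate from y0 at x0 to y1 at x1, clamped outside [x0, x1]."""
    if x <= x0:
        return y0
    if x >= x1:
        return y1
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


def phase1_stake(rounds_in_phase, corr_history, leading_signal, confirmed_signal,
                 median_rounds=7, full=1500, floor=500, taper_end_mult=1.5):
    """Return (stake, rounds_in_phase) after applying the reset and the veto.

    taper_end_mult is a hypothetical parameter: the multiple of the median
    run length at which the stake reaches its floor.
    """
    # Reset trigger: latest CORR is a new high over the trailing 20 rounds.
    window = corr_history[-20:]
    breakout = len(window) == 20 and window[-1] > max(window[:-1])

    # Veto: unresolved (leading) signal is weaker than the confirmed one.
    veto = leading_signal < confirmed_signal

    if breakout and not veto:
        return full, 0  # regime renewed: clock resets, full stake

    stake = linear_ramp(rounds_in_phase,
                        median_rounds, taper_end_mult * median_rounds,
                        full, floor)
    return stake, rounds_in_phase


# Toy history whose last value is a new 20-round high (illustrative only).
corr_history = [0.01] * 19 + [0.03]
vetoed = phase1_stake(13, corr_history, leading_signal=0.001, confirmed_signal=0.002)
renewed = phase1_stake(13, corr_history, leading_signal=0.003, confirmed_signal=0.002)
```

With these assumed defaults, a Phase 1 run at round 13 with a breakout but an active veto stays on the taper and comes out at the 500 floor, while the same breakout without the veto resets to 1,500. The Phase 3 rule would be the mirror image: the same ramp starting from 250 at the 12-round median and rising to a cap the post does not specify.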
Result over 318 rounds: 20% better capital efficiency than flat staking, worst single round cut in half, same Sharpe ratio at 56% of the average stake. When vol-matched to the same risk budget as flat 1,500, the system produces approximately 1,149 NMR versus 959 flat — 20% more return for the same risk.
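For readers who want to reproduce the vol-matched comparison on their own data, one plausible reading is: compute per-round P&L for both stake series, scale the system's P&L so its per-round volatility equals that of flat staking, then compare totals (1,149 / 959 ≈ 1.20 in the figures above). The sketch below assumes that reading; the helper names and the toy numbers are invented for illustration and are not the data behind the reported results.

```python
from statistics import pstdev


def pnl(stakes, returns):
    """Per-round P&L in NMR: stake times that round's return per NMR staked."""
    return [s * r for s, r in zip(stakes, returns)]


def vol_matched_total(system_pnl, flat_pnl):
    """Total system P&L after rescaling to the flat baseline's volatility.

    Assumes system_pnl is not constant (its standard deviation is non-zero).
    """
    scale = pstdev(flat_pnl) / pstdev(system_pnl)
    return scale * sum(system_pnl)


# Toy inputs, purely illustrative.
returns = [0.02, -0.04, 0.03, 0.01]            # return per NMR staked, per round
flat = pnl([1500] * 4, returns)                # flat 1,500 NMR every round
system = pnl([1500, 500, 1000, 1500], returns) # regime-based stakes

matched_total = vol_matched_total(system, flat)
flat_total = sum(flat)
efficiency_gain = matched_total / flat_total - 1
```

A positive `efficiency_gain` means the regime-based stakes earned more than flat staking for the same per-round volatility.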
Current reading: Phase 1, day 13, 1.7x median duration. Breakout trigger is firing (CORR at new highs) but the live data veto is active (unresolved crossover weakening). Recommended stake: 500 NMR.






