Model performance

Metrics are from a chronological walk-forward backtest: each game is predicted before that game's result updates the ratings.

Games
6,560
Brier
0.2383
Log loss
0.6692
Accuracy
58.6%

Out-of-sample backtest

Scored 6,560 games. Model Brier 0.2383 vs coin-flip baseline 0.25 (4.7% better than chance). Accuracy 58.6%.

Selected configuration

Chosen by lowest Brier score from a small grid: K=12, home advantage=35 Elo points, season carryover=0.65.

KHome EloCarryoverBrierLog lossAccuracy
12350.650.23830.669258.6%
16450.70.23860.669858.6%
20550.750.23930.671458.5%
24650.80.24050.673958.5%
28750.850.24190.677258.4%