Over the last year I have been running a simple experiment that should give me some intuition about how often models need to be retrained. As some of you may have noticed on Discord and the Forum, I retrain all of my models weekly, but I was not sure whether this would “overwrite” really good models trained in the past. Now I have some insights/interpretations/hypotheses that I would like to share with you before I repurpose the experimental model slots for other research.
Experimental setup
Model JOS_XB_LAZY is currently #2 on the model leaderboard in both the MMC20 and CORR20 ranks, and it is retrained every Saturday, when new data become available.
In round 924 (opened on Jan 20, 2025 and resolved on Feb 20, 2025) I uploaded the retrained model JOS_XB_LAZY both to its own slot and to the newly created slot JOS_XB_LAZY_W1, which has remained unchanged and has not been retrained since.
Every following week (rounds 929, 934, 939, …) I retrained the base model and created a slot for the new weekly version (JOS_XB_LAZY_W2, JOS_XB_LAZY_W3, JOS_XB_LAZY_W4, …), until round 979 (resolved on May 8, 2025), when I uploaded the last experimental model, JOS_XB_LAZY_W12 (now repurposed as JOS_XB_LAZY_SH).
After that I continued to retrain the base model weekly and left the weekly versions unchanged.
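The weekly cycle itself is simple. Below is a minimal sketch of one retraining round in Python, assuming numerapi and XGBoost; the dataset paths, feature set, and hyperparameters are illustrative guesses, not the actual JOS_XB_LAZY configuration.

```python
import json

import pandas as pd
import xgboost as xgb
from numerapi import NumerAPI

napi = NumerAPI()  # credentials via NUMERAI_PUBLIC_ID / NUMERAI_SECRET_KEY

# 1. Download the latest data (file paths are an assumption for the V5.0 era).
napi.download_dataset("v5.0/train.parquet", "train.parquet")
napi.download_dataset("v5.0/live.parquet", "live.parquet")
napi.download_dataset("v5.0/features.json", "features.json")

with open("features.json") as f:
    # "medium" feature set is a hypothetical choice
    features = json.load(f)["feature_sets"]["medium"]

train = pd.read_parquet("train.parquet", columns=features + ["target"])

# 2. Retrain the base model from scratch on the full, updated history.
model = xgb.XGBRegressor(
    n_estimators=2000, learning_rate=0.01, max_depth=5, colsample_bytree=0.1
)
model.fit(train[features], train["target"])
model.save_model("jos_xb_lazy_latest.json")  # keep a frozen copy each week

# 3. Predict on live data and upload to the base slot. A frozen weekly slot
#    would instead load its saved model file and never call fit() again.
live = pd.read_parquet("live.parquet", columns=features)
pd.DataFrame(
    {"prediction": model.predict(live[features])}, index=live.index
).to_csv("predictions.csv")

model_ids = napi.get_models()  # maps lowercase model names to slot ids
napi.upload_predictions("predictions.csv", model_id=model_ids["jos_xb_lazy"])
```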
Observations
Here are the result plots of the experimental models compared to the base model:
- All experimental weekly models are highly correlated (a quick way to verify this is sketched after this list).
- The base model diverges from the experimental models around round 1024, about 5 months after the experiment started, when the experimental models fail to pick up a new positive signal in the data.
- Around round 1059 the base model loses a positive signal in the data that is still caught by the experimental models, until round 1079, when they stop delivering positive returns.
- The final decline of the experimental models after round 1154 is probably a result of the introduction of the V5.2 dataset: all experimental models used the V5.0 dataset, while the base model has been retrained on V5.2 since round 1164.
- The repurposed model W12 diverges immediately after its change and its return to weekly retraining.
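For those who want to check the first point on their own models, here is a minimal sketch of the correlation computation, assuming the per-round scores have been exported to a CSV with one column per model slot (the file name and layout are hypothetical):

```python
import pandas as pd

# One row per round, one column per model slot:
# round, JOS_XB_LAZY, JOS_XB_LAZY_W1, ..., JOS_XB_LAZY_W12
scores = pd.read_csv("round_scores.csv", index_col="round")

# Pairwise Pearson correlation of per-round scores across all slots.
corr = scores.corr()
print(corr.round(2))

# Correlation of each frozen weekly model with the retrained base model.
print(corr["JOS_XB_LAZY"].drop("JOS_XB_LAZY").sort_values())
```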
Here are the MMC and CORR results of selected models (a subset, to keep the plots readable):
- The decline in performance is mainly due to poor MMC.
Conclusions
- Having a good model is not enough, because the signal in out-of-sample data changes over time.
- A good model without retraining may deteriorate over time.
- Frequent retraining delivers better results.
- Weekly retraining is probably unnecessary, but monthly retraining seems to be the minimum.
- Retraining on new data is essential.
- Retraining can overwrite a better model, but the superiority of an unretrained model is temporary and can cease after some time. Moreover, it is impossible to predict when an unretrained model will perform better.
Any other observations or conclusions?


