Training on era groups and ensemble models

I’ve been running some experiments with the idea of training on era groups and ensemble models.

Since the training dataset has 120 eras (i.e., 120 sequential months, as far as I understand), I join consecutive eras into groups of 12 and train on each group together.
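
To make the grouping concrete, here’s a minimal sketch, assuming the classic Numerai training CSV with an `era` column labeled "era1" through "era120" (the file name and labels are assumptions on my part):

```python
# Group 120 sequential eras into 10 groups of 12 consecutive eras each.
import pandas as pd

train = pd.read_csv("numerai_training_data.csv")  # assumed file name
eras = sorted(train["era"].unique(), key=lambda e: int(e.lstrip("era")))

era_groups = [eras[i:i + 12] for i in range(0, len(eras), 12)]
group_frames = [train[train["era"].isin(g)] for g in era_groups]
```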

An ensemble model is trained for each era group, and performance is evaluated by predicting on the validation data with each of those models. Finally, the predictions are averaged.
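
Roughly, the loop looks like this; `fit_ensemble` is sketched further down, and `validation`, the `feature` column prefix, and the `target` column are assumed conventions, not the notebook’s exact names:

```python
# Train one ensemble per era group, then average their validation predictions.
import numpy as np

feature_cols = [c for c in train.columns if c.startswith("feature")]
models = [
    fit_ensemble(frame[feature_cols], frame["target"]) for frame in group_frames
]

validation_preds = np.mean(
    [m.predict(validation[feature_cols]) for m in models], axis=0
)
```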

Each ensemble is composed of an XGBoost regressor, a CatBoost regressor, and a LightGBM regressor.
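
One way to wire those three together is scikit-learn’s `VotingRegressor`, which averages the members’ predictions; the hyperparameters below are placeholders, not the notebook’s actual settings:

```python
from xgboost import XGBRegressor
from catboost import CatBoostRegressor
from lightgbm import LGBMRegressor
from sklearn.ensemble import VotingRegressor

def fit_ensemble(X, y):
    # Simple average of the three boosted-tree regressors.
    ensemble = VotingRegressor([
        ("xgb", XGBRegressor(n_estimators=200, max_depth=5)),
        ("cat", CatBoostRegressor(iterations=200, verbose=0)),
        ("lgb", LGBMRegressor(n_estimators=200)),
    ])
    return ensemble.fit(X, y)
```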

The models within each ensemble are optimized separately using 3-split k-fold cross-validation with no shuffling.
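
For example, tuning one member model might look like this, with a made-up parameter grid and `X_group` / `y_group` standing in for one era group’s features and target:

```python
from sklearn.model_selection import GridSearchCV, KFold
from lightgbm import LGBMRegressor

# 3 splits, no shuffling, so folds stay contiguous in time.
cv = KFold(n_splits=3, shuffle=False)
search = GridSearchCV(
    LGBMRegressor(),
    param_grid={"n_estimators": [100, 300], "num_leaves": [31, 63]},
    scoring="neg_mean_squared_error",
    cv=cv,
)
search.fit(X_group, y_group)
best_lgbm = search.best_estimator_
```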

The idea came up after reading some sample scripts and other resources.

Details are published in a Colab notebook.

Any feedback is welcome!

Is ensembling a good idea by default? I can imagine situations in which one of the three models outperforms the ensemble. Maybe you could compare metrics before deciding whether to submit predictions from a single model or from the ensemble?
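
Something along these lines would do it, scoring each member and the ensemble on validation data with Spearman correlation (a Numerai-style metric); `fitted_models`, `X_val`, and `y_val` are hypothetical names:

```python
import numpy as np
from scipy.stats import spearmanr

per_model = {name: m.predict(X_val) for name, m in fitted_models.items()}
for name, preds in per_model.items():
    print(name, spearmanr(y_val, preds).correlation)

ensemble_preds = np.mean(list(per_model.values()), axis=0)
print("ensemble", spearmanr(y_val, ensemble_preds).correlation)
```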

Yes, in my opinion ensembling is a great idea in general. I experimented with training simple, mini dense/linear neural nets on the tournament data and ensembling them all together. The average loss of the individual nets ranged from about 0.3 down to 0.05 depending on the learning rate, but the loss of the mean of all the nets’ predictions was always less than 0.05. The effect is exaggerated and most clearly seen at higher learning rates, where the average loss across the nets was about 0.3 while the loss of the mean of their predictions stayed below 0.05. In case that sounds confusing: the average loss for each net is each net’s own loss, averaged across nets, while the loss of the average of the predictions is the loss of the ensemble’s averaged prediction.
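
A tiny made-up example of that distinction: when two models’ errors point in opposite directions, averaging their predictions cancels the errors, so the loss of the mean can be far below the mean of the losses:

```python
import numpy as np

y_true = np.array([0.5, 0.25, 0.75, 0.5])
preds = np.array([
    [0.9, 0.10, 0.90, 0.10],  # model 1's predictions
    [0.1, 0.50, 0.60, 0.90],  # model 2's predictions, erring the other way
])

mse = lambda p: float(np.mean((p - y_true) ** 2))
print(np.mean([mse(p) for p in preds]))  # mean of losses -> ~0.096
print(mse(preds.mean(axis=0)))           # loss of mean   -> ~0.0006
```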

Now that I think about it some more, isn’t Numerai’s meta model just a more complex ensemble?