@pschyska This was a deliberate mandate: we are not providing a v4.3->v5 map because we only want models trained on v5 data. Models trained on v4.x data will, on average, not have stable performance in the long run.
@stochastic_geometry_1 correct, V5 submissions will not receive scores right away. You can rely on validation / diagnostics to check the performance of a model.
I challenge the claim that validation performance is enough to gauge live performance adequately. In my experience, validation results don't correlate strongly with live performance, especially under 0.5Xcorr + 2Xmmc scoring: one of my models, p_tt_rg, has quite poor corr (0.02208/1.2603 live by my calculation, 0.01916/0.92306 on validation), yet sits in the 98.4th percentile for live score due to mmc. I would never have selected that model to deploy; it was a happy accident because I wanted to test something with it. Since you can't optimize for mmc, you are essentially asking us to stake models with close to zero information on how they will do on Sept 27. This sounds like a huge gamble for both parties.
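For clarity, the payout score I keep referring to is just a weighted sum of the two per-round metrics. A minimal sketch of how I compare models on it (pandas; the numbers and the "corr"/"mmc" column names are made-up placeholders, not the official schema):

```python
# Combine per-round corr and mmc into the 0.5*corr + 2*mmc payout score
# and compare models by its mean. All values below are placeholders.
import pandas as pd

rounds = pd.DataFrame({
    "model": ["p_tt_rg", "p_tt_rg", "other_model", "other_model"],
    "corr":  [0.021, 0.023, 0.031, 0.029],
    "mmc":   [0.012, 0.015, 0.001, -0.002],
})

rounds["payout_score"] = 0.5 * rounds["corr"] + 2.0 * rounds["mmc"]
print(rounds.groupby("model")["payout_score"].mean().sort_values(ascending=False))
```

Because mmc is weighted four times as heavily as corr, a model with mediocre corr but high mmc (like p_tt_rg) can easily outrank a model with better corr.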
But if it were true that we could rely on validation, your claim that models trained on V4.x data can't have stable performance doesn't make sense. I showed how one of my V4.3 models (the first one linked) goes from 0.0303/1.4699 to 0.0337/1.8104 on validation. If you have more specific information about that phenomenon, please share it. In my experiments so far, I have yet to see a V4.3 model that does worse on V5.0 validation. For example: did you consider models other than GBDTs? Maybe models that use deep learning, or that don't interpret the features mainly as numerical values, behave differently?
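For reference, this is roughly how I compute those validation numbers for a V4.3-trained model on V5.0 data: per-era Spearman correlation of predictions against the target, then the mean and mean/std ("sharpe"). The dataset path, the "era"/"target" columns, and the predictions file are assumptions from my own setup, and this only approximates the official CORR metric, not the exact scoring code:

```python
import pandas as pd
from numerapi import NumerAPI

# Download the v5.0 validation data (path assumed from the v5 release).
napi = NumerAPI()
napi.download_dataset("v5.0/validation.parquet", "v5_validation.parquet")

val = pd.read_parquet("v5_validation.parquet").dropna(subset=["target"])

# Hypothetical file holding the v4.3 model's predictions, indexed by row id.
preds = pd.read_csv("v43_model_predictions.csv", index_col="id")["prediction"]
val = val.join(preds, how="inner")

# Per-era Spearman correlation, then summary stats.
per_era = val.groupby("era").apply(
    lambda df: df["prediction"].corr(df["target"], method="spearman")
)
print(f"mean corr: {per_era.mean():.5f}  sharpe: {per_era.mean() / per_era.std():.4f}")
```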
I just retrained and uploaded models… why not score them? As already discussed, I can also attest to how different diagnostics are from live submissions.
Just upload your v5 predictions to the same slots that are already staked, starting on the switchover day. (The last of the staked v4.3 rounds will still take another month to resolve after v4.3 submissions stop being accepted, so you can't just move those stakes immediately to other slots.)
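If it helps, the numerapi call is the same as before, you just point the new file at the already-staked slot. A minimal sketch (model name, file name, and credentials are placeholders):

```python
from numerapi import NumerAPI

# Authenticate and look up the id of the slot that already carries the stake.
napi = NumerAPI(public_id="YOUR_PUBLIC_ID", secret_key="YOUR_SECRET_KEY")
model_id = napi.get_models()["your_staked_model_name"]

# Upload the v5 predictions to that same slot; the stake stays where it is.
napi.upload_predictions("v5_predictions.csv", model_id=model_id)
```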
The Numerai example scripts provided on the Kaggle platform have now also been retrained on v5.0 data and uploaded to the tournament (each profile links to the Kaggle source code):
JOS_KAGGLE_MEDIUM_FN Profile - Numerai - tutorial #2, explaining feature neutralization, trained on the medium feature set. Second worst performer with just a 53% return. Interestingly, it is actually quite difficult to achieve better metrics with feature neutralization on v5.0 (a rough sketch of the neutralization step is below the list). Does anyone have an explanation?
JOS_KAGGLE_SUNSHINE Profile - Numerai - an older example from GitHub (no longer available) featuring both ensembling and neutralization on 1/4-downsampled "all data" with the medium feature set. Second best performer with an above-average 77% return.
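For anyone who wants to reproduce the neutralization effect mentioned above, this is roughly the step those tutorials apply: per era, subtract from the predictions their least-squares projection onto the feature matrix, then re-rank. This is a simplified sketch; the proportion of 1.0, the column names, and the per-era re-ranking are my assumptions, and the Kaggle notebooks have the canonical code:

```python
import numpy as np
import pandas as pd

def neutralize(df: pd.DataFrame, pred_col: str, feature_cols: list,
               proportion: float = 1.0) -> pd.Series:
    """Neutralize predictions against the given features within each era."""
    out = []
    for _, era_df in df.groupby("era"):
        scores = era_df[pred_col].values.reshape(-1, 1)
        exposures = era_df[feature_cols].values
        # Remove the component of the scores explained linearly by the features.
        projection = exposures @ (np.linalg.pinv(exposures) @ scores)
        neutral = scores - proportion * projection
        # Re-rank to [0, 1] within the era, as the example scripts do.
        out.append(pd.Series(neutral.ravel(), index=era_df.index).rank(pct=True))
    return pd.concat(out)
```

On v5.0 this tends to flatten the feature exposure of the predictions, which is presumably why the fully neutralized tutorial model ends up with weaker headline metrics than the non-neutralized ones.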