Is there a live model to track the example model of the new testset?

nyuton · October 1, 2021, 6:35pm

rigrog · October 1, 2021, 9:20pm

Over on the chat, someone said integration_test_7 switched to new (“super massive”) data on round 282, and then… switched back? Weird, if true.

Perhaps you could compare performance of “integration_test_N” (for every “N”) against performance of example_predictions.csv (legacy version, and super-massive version). Just use 2 of your 50 model slots, to submit those (unstaked) from your account.

Or instead of example_predictions.csv, you could run the example code that’s supposed to produce it.

Of course, if you were doing that, you would no longer need integration_test_whatever.

Topic		Replies	Views
What model is currently used for live_example_preds.parquet? Tournament	0	522	July 2, 2023
New data and the example predictions Tournament	4	1431	January 6, 2022
Why "test" data? Tournament	4	1004	April 10, 2022
TC vs. Legacy Data Tournament	2	885	April 4, 2022
Cannot replicate the result of example_validation_predictions.csv Tournament	1	546	March 27, 2022

Is there a live model to track the example model of the new testset?

Related topics