Cannot replicate the result of example_validation_predictions.csv

omegaz · March 24, 2022, 2:09am

Running the example script example_model.py produces predictions that do not match the downloaded example_validation_predictions.csv.

These are the commands I ran:

pip3 install -r requirements.txt
python3 example_model.py

Below are the first rows of each file:

$ head example_validation_predictions.csv
id,prediction
n000777698096000,0.22826
n0009793a3b91c27,0.73120
n00099ccd6698ab0,0.84688
n0019e36bbb8702b,0.84971
n0028cb874439df8,0.17504
n002ff30f2ccd7e0,0.46245
n00310da5e169a35,0.41654
n003fa83b4f2f608,0.44718
n004af567f93ca2c,0.59547

$ head validation_predictions_308.csv
id,prediction
n000777698096000,0.4399175033076504
n0009793a3b91c27,0.30779678981873704
n00099ccd6698ab0,0.7927576353913034
n0019e36bbb8702b,0.6281682102368538
n0028cb874439df8,0.124921709675387
n002ff30f2ccd7e0,0.6480622913030104
n00310da5e169a35,0.27058618606598994
n003fa83b4f2f608,0.4535094448706403
n004af567f93ca2c,0.5285866233799925

I am using the 308 era data. Any idea what causes the difference?
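
To put a number on the gap rather than eyeballing the rows, the two files can be joined on id and compared. The following is a minimal sketch, assuming both files have the id and prediction columns shown above; compare_predictions is a hypothetical helper name, not part of the example scripts, and the inline rows are just the head output pasted from this post.

```python
import io

import pandas as pd


def compare_predictions(reference: pd.DataFrame, candidate: pd.DataFrame) -> dict:
    """Join two prediction tables on id and summarize how far apart they are."""
    merged = reference.merge(candidate, on="id", suffixes=("_ref", "_new"))
    ref = merged["prediction_ref"]
    new = merged["prediction_new"]
    return {
        "rows_compared": len(merged),
        # Pearson correlation of the ranks, i.e. a Spearman-style rank correlation.
        "rank_corr": ref.rank().corr(new.rank()),
        "max_abs_diff": (ref - new).abs().max(),
    }


# First rows of each file, copied from the head output above.
reference = pd.read_csv(io.StringIO("""id,prediction
n000777698096000,0.22826
n0009793a3b91c27,0.73120
n00099ccd6698ab0,0.84688
n0019e36bbb8702b,0.84971
n0028cb874439df8,0.17504
n002ff30f2ccd7e0,0.46245
n00310da5e169a35,0.41654
n003fa83b4f2f608,0.44718
n004af567f93ca2c,0.59547
"""))

candidate = pd.read_csv(io.StringIO("""id,prediction
n000777698096000,0.4399175033076504
n0009793a3b91c27,0.30779678981873704
n00099ccd6698ab0,0.7927576353913034
n0019e36bbb8702b,0.6281682102368538
n0028cb874439df8,0.124921709675387
n002ff30f2ccd7e0,0.6480622913030104
n00310da5e169a35,0.27058618606598994
n003fa83b4f2f608,0.4535094448706403
n004af567f93ca2c,0.5285866233799925
"""))

result = compare_predictions(reference, candidate)
print(result)
```

On the full files, the two io.StringIO inputs would be replaced with pd.read_csv("example_validation_predictions.csv") and pd.read_csv("validation_predictions_308.csv"). A rank correlation well below 1 would suggest the two files come from genuinely different models or data, not just from rounding to five decimals.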

jefferythewind · March 27, 2022, 3:10am

I recently had the same question. You can also see a big difference when submitting both files to the diagnostics. The downloaded file gets .25 corr with .95 sharpe, but the file produced by example_model.py scores much worse.
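
For a rough local check of those two numbers without uploading, here is a sketch that computes a per-era correlation and a sharpe-style ratio. It rests on assumptions: the predictions have already been joined to validation data carrying era and target columns, corr is approximated as the Pearson correlation between percentile-ranked predictions and the target, and sharpe is the mean of the per-era correlations divided by their standard deviation (ddof=0 is my choice here). The diagnostics page may compute these differently, and the toy frame is made up purely to show the call.

```python
import pandas as pd


def era_corr_and_sharpe(df: pd.DataFrame) -> tuple[float, float]:
    """Return (mean per-era corr, mean / std of per-era corr).

    Assumes df has 'era', 'prediction' and 'target' columns.
    """
    per_era = pd.Series({
        # Rank predictions within the era, then correlate with the target.
        era: group["prediction"].rank(pct=True).corr(group["target"])
        for era, group in df.groupby("era")
    })
    mean_corr = per_era.mean()
    # ddof=0 is an assumption; a different convention changes the ratio slightly.
    sharpe = mean_corr / per_era.std(ddof=0)
    return mean_corr, sharpe


# Made-up toy data: two eras of four rows each, for illustration only.
toy = pd.DataFrame({
    "era": ["era1"] * 4 + ["era2"] * 4,
    "prediction": [0.1, 0.2, 0.3, 0.4, 0.1, 0.2, 0.3, 0.4],
    "target": [0.0, 0.25, 0.5, 0.75, 0.25, 0.0, 0.75, 0.5],
})

mean_corr, sharpe = era_corr_and_sharpe(toy)
print(mean_corr, sharpe)
```

Running the same function on both prediction files, each joined to the same validation data, would show whether the gap seen in the diagnostics also appears locally.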