Marketplace: jackerparker4 model for sale

jackerparker · June 23, 2021, 6:34pm

Upd: you can buy predictions here

Hi everyone, I would like to start selling predictions for my model jackerparker4 (started only at Round 263). Since we don’t have a marketplace yet, I decided to create a post here on the forum.

Brief overview of the model: The model was developed using LightGBM with strong focus on accurate CV, feature selection and feature neutralization. Actually, I used the same principles a year ago with another model which I’ve discussed and shared in this post (Feature neutralization workflow). That old model is still freely available on github. However, at this time I’ve revised all the stages and finished with a new model. In particular, the fast combinatorial cross validation was used and it was discussed in this forum post (Fast Combinatorial Cross Validation). As for the new feature selection and feature neutralization workflows - I didn’t discuss it anywhere and it is my “secret sauce” right now. The model was trained using training (1-120 eras) + val1 (121-132) data, and val2 (197-212 eras) data was used as holdout. That is why there is no sense to compare and look into normal validation report, but here is a comparison of jackerparker4 vs example_predictions for val2 data:

The statistics from my local CV: 0.04636 COR, 1.84 sharpe and -0.0297 min COR value for eras 1-132.

And here are the current live results (the model was started only at round 263):

Additional info that does matter here:

Kaggle (markmipt | Contributor | Kaggle): I participate in a similar competition (Jane Street Market Prediction | Kaggle). It will be finally ended only in 2 months, but my model has 56th position on the current live leaderbord. That kaggle model is something average between my current workflow and the workflow I’ve shared a year ago. The fact that my methods in general work well in different competitions adds some confidence, at least for me.

Science (‪Mark V Ivanov‬ - ‪Google Scholar‬): I have a PhD in bioinformatics, my h-index is 12 and one of my strongest skills is creation of schemes for validation of results in my field (proteomics). The latter also helps me in the model development for finance-related stuff.

Other models: I also have 5 additional active accounts (jackerparker-jackerparker6), but these models are just dot products of jackerparker4. For example, jackerparker5 is the same predictions as the jackerparker4, but 100% feature neutralized. Jackerparker6 is 0% feature neutralized. Jackerparker1-3 are similar models as 4-6, but developed using less reliable CV. So, I have only one model which I trust and that reduces chances of random fit into live data.

To assess the prospects of predictions selling and since my model has not many live rounds, I would like to start with a 25$ price for a single round.

Please contact me for more details here or in the chat.

Regards,
Mark

autratec · June 24, 2021, 9:07am

Good try. Hope your model working. Should u charge based on principal ?

jackerparker · June 24, 2021, 9:30am

Not sure if I understand correctly: do you mean that the price should be based on the stake which user is going to stake on the model? If yes - than my answer is no. I’m not sure if that will be legal in my jurisdiction, as well as any relations with cryptocurrency. I just want to sell model using gumroad.com (platform for creators, that will simplify taxes stuff for me) for fixed fiat price.

autratec · June 24, 2021, 10:29am

Recent earning model change make it feasible to charge based on real earning. For example 20% profit with watermark etc.

restrading · June 24, 2021, 10:32am

@autratec Without a marketplace doing the submissions for the buyers there would not be a good way to enforce the profit-sharing. Also such profit-sharing most likely will cause regulatory complexities.

autratec · June 24, 2021, 10:47am

Let’s leave regulation aside, the weekly prediction data still under control of scientist. No profit sharing, no prediction file.

Performance fee paid by NMR. Direct wallet to wallet transfer.

restrading · June 24, 2021, 10:52am

@autratec An attack can be done by buying->not sharing profit->change NMR wallet->buying again->…

autratec · June 24, 2021, 10:57am

Not such complex. Basic kyc will be necessary…

restrading · June 24, 2021, 10:57am

KYC won’t be quite well-received given this is a crypto project

jackerparker · June 28, 2021, 5:34am

Predictions are available here: jackerparker

autratec · June 28, 2021, 2:02pm

Not quite sure your business model. Assume u will send one time prediction for one round with 25 USD as return ?

jackerparker · June 28, 2021, 2:18pm

Yeah, you are right. For 25 USD buyer will get predictions for a single round. What is approximately 100$/month and 1200$/year, assuming the price will not be adjusted

jnolan9 · June 29, 2021, 12:33pm

Autratec, can you elaborate?

autratec · June 30, 2021, 9:14am

Be honest, with those no code/ low code machine learning as service be more popular in the world, submitting weekly prediction with GBTree model won’t be a rocket science any more.

With no much data scientist background, a normal person with some statistics concept is able to submit the model, get a fair result (0.05 + CORR ) in 4 to 8 hours as one time effort. Weekly updating, including 2.5G + data download and upload , might take 60 to 90 mins.

In the near future, I believe tournament will be less challenging. Every one using tree model to grab 5-9% return weekly and payout ratio will be reach to minum faster then we thought.

It means, hard to sell any model in this game.

sirbradflies · July 1, 2021, 6:36am

I agree with this reasoning, the only factor that could change this situation could be a switch to a 100% MMC payout. In that scenario “commodity” models will be of little value and so “real” models could still command a premium.
Assuming the priority for Numerai is the meta-model development then CORR payouts should be phased out at some point and only actual contributions (i.e. MMC) get rewarded, which is much more difficult to obtain with “off-the-shelf” models.
Just my opinion!

jay1100 · July 1, 2021, 8:44am

If you go for 100% MMC payout people could submit random numbers. This would give high MMC because very different from meta-model, but of course not useful at all. Therefore you always have to reward/penalize CORR as well.

mic · July 1, 2021, 8:55am

MMC is how much you help or hurt the meta model. Random numbers won’t help.

hydration · July 1, 2021, 10:37am

This is such a great idea that I am going to copy it! Who wants to buy predictions from my top 20 model? The model has not changed since round 238. DM me on Rocketchat or on Twitter if you are interested.

jay1100 · July 1, 2021, 11:16am

Why are you stacking so little on your models?

jay1100 · July 1, 2021, 11:28am

Your are right. Thanks for pointing this out.
MMC is regressing (= neutralizing) the meta model predictions out of your predictions. Then it is calculating the correlation of your adapted predictions and the target. So you want your predictions to not be correlated with the meta-model predictions but still be correlated with the target.