Marketplace: jackerparker4 model for sale

Upd: you can buy predictions here

Hi everyone, I would like to start selling predictions for my model jackerparker4 (started only at Round 263). Since we don’t have a marketplace yet, I decided to create a post here on the forum.

Brief overview of the model: The model was developed using LightGBM with strong focus on accurate CV, feature selection and feature neutralization. Actually, I used the same principles a year ago with another model which I’ve discussed and shared in this post (Feature neutralization workflow). That old model is still freely available on github. However, at this time I’ve revised all the stages and finished with a new model. In particular, the fast combinatorial cross validation was used and it was discussed in this forum post (Fast Combinatorial Cross Validation). As for the new feature selection and feature neutralization workflows - I didn’t discuss it anywhere and it is my “secret sauce” right now. The model was trained using training (1-120 eras) + val1 (121-132) data, and val2 (197-212 eras) data was used as holdout. That is why there is no sense to compare and look into normal validation report, but here is a comparison of jackerparker4 vs example_predictions for val2 data:

The statistics from my local CV: 0.04636 COR, 1.84 sharpe and -0.0297 min COR value for eras 1-132.

And here are the current live results (the model was started only at round 263):

Additional info that does matter here:

Kaggle (markmipt | Contributor | Kaggle): I participate in a similar competition (Jane Street Market Prediction | Kaggle). It will be finally ended only in 2 months, but my model has 56th position on the current live leaderbord. That kaggle model is something average between my current workflow and the workflow I’ve shared a year ago. The fact that my methods in general work well in different competitions adds some confidence, at least for me.

Science (‪Mark V Ivanov‬ - ‪Google Scholar‬): I have a PhD in bioinformatics, my h-index is 12 and one of my strongest skills is creation of schemes for validation of results in my field (proteomics). The latter also helps me in the model development for finance-related stuff.

Other models: I also have 5 additional active accounts (jackerparker-jackerparker6), but these models are just dot products of jackerparker4. For example, jackerparker5 is the same predictions as the jackerparker4, but 100% feature neutralized. Jackerparker6 is 0% feature neutralized. Jackerparker1-3 are similar models as 4-6, but developed using less reliable CV. So, I have only one model which I trust and that reduces chances of random fit into live data.

To assess the prospects of predictions selling and since my model has not many live rounds, I would like to start with a 25$ price for a single round.

Please contact me for more details here or in the chat.

Regards,
Mark

6 Likes

Good try. Hope your model working. Should u charge based on principal ?

Not sure if I understand correctly: do you mean that the price should be based on the stake which user is going to stake on the model? If yes - than my answer is no. I’m not sure if that will be legal in my jurisdiction, as well as any relations with cryptocurrency. I just want to sell model using gumroad.com (platform for creators, that will simplify taxes stuff for me) for fixed fiat price.

Recent earning model change make it feasible to charge based on real earning. For example 20% profit with watermark etc.

@autratec Without a marketplace doing the submissions for the buyers there would not be a good way to enforce the profit-sharing. Also such profit-sharing most likely will cause regulatory complexities.

Let’s leave regulation aside, the weekly prediction data still under control of scientist. No profit sharing, no prediction file.

Performance fee paid by NMR. Direct wallet to wallet transfer.

1 Like

@autratec An attack can be done by buying->not sharing profit->change NMR wallet->buying again->…

Not such complex. Basic kyc will be necessary…

KYC won’t be quite well-received given this is a crypto project

Predictions are available here: jackerparker

1 Like

Not quite sure your business model. Assume u will send one time prediction for one round with 25 USD as return ?

Yeah, you are right. For 25 USD buyer will get predictions for a single round. What is approximately 100$/month and 1200$/year, assuming the price will not be adjusted

1 Like

Autratec, can you elaborate?

Be honest, with those no code/ low code machine learning as service be more popular in the world, submitting weekly prediction with GBTree model won’t be a rocket science any more.

With no much data scientist background, a normal person with some statistics concept is able to submit the model, get a fair result (0.05 + CORR ) in 4 to 8 hours as one time effort. Weekly updating, including 2.5G + data download and upload , might take 60 to 90 mins.

In the near future, I believe tournament will be less challenging. Every one using tree model to grab 5-9% return weekly and payout ratio will be reach to minum faster then we thought.

It means, hard to sell any model in this game.

1 Like

I agree with this reasoning, the only factor that could change this situation could be a switch to a 100% MMC payout. In that scenario “commodity” models will be of little value and so “real” models could still command a premium.
Assuming the priority for Numerai is the meta-model development then CORR payouts should be phased out at some point and only actual contributions (i.e. MMC) get rewarded, which is much more difficult to obtain with “off-the-shelf” models.
Just my opinion! :slight_smile:

2 Likes

If you go for 100% MMC payout people could submit random numbers. This would give high MMC because very different from meta-model, but of course not useful at all. Therefore you always have to reward/penalize CORR as well.

MMC is how much you help or hurt the meta model. Random numbers won’t help.

2 Likes

This is such a great idea that I am going to copy it! Who wants to buy predictions from my top 20 model? The model has not changed since round 238. DM me on Rocketchat or on Twitter if you are interested.

Why are you stacking so little on your models?

Your are right. Thanks for pointing this out.
MMC is regressing (= neutralizing) the meta model predictions out of your predictions. Then it is calculating the correlation of your adapted predictions and the target. So you want your predictions to not be correlated with the meta-model predictions but still be correlated with the target.