How to estimate TC with the numerai meta model data

nyuton · January 12, 2023, 7:46am

Hi,

TC is a metric that shows how much your model improves the meta model.
That means that ensembling my model's predictions with the meta model should improve the meta model's metrics, such as corr and Sharpe. The fund also picks its trades from the stocks where the meta model has the most confidence (top/bottom 200).
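
To make the top/bottom 200 idea concrete, here is a minimal sketch on synthetic data. It assumes the metric is the per-era correlation between prediction and target, restricted to the N highest and N lowest predictions in that era. The helper `tb_corr`, the toy DataFrame and its column names are hypothetical and not part of the Numerai tooling.

```python
import numpy as np
import pandas as pd


def tb_corr(era_df, pred_col, target_col, n):
    """Correlation of prediction and target over the n lowest and n highest predictions."""
    ordered = era_df.sort_values(pred_col)
    picked = pd.concat([ordered.head(n), ordered.tail(n)])
    return np.corrcoef(picked[pred_col], picked[target_col])[0, 1]


# Synthetic stand-in data (assumption): 3 eras with 1000 rows each
rng = np.random.default_rng(0)
toy = pd.DataFrame({
    "era": np.repeat(["0001", "0002", "0003"], 1000),
    "target": rng.normal(size=3000),
})
toy["pred"] = 0.1 * toy["target"] + rng.normal(size=3000)

# One top/bottom-200 correlation per era, then the mean and a Sharpe-like ratio
per_era = pd.Series({
    era: tb_corr(era_df, "pred", "target", 200)
    for era, era_df in toy.groupby("era")
})
tb200_mean = per_era.mean()
tb200_sharpe = tb200_mean / per_era.std()
print(per_era)
print(tb200_mean, tb200_sharpe)
```

Only the rows where the model is most confident contribute to this score, so it can differ a lot from the plain per-era correlation.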

With that in mind, we can estimate the (past) TC of a model with the following script:

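import pandas as pd
from utils import validation_metrics  # assumption: the helper from the Numerai example scripts

# validation, features, my_model, EXAMPLE_PREDS_COL and TARGET_COL are assumed to be defined already
validation['my_prediction'] = my_model.predict(validation[features])

mm = pd.read_parquet('v4.1_meta_model.parquet')
validation = validation.join(mm['numerai_meta_model'])

# Rank both predictions within each era and average the ranks; rows without meta model data stay NaN
validation.loc[:, 'mm_ensemble'] = validation[['era', 'my_prediction', 'numerai_meta_model']].dropna().groupby('era').rank(pct=True).mean(axis=1)

validation_stats = validation_metrics(
    validation,
    ['my_prediction', 'mm_ensemble', 'numerai_meta_model'],
    example_col=EXAMPLE_PREDS_COL,
    fast_mode=False,
    target_col=TARGET_COL,
)
print(validation_stats[['mean', 'sharpe', 'tb200_mean', 'tb200_sharpe']])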

So TC should be correlated with the gain of the meta model after ensembling.
I guess that if the tb200_mean of the ensemble is lower than that of the meta model, then there is no TC to expect from that model.
Ideally, the ensemble should outperform both of its components.
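
As a rough illustration of that gain, here is a minimal sketch on synthetic data. It is only a proxy under stated assumptions, not the official TC calculation: the ensemble is a 50/50 average of per-era percentile ranks, and the score is the per-era correlation of ranked predictions with the target. The helpers `per_era_corr` and `ensemble_gain` and the toy columns `model` and `mm` are hypothetical names.

```python
import numpy as np
import pandas as pd


def per_era_corr(df, pred_col, target_col="target", era_col="era"):
    """Correlation of percentile-ranked predictions with the target, one value per era."""
    return pd.Series({
        era: g[pred_col].rank(pct=True).corr(g[target_col])
        for era, g in df.groupby(era_col)
    })


def ensemble_gain(df, model_col, mm_col, target_col="target", era_col="era"):
    """Mean per-era corr of the 50/50 rank ensemble minus that of the meta model alone."""
    ranked = df.groupby(era_col)[[model_col, mm_col]].rank(pct=True)
    scored = df.assign(ensemble=ranked.mean(axis=1))
    ens = per_era_corr(scored, "ensemble", target_col, era_col)
    mm = per_era_corr(scored, mm_col, target_col, era_col)
    return ens.mean() - mm.mean()


# Synthetic stand-in data (assumption): the target has two independent signals,
# the meta model sees one of them and my model sees the other.
rng = np.random.default_rng(42)
n_eras, n_rows = 5, 2000
size = n_eras * n_rows
a, b = rng.normal(size=size), rng.normal(size=size)
toy = pd.DataFrame({
    "era": np.repeat([f"{i:04d}" for i in range(1, n_eras + 1)], n_rows),
    "target": a + b + rng.normal(size=size),
    "mm": a + rng.normal(size=size),
    "model": b + rng.normal(size=size),
})

gain = ensemble_gain(toy, "model", "mm")
print(gain)  # positive when the model adds information the meta model lacks
```

A model that merely copies the meta model gets a gain of zero here, which matches the intuition that it would contribute nothing new.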

Do you think that the above estimation method is correct?
Has anyone come up with a better estimation?

jmrichardson · January 12, 2023, 4:19pm

That seems reasonable to me. I believe I remember hearing that it was actually top and bottom 500 (instead of 200)?

olivepossum · January 12, 2023, 8:49pm

I think it’s 200, but MDO shared a post on optimizing for 500 and mentioned that it overfitted less: “Optimizing for FNC and TB scores”.
