I am just starting out creating a model. What should I be trying to hit when I run the “Score” method against the [data_type == ‘validation’] data? I see the top 100 people in the tournament are scoring >0.03, but that seems like a high goal at the start.
import numpy as np
import pandas as pd

# column names as used in the example script; adjust to match the dataset
PREDICTION_NAME = "prediction"
TARGET_NAME = "target"

# Submissions are scored by spearman correlation
def correlation(predictions, targets):
    # rank-transform predictions, then take Pearson correlation of the ranks
    ranked_preds = predictions.rank(pct=True, method="first")
    return np.corrcoef(ranked_preds, targets)[0, 1]

# convenience method for scoring a dataframe holding both columns
def score(df):
    return correlation(df[PREDICTION_NAME], df[TARGET_NAME])
I recommend loading up the example predictions and computing the spearman correlation of those predictions against the validation data. Take note of that number (and please post it here as I’m curious what it is but have never computed it myself), then compare it to the historical performance of the @integration_test model.
I call your attention to that model in particular because it just posts the example predictions each week. By looking at this one example’s performance against both the validation data and the live data, you can get a sense of what to expect in the competition. Of course there’s no guarantee that your model will experience similar variance to @integration_test, but it’s at least a place to start answering your question.
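For what it’s worth, here is a minimal sketch of that scoring check. The real workflow would load the tournament file, filter rows where data_type == "validation", and score the example predictions column against the target column; since I can’t bundle the actual files here, this uses synthetic stand-in data just to show the mechanics of the metric:

```python
import numpy as np
import pandas as pd

def correlation(predictions, targets):
    # rank-transform predictions, then Pearson correlation of the ranks
    ranked_preds = predictions.rank(pct=True, method="first")
    return np.corrcoef(ranked_preds, targets)[0, 1]

# Stand-in for the real data: targets take the five discrete values used
# in the tournament, and the fake "predictions" are weakly related to them.
rng = np.random.default_rng(0)
targets = pd.Series(rng.choice([0.0, 0.25, 0.5, 0.75, 1.0], size=5000))
preds = pd.Series(targets + rng.normal(0, 1.0, size=5000))

score = correlation(preds, targets)
print(round(score, 4))
```

Swap the synthetic series for the example predictions and the validation targets and the same call gives you the number to compare against @integration_test’s history.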
I did see that model, so that probably would be a good start. I am trying to create a neural net solution. My first run will use no feature modification, to get a baseline. Then I will try to crunch the numbers to get a real solution.
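In case it helps anyone following along, here is one way to sketch that kind of no-feature-engineering baseline. Everything here is an assumption on my part (scikit-learn’s MLPRegressor as the net, synthetic features standing in for the real tournament columns), not the poster’s actual setup:

```python
import numpy as np
import pandas as pd
from sklearn.neural_network import MLPRegressor

def correlation(predictions, targets):
    # the tournament-style score: Pearson correlation of ranked predictions
    ranked_preds = predictions.rank(pct=True, method="first")
    return np.corrcoef(ranked_preds, targets)[0, 1]

# Synthetic stand-in for the data: 50 raw features, one continuous target
# that depends on a couple of features plus noise.
rng = np.random.default_rng(42)
n_train, n_valid, n_feat = 4000, 1000, 50
X = rng.normal(size=(n_train + n_valid, n_feat))
y = X[:, 0] + 0.5 * X[:, 1] + rng.normal(0, 1.0, size=n_train + n_valid)

X_train, X_valid = X[:n_train], X[n_train:]
y_train, y_valid = y[:n_train], y[n_train:]

# small net, raw features, no tuning -- deliberately just a baseline
model = MLPRegressor(hidden_layer_sizes=(32,), max_iter=200, random_state=0)
model.fit(X_train, y_train)

val_score = correlation(pd.Series(model.predict(X_valid)),
                        pd.Series(y_valid))
print(round(val_score, 4))
```

On the real data the validation correlation of an untuned net will of course be far lower than on this easy synthetic signal; the point is only the shape of the loop: fit on train, predict on validation, score with the same correlation metric the tournament uses.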