As everyone knows, there are 6 groups of features in the dataset, and I've always thought there must be some reason behind that.
The following are the avenues I've explored so far:
Train models on a single feature group (e.g. Dexterity only, Strength only, etc.) or on a combination of feature groups (e.g. Intelligence & Strength, Dexterity & Charisma & Constitution, etc.). There are many possible variations: take subsets from each group and combine them, ensemble the predictions, and so on.
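A minimal sketch of that group-combination idea, assuming the columns carry group prefixes like `dexterity_1` (the prefixes, the target column, and the Ridge base model are all illustrative assumptions, not the actual setup):

```python
from itertools import combinations

import numpy as np
import pandas as pd
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_predict

# Assumed group labels; swap in whatever the real prefixes are.
GROUPS = ["dexterity", "strength", "intelligence",
          "charisma", "constitution", "wisdom"]

def group_cols(df, group):
    """Columns belonging to one feature group, identified by prefix."""
    return [c for c in df.columns if c.startswith(group + "_")]

def ensemble_over_group_combos(df, target, max_groups=2):
    """Train one model per group combination (up to max_groups groups),
    then average the out-of-fold predictions as a simple ensemble."""
    y = df[target]
    preds = []
    for r in range(1, max_groups + 1):
        for combo in combinations(GROUPS, r):
            cols = sum((group_cols(df, g) for g in combo), [])
            if not cols:  # skip combos with no matching columns
                continue
            preds.append(cross_val_predict(Ridge(), df[cols], y, cv=5))
    return np.mean(preds, axis=0)
```

With 6 groups the number of combinations grows quickly (63 non-empty subsets), which is why subsetting and ensembling can get expensive fast.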
Generate representative features from each feature group (e.g. PCA components, correlations, standard deviations…) and use them for predictions.
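A sketch of what those representative features could look like: each group is compressed into a first principal component plus simple per-row statistics. The group prefixes and the exact summary choices are assumptions for illustration.

```python
import pandas as pd
from sklearn.decomposition import PCA

def group_summary_features(df, groups):
    """Build a small set of representative features per group:
    first PCA component, per-row mean, and per-row std."""
    out = pd.DataFrame(index=df.index)
    for g in groups:
        cols = [c for c in df.columns if c.startswith(g + "_")]
        if not cols:
            continue
        X = df[cols].to_numpy()
        out[f"{g}_pca1"] = PCA(n_components=1).fit_transform(X).ravel()
        out[f"{g}_mean"] = X.mean(axis=1)
        out[f"{g}_std"] = X.std(axis=1)
    return out
```

The resulting frame can be used on its own or concatenated with the raw features before training.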
I tried XGBoost's feature interaction constraints and noticed no discernible difference between using them and not. I have since abandoned the project. I do believe that the next frontier in model development will include some type of feature restriction, but so far, the frontier lies ahead…
My understanding is that the interaction constraints implemented by XGBoost only allow interactions within groups, not across groups. This is the opposite of what we want.
Yes, you are right about the way XGBoost implemented it. So I defined 310 lists and passed them as constraints: [feature_dexterity1, all features EXCEPT the other dexterity features], [feature_dexterity2, all features EXCEPT the other dexterity features], etc.
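The 310-list construction above can be sketched as follows: for each feature, the allowed set is itself plus every feature outside its own group. XGBoost's `interaction_constraints` parameter accepts a nested list of feature indices, so the helper returns indices; the feature and group names here are illustrative.

```python
def cross_group_constraints(feature_names, group_of):
    """One constraint list per feature: the feature itself plus all
    features from OTHER groups (so within-group interactions are
    excluded). group_of maps feature name -> group label."""
    idx = {f: i for i, f in enumerate(feature_names)}
    constraints = []
    for f in feature_names:
        allowed = [idx[g] for g in feature_names
                   if g == f or group_of[g] != group_of[f]]
        constraints.append(allowed)
    return constraints

# Usage (sketch): pass the result to XGBoost, e.g.
# model = xgb.XGBRegressor(interaction_constraints=constraints)
```

Note that XGBoost treats each list as a set of features that may co-occur along a tree path, so a path can still mix features drawn from different lists; the constraint limits splits within a single branch rather than globally forbidding within-group pairs.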
That wasn’t a problem, but the results just weren’t great, unfortunately… I still don’t know how to make good use of the feature groups. The way I train my model at the moment ignores the groups completely, so it would make no difference if they were simply labeled feature1-feature310.