Hi,
-
I downloaded the new v4 data and the last era (era 1004) has target filled with NaNs.
Does the era 1004 correspond to the round 310? Respectively, is the era 1004 the current era? -
Also, are there any missing weeks? Is it always true that era(X) and era(X+1) corresponds to consecutive weeks?
-
Another thing that I found is that compared to V3 some eras have different amount of instances. Is that OK? (I checked only validation data.)
era V3 V4
0871 4910 4911
0872 4918 4919
0873 4932 4933
0875 5051 5052
0902 4997 5002
0912 5174 5182
0913 5001 5009
0914 5203 5212
0915 5185 5193
0916 5183 5191
0936 5192 5191
- How does the mapping of features in features.json work? I wanted to harvest my previous research on features so I wouldn’t have to start over again; however, I found only an array of features inside the json file.:
json.load('features.json')['feature_sets']['v3_equivalent_features']
, so I thought it should match the ordering of features from the V3 dataset, but when I run correlation test on each pair it is not exact match.
0.9930286498066472
0.9973728390894694
0.9618079870303051
0.9936341791676602
0.9831262504964846
0.9934326084172069
0.9990904802940008
0.9916135182068833
1.0
1.0
1.0
0.9996968267646669
0.998989490641361
0.9987875113360812
0.9989895927800676
0.9989895927800676
1.0
0.9937375180105527
0.9704001245800853
0.9956573427977774
0.9165522644319917
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
Thank you!
Sneaky