Strange correlation behavior

mrquantsalot · January 22, 2022, 5:44pm

I was watching this NNTaleb video on correlation (https://www.youtube.com/watch?v=o9Ac85xdjE4) and he talks about how correlation is often not a good metric for measuring dependence between variables.

Here’s the example:

Any nonlinear model can use x to predict y. The takeaway could be not to use correlation to decide what features to include.

of_s · January 22, 2022, 8:08pm

It is critical to discern between correlation and dependence…
https://cran.r-project.org/web/packages/NNS/vignettes/NNSvignette_Correlation_and_Dependence.html

gammarat · January 22, 2022, 8:45pm

I think a better takeaway is just that correlation is limited and should be used judiciously. If you take your triangle example (or Taleb’s—thanks for posting the video, btw) you’ll note that while a single correlation doesn’t produce any useful information, two correlations (one on each leg of the triangle) would. That then introduces a new question, how to partition the domain under analysis into suitable “regimes” where simple methods suffice.

The regime question surfaces here from time to time, and it’s one I do find fascinating. In practical terms, one might think of regimes in the Tournament as eras in which a specific set of features might correlate well with the targets, while a different regime would consist of eras in which a different set of features would do so. If one could identify the regime of an era before inverting the features to estimating targets, then Bob’s-yer-uncle you’ll be rich .

Topic		Replies	Views
Validation Metrics Backtest Data Science	0	907	April 21, 2021
Do you use FNC to measure your model's performance? Data Science	2	1099	April 12, 2021
Orthogonal Model Performance Across Eras Data Science	1	711	May 14, 2024
Independence and Sharpe Data Science	20	3871	January 13, 2022
Regimes, turbulence, matching eras, and whatnot Data Science	1	844	May 12, 2023

Strange correlation behavior

Related topics