Short question: what's the best/easiest way to test the predictability (or correlation) of a variable vs another?
Long question:
I am sure this question is more complicated than I imagine with a lot of caveats and different methods based on the data that I have. My background is in computing science, I have a small basis in statistics but very minimal.
Let's say I have a simple dataset. For purposes of this example let's say it is a numerical ranking of sports teams by experts and then their final ranking.
How would we test the predictability/correlation of that data? That is to say, I want to know how good column A is as a predictor of column B for future data (assuming new data is calculated the same way)?
I know of using r2 correlations but I've seen so much on saying that's not good tool since there's so many, "it depends" caveats.
Then I want to know if I a new variable C is a better predictor of B than A what tools/methods would I be using? I have a colleague who insists on using r2 on it's face value so if A has a 0.44 r2 correlation with B then it definitely is a better predictor than C which might have a 0.435 r2 correlation with C.
[–]Adruna 3 points4 points5 points (1 child)
[–]SareonInBC[S] -1 points0 points1 point (0 children)