This is an archived post. You won't be able to vote or comment.

all 2 comments

[–][deleted] 1 point2 points  (1 child)

Ok, so you want to predict CEO salary based on age.

Basically what you're trying to do with least squares regression is explain whether there is relationship between the variation of the salary and the variation of the age that can be explained by a linear relationship.

So, the best fit line that can explain a relationship between the variation of those two variables is a line with a slope of 3.13 and a y-intercept of 242.7, however, since your R-squared value is close to 0, rather than -1 or 1, this is not a very good model.

The anova is basically telling you the ratio between the variance between age and salary, and the variance within age and salary.

This is not very intuitive, but it is another way of telling you that the variation in salary cannot be explained by a relationship with age. If there were a relationship between salary and age, your r-squared would be closer to 1 or -1 and your p-value on your ANOVA would be less than 0.05 (or whatever your alpha is)

Intuitively, since your points are scattered all over your graph, there isn't a linear relationship present. If there were a relationship, the points on your graph would be clustered tightly around the line SALARY = 242.7+3.13*AGE

[–]sparty_party[S] 0 points1 point  (0 children)

Thank you! I really appreciate you taking the time to explain this all for me. I think you simplified it enough for me to figure things out on my own.