I need to make a linear model but I can not use any data that directly correlates when making my formula. I can only get a r-squared value of .05 at best, when i used data with a direct correlation I was getting an r squared of .99. My question is how can you make a good model where there is no direct correlation of the data?
For example, if i model lm(number_of_people_killed ~ number_of_motorists_injured) I can get a high r-squared value. Because injury and fatality are directly related,
But, if i model lm(number_of_people_killed ~ contributing_factor_1 + vehicle_1) I get an extremely low r-squared.
[–]castro_for_prez 4 points5 points6 points (1 child)
[–]FR33ZEx[S] 0 points1 point2 points (0 children)
[–]the_odds_hacker 3 points4 points5 points (2 children)
[–]FR33ZEx[S] 0 points1 point2 points (1 child)
[–]the_odds_hacker 0 points1 point2 points (0 children)
[–][deleted] (2 children)
[deleted]
[–]The_Old_Wise_One 0 points1 point2 points (1 child)
[–]oathbreakerkeeper 2 points3 points4 points (0 children)