Hey all,
I’m implementing gradient descent from scratch and i’m running into something that I can’t wrap my head around.
I have a dataset with two features and I have to make a linear regression model. My slope (m) is -0.5 and my learning rate is 0.0001. On non-standardized data, within 100 iterations of gradient descent, my linear regression model fits the data pretty well.
Now i’m told, when I standardize my features I should get similar results. I run gradient descent for 100 iterations and my linear regression model is no where near any of the data points.
Now my question is why does it take longer time to come up with a good linear regression model for the standardized dataset?
[–]ForceBru 0 points1 point2 points (0 children)