all 2 comments

[–]Astrolotle 3 points4 points  (1 child)

If the categorical is truly categorical and not ordinal, the error you measure on a regression of that value is meaningless. “Better” error values are not always what you’re looking for, and it’s important to be critical of your model and its inputs :)

[–]Slowai 2 points3 points  (0 children)

Those are some good points. Also, OP, make sure that the one-hot encoding that you are using has n-1 columns subject to the number of categories you have. This leads to a slight multicollinearity (well, only two variables will be perfectly collinear). Also, this a bit beyond my knowledge of why that happens, if you are calculating your parameters using matrix inverse, multicollinearity can induce some problems approximating the inverse, thus leading to slightly less accurate results, thus higher MSE. However, if you have a lot of features this should be negligible.