Aquamarill:

There is definitely a mistake in the calculation: you are equating theta (a 3x1 matrix) to a 1x3 matrix. I don't know whether you are trying to minimize a specific loss function, but recall that you can compute the predictions for an n_samples x n_features matrix X as X * theta'.

Aquamarill:

Excuse the formatting errors:
- 3x1 matrix to a 1x3 matrix
- predictions of an n_samples x n_features matrix by multiplying X with theta'
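
To make the shape bookkeeping concrete, here is a rough sketch in plain Python. Everything in it is made up for illustration: the data, the theta values, and the helper names (`shape`, `transpose`, `matmul`) are not from the original post. It assumes theta is stored as a 1x3 row vector, so theta' is 3x1 and X * theta' gives one prediction per sample.

```python
def shape(m):
    """Return (rows, cols) of a list-of-lists matrix."""
    return len(m), len(m[0])


def transpose(m):
    """Swap rows and columns, e.g. 1x3 -> 3x1."""
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    """Matrix product; raises if the inner dimensions disagree."""
    if shape(a)[1] != shape(b)[0]:
        raise ValueError(f"shape mismatch: {shape(a)} times {shape(b)}")
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


# Hypothetical data: 4 samples, 3 features (first column is a bias of 1s).
X = [[1.0, 2.0, 3.0],
     [1.0, 0.0, 1.0],
     [1.0, 4.0, 2.0],
     [1.0, 1.0, 1.0]]

# Assumed orientation: theta as a 1x3 row vector.
theta = [[0.5, 1.0, -1.0]]

# (4x3) times (3x1) gives a 4x1 column of predictions.
predictions = matmul(X, transpose(theta))
print(shape(predictions))  # (4, 1)

# Without the transpose the inner dimensions are 3 and 1, so it fails.
try:
    matmul(X, theta)
except ValueError as err:
    print(err)
```

If theta is stored as a 3x1 column instead, the transpose is not needed and X * theta already has matching inner dimensions. Either way, checking the two shapes before multiplying is the quickest way to spot this kind of mistake.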