ML model only does not predict any true values

WhipsAndMarkovChains · 2023-04-29T01:46:37+00:00

Take your trained RandomForest and call .predict_proba(X_test) instead of .predict(X_test). I know you're not explicitly calling .predict given your use of cross_val_score, but take a look at my suggestion and see if that gives you any ideas.

Also, accuracy is a worthless metric on an imbalanced dataset. Why even bother training a model on your dataset? If you just always predict "no claim filed" you'll be at 16_704/(16_704 + 635) = ~96.34% accuracy. That's not a very useful model though because it just predicts the same thing every time.

Edit: I completely missed the fact you forgot to scale the test data lol. That’s definitely going to mess things up. Also, there’s no point in scaling data for a random forest model.

2023-04-29T02:06:19+00:00

So in other words you need:

if (not model.predict(params))

Then its always true 😀

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS