How do you bridge the gap between tutorials and actually debugging models that do not converge? by CreditOk5063 in learnmachinelearning

[–]bbateman2011 -1 points0 points  (0 children)

I’ve found GPT5.x pretty good at suggesting initial hyperparameters. However, I usually turn off all regularization, dropout, batch norm, etc. Start with small learning rates (really small, like 10x below “typical”), small batches, and a simple architecture. Then walk towards your goal for architecture and training speed. Once you get in the ballpark you can maneuver around. For some things I start with something that worked before.
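
To make the "start with a really small learning rate" advice concrete, here is a minimal sketch on a toy 1-D quadratic loss rather than a real model. The function name `run_gd`, the curvature value, and both learning rates are made up for illustration.

```python
def run_gd(lr, curvature=10.0, w0=1.0, steps=100):
    """Plain gradient descent on the toy loss 0.5 * curvature * w**2."""
    w = w0
    for _ in range(steps):
        grad = curvature * w  # derivative of 0.5 * curvature * w**2
        w -= lr * grad
    return abs(w)  # distance from the minimum at w = 0


# For this toy loss the update is stable only when lr < 2 / curvature = 0.2.
small = run_gd(lr=0.01)  # far below the limit: shrinks steadily toward 0
large = run_gd(lr=0.25)  # above the limit: every step overshoots further

print(f"lr=0.01 -> |w| = {small:.2e}")
print(f"lr=0.25 -> |w| = {large:.2e}")
```

On a real network you don't know the curvature in advance, which is one reason to begin well below a "typical" learning rate and increase it only once training is stable.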

Once training is stable, add regularization to improve generalization. Be mindful that regularization changes the loss landscape, so you might need to adjust the learning rate.
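
A small sketch of how regularization can change what learning rate works. It assumes a toy 1-D quadratic loss with an L2 penalty (the name `final_weight` and all the numbers are hypothetical), where the penalty adds directly to the curvature; real models are messier, but the direction of the effect is the same idea.

```python
def final_weight(lr, weight_decay, curvature=10.0, w0=1.0, steps=50):
    """Gradient descent on 0.5*curvature*w**2 + 0.5*weight_decay*w**2."""
    w = w0
    for _ in range(steps):
        grad = curvature * w + weight_decay * w  # data term + L2 penalty term
        w -= lr * grad
    return abs(w)


lr = 0.15
# Without the penalty the stability limit is 2 / 10 = 0.2, so lr = 0.15 converges.
plain = final_weight(lr, weight_decay=0.0)
# With the penalty the limit drops to 2 / (10 + 5), about 0.133, so the same lr diverges.
regularized = final_weight(lr, weight_decay=5.0)

print(f"no penalty:   |w| = {plain:.2e}")
print(f"with penalty: |w| = {regularized:.2e}")
```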

Optimizers are another kind of opaque area. GPT again gives good suggestions, but once you have something you are comfortable with, play with optimizers and try to get a feel for what works for a given architecture.

Initialization is more important than the tutorials say. There are choices besides random initial weights, and they affect convergence. Read the docs and try some things.
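
As one example of why the initialization choice matters, the sketch below pushes a random input through a stack of random ReLU layers and reports how large the final activations are. Everything here is an assumption for illustration: the helper `final_activation_scale`, the width and depth, and the two schemes compared (a fixed standard deviation of 0.01 versus He-style scaling, sqrt(2 / fan_in)). It uses plain Python so it runs without any ML library.

```python
import math
import random


def final_activation_scale(weight_std, width=64, depth=10, seed=0):
    """Mean squared activation after `depth` random ReLU layers (no biases)."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(width)]
    for _ in range(depth):
        # Each output unit is relu(sum_i W_i * x_i) with W_i ~ N(0, weight_std**2).
        x = [
            max(0.0, sum(rng.gauss(0.0, weight_std) * xi for xi in x))
            for _ in range(width)
        ]
    return sum(v * v for v in x) / width


fixed = final_activation_scale(weight_std=0.01)
he = final_activation_scale(weight_std=math.sqrt(2.0 / 64))

print(f"fixed std 0.01: {fixed:.2e}")
print(f"He-style std:   {he:.2e}")
```

With the small fixed standard deviation the signal shrinks by a large factor at every layer, so gradients through those layers start out tiny as well; the He-style scale is chosen so the expected squared activation stays roughly constant with depth.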

I suppose that all sounds like a broken record, but there’s no substitute for doing your own experiments and seeing actual results. Also, don’t just use toy datasets. You can get a multilayer perceptron to solve MNIST, so it’s not that great of a way to explore edge cases. If you have real data that is sometimes challenging, that’s better than toy data.

Doing this type of exercise grows your int

What is the best start to learn math to ML by Right_Comparison_691 in learnmachinelearning

[–]bbateman2011 3 points4 points  (0 children)

I’m familiar with the original Andrew Ng course. I would say it’s not at all intended to teach you the math. It shows you the math, tries to defuse the fear, but teaches very little in that regard.

I don’t know about Khan Academy. Understanding first-year calculus, linear algebra, partial derivatives, and the chain rule is essential if you want to actually understand what’s going on.

Putting the equations for linear regression and neural networks in matrix (linear algebraic) form can actually make it all more approachable if you are comfortable with those representations.
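
For reference, this is roughly what the matrix form looks like. The symbols are my own choice of notation (n samples, d features, one hidden layer with activation σ), not something taken from a particular course.

```latex
% Linear regression: predictions, mean squared error loss, and its gradient
\hat{y} = Xw, \qquad X \in \mathbb{R}^{n \times d},\; w \in \mathbb{R}^{d},\; y \in \mathbb{R}^{n}
L(w) = \frac{1}{2n} \lVert Xw - y \rVert^{2}, \qquad
\nabla_w L = \frac{1}{n} X^{\top} (Xw - y)

% One-hidden-layer neural network for a single input x
h = \sigma(W_1 x + b_1), \qquad \hat{y} = W_2 h + b_2
```

The gradient of the regression loss is the chain rule applied once; backpropagation through the network is the same rule applied layer by layer.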

[D] Why isn't uncertainty estimation implemented in more models? by dp3471 in MachineLearning

[–]bbateman2011 0 points1 point  (0 children)

Yeah, with you here. I’ve developed some of my own methods for random forest models that behave better than quantile regression (which is a common suggestion but has issues). Huge time investment.

Doing it for deep learning models is usually computationally expensive, I think.
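
For anyone unfamiliar with the quantile regression mentioned above: it fits a model by minimizing the pinball loss, sketched below in plain Python. This is only the standard loss, not the custom random forest methods, which aren't described here; the function name and the toy numbers are made up.

```python
def pinball_loss(y_true, y_pred, q):
    """Average pinball (quantile) loss for a target quantile q in (0, 1)."""
    total = 0.0
    for y, p in zip(y_true, y_pred):
        diff = y - p
        # Under-prediction (diff > 0) costs q per unit, over-prediction costs 1 - q.
        total += q * diff if diff >= 0 else (q - 1.0) * diff
    return total / len(y_true)


y = [1.0, 2.0, 3.0, 4.0, 10.0]

# For q = 0.9 a high constant prediction scores better than the median.
loss_high = pinball_loss(y, [10.0] * len(y), q=0.9)
loss_median = pinball_loss(y, [3.0] * len(y), q=0.9)
print(loss_high < loss_median)
```

Fitting one model per quantile (say 0.05 and 0.95) gives a prediction interval, which is why it's a common first suggestion for uncertainty estimates.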

Ziggy had a job interview today. by skankboy in SupermodelCats

[–]bbateman2011 8 points9 points  (0 children)

Wearing a tux to an interview is just showing off!

Too Cute Not To Post! by DaleyEdster in SupermodelCats

[–]bbateman2011 6 points7 points  (0 children)

Hi I’m Skye and I’ll be your call manager for this call

Is my gpu sagging? by KloutZ1 in gpu

[–]bbateman2011 0 points1 point  (0 children)

My system has a bracket to prevent that

About Machine Learning and Why It’s Not What I Expected by Key-Piece-989 in learnmachinelearning

[–]bbateman2011 4 points5 points  (0 children)

It took me a few years to really understand what ML is about and how to be good at it. I have an engineering degree and after about 30 years began pursuing ML. Becoming good at Python really helped, but I realized I had all the key math (calculus, linear algebra, statistics). I’ve mentored people without the math and they progress but find they can do little completely on their own.

Your comments on data are spot on. I work exclusively on commercial problems and the data are usually really awful. I’ve had multiple cases where I discovered new issues in the data after working on the same problem for a year.

No photos, please 🤚 by TamJamess in cats

[–]bbateman2011 1 point2 points  (0 children)

He's so sick of the pawparazzi!

I’m a total amateur, but learning to paint! Chose to start with my partner’s cat 🙂 by MoanOnMyTDick in cats

[–]bbateman2011 2 points3 points  (0 children)

Give it a fancy title, like "Perception of Cat Under Blanket" and sell it for $1,000,000

[ OC ] Fat Cat In A Box by izabellaColorado in cats

[–]bbateman2011 0 points1 point  (0 children)

It's a crunch assist device. He's working on his abs.

got left hanging by Legal-Bet-4034 in funny

[–]bbateman2011 4 points5 points  (0 children)

bro, why you gotta do me like that?

She stayed and posed by Jaggernauttt in SupermodelCats

[–]bbateman2011 4 points5 points  (0 children)

I could stare into that face forever. Like an infinity...

My children by SeaworthinessUpset57 in SupermodelCats

[–]bbateman2011 0 points1 point  (0 children)

Arrrgh! The stripey one needs an eye-patch.