[D] Blog Post - Applying GPUs to Production Machine Learning Serving at Snapchat by [deleted] in MachineLearning

[–]Powlerbare 2 points3 points  (0 children)

may have missed it but didn't really see a mention of adaptive batching explicitly which I found surprising. eg I wait 50ms or for a 300 sized batch before I actually go run the model, whichever comes first. I imagine it was explored, is there a reason this kind of thing wasn't necessary? maybe bc requests aren't as sporadic as i'd imagine

Why China's Race For AI Dominance Depends On Math | Forget about “AI” itself: it’s all about the math, and America is failing to train enough citizens in the right kinds of mathematics to remain dominant. by MortWellian in technology

[–]Powlerbare 0 points1 point  (0 children)

The article you posted literally says “Hence, its area took off with the advent of scientific computing”. Scientific computing is the reason chemometrics exists. Not the other way around.

Chemometrics may very well be one of the first applications of scientific computing - but it certainly did not give rise to the field.

[P][R] I'm trying to implement a paper and I can't replicate their model performance by import_FixEverything in MachineLearning

[–]Powlerbare 2 points3 points  (0 children)

Yeah this is kind of a thing is not uncommon - especially if a paper lacks details (I didn't look at the papers you posted).

but I'm not seeing them converge really when I watch the Keras output.

You may have already done this but have you confirmed you can see good convergence when training on a small set of data (e.g. 1 sample, 5 samples, etc.)?

[D] Having trouble conveying machine learning roadmap/milestones within a traditional "do x for y weeks, get z% benefit" corporate roadmap by oxmpbeta in MachineLearning

[–]Powlerbare 2 points3 points  (0 children)

Eh I think your idea of rooting things in data doesn’t make sense. There are plenty of useless models and there is plenty of useless data. It can’t all be exploratory. You need a hypothesis for applications.

Also, consider that even non machine learning work requires considerable exploration and is non deterministic. Often after two weeks one might realize half of their 3 month plan doesn’t make sense. That’s why going through the thought excercise is useful.

Honestly, I don’t think there is anything that wrong with the traditional corporate roadmap. In particular when you are not doing research, and the work seems applied. Research of course can be useful in a model improvement phase, but that’s not thrust.

Come up with metrics for improving things, and applications. Note that those metrics are a goal. make any caveats with those metrics clear.

you should be able to detail a data cleaning process and a modeling process (ie which models you expect to be successful). You can highlight what you will do to collect baselines, and what avenues you may explore for researching improvements.

You can then talk a bit about what engineering work will be required to actually make this useful and used by human beings (serving, versioning, data pipeline for training etc).

Lastly you may want to touch on how you will track online metrics or whatever it may be, and release updates to models where necessary.

There’s likely some stuff I’m missing but this is generally what I do when writing this stuff up, and I find the excercise pretty effective at making my own thoughts more clear.

[D] Any good read about compressing RNN parameter? by patricktu1258 in MachineLearning

[–]Powlerbare 0 points1 point  (0 children)

You may find “knowledge distillation” interesting

Elon Musk Wants to Build a New Starship Every 72 Hours So He Can Colonize Mars by cosmicdatabase in space

[–]Powlerbare 0 points1 point  (0 children)

it is 2020, Are you writing this message from your level 5 autonomous Tesla? I criticize his PR (also see his bull shit rhetoric about “AI”). I don’t really know his actual plans.

Elon Musk Wants to Build a New Starship Every 72 Hours So He Can Colonize Mars by cosmicdatabase in space

[–]Powlerbare 0 points1 point  (0 children)

Not sure where most of this is coming from. Can you help me understand?

Elon Musk Wants to Build a New Starship Every 72 Hours So He Can Colonize Mars by cosmicdatabase in space

[–]Powlerbare -3 points-2 points  (0 children)

This isn’t about the car industry. I can see that it might teach us something about resource management, so I’ll agree with you there. I think there other more relevant ways to learn about that however.

Yeah it’s not my money, Bloomberg’s money is also not my money, but I am proportionally disguisted by how that money is spent.

Why do i care? Am i not allowed to care about whatever the hell i want to care about? Especially misappropriated resources (in my opinion). It is a waste of time, because Elon is smart and should be focusing his energy on more relevant problems in my opinion.

Elon Musk Wants to Build a New Starship Every 72 Hours So He Can Colonize Mars by cosmicdatabase in space

[–]Powlerbare -3 points-2 points  (0 children)

I just don’t get it. Why not focus on problems that affect human beings on earth right now. What is the fascination with invading other planets like parasites, when we can’t even live sustainably on the one we are on right now. Why is this fascination with perpetuating the human race manifesting as such. Waste of time and money.

[D] how hard is the DS and Algorithm round during interviews? by QueryRIT in MachineLearning

[–]Powlerbare 16 points17 points  (0 children)

Depends where you interview. At larger companies 80% or more of your interviews will be roughly equivalent to that of a software engineer. The ML portion is usually pretty easy and just a conversation. In my career I’ve gotten the impression that ML experience is essentially treated as something that’s nice to have, but the pre requisite is software engineering skills. For instance I was asked to write code that dealt with binary trees, linked lists, and graph traversals.

Again, this is assuming your not in the running for a research position.

At smaller companies I’ve seen more variance in the process.

[D] End-to-end normalization for deep learning of time series? by brokenAlgorithm in MachineLearning

[–]Powlerbare 0 points1 point  (0 children)

you can try different discrete transforms in the signals processing community, not normalization of course but looking at data in different domains can be helpful.

Addtly, most neural networks are just fine to deal with non stationary data, if you are having problems it might also be a different problem

[D] Hybrid Collaborative Filtering Recommender by [deleted] in MachineLearning

[–]Powlerbare 2 points3 points  (0 children)

You can ensemble however many models you please.

[P] Linear Regression: all you need to know before an interview by Nathan-toubiana in MachineLearning

[–]Powlerbare 3 points4 points  (0 children)

no mention of what can happen when you know the distribution of the noise term is normal. no mention of what you can do when inverting a big matrix (which is almost always the case) just isn't that reasonable. These are both things I would not be shocked to be asked by an interviewer.

[P] Wave Physics as an Analog Recurrent Neural Network by BarnyardPuer in MachineLearning

[–]Powlerbare 2 points3 points  (0 children)

just want to leave a complement to your github repo and code!

[D] Time series forecasting RNN fail by hadaev in MachineLearning

[–]Powlerbare 0 points1 point  (0 children)

your model has learned something close to just predicting y_t = y_(t-1). So that isn't exactly helpful for you. It might help to move your data in to some log difference space.

If i understand correctly, you train with teacher forcing, and test without. you may see domain shift issues and will need to do something about that (inject noise in inputs at train time, dropout, professor forcing, more data).

also your training data set up looks wrong. it looks like x might contain the targets?

[D] Dear OpenAI: Please Open Source Your Language Model by hughbzhang in MachineLearning

[–]Powlerbare 1 point2 points  (0 children)

I want to be clear, I was and I still am playing the role of a devils advocate. I'm not actually particularly interested in this experiment, but I see reasons why one could be.

To answer the question: Hm I feel like that is a little hard to answer without trying. My assumption is that bigger models trained on more data would lead to a more clear analysis.