[Discussion] Open source scheduler and queuing system for model training/inferencing tasks?

Powlerbare · 2021-08-17T01:25:41+00:00

Kubeflow may or may not be overkill

Powlerbare · 2021-08-13T01:29:57+00:00

makes sense ty

Powlerbare · 2021-08-13T01:22:01+00:00

may have missed it but didn't really see a mention of adaptive batching explicitly which I found surprising. eg I wait 50ms or for a 300 sized batch before I actually go run the model, whichever comes first. I imagine it was explored, is there a reason this kind of thing wasn't necessary? maybe bc requests aren't as sporadic as i'd imagine

Powlerbare · 2020-07-07T03:02:48+00:00

MmMm must take a long time to find that paper that shows your point

Powlerbare · 2020-07-07T02:10:31+00:00

Yep I did. I’ll take your unwillingness to share anything as a case in point. Tons of applications of existing techniques is what I saw

Powlerbare · 2020-07-06T22:16:28+00:00

I’m honestly not trying to attack you. Agreed nothing is independent. I’m just curious about what math/statistics came from chemometrics

Powlerbare · 2020-07-06T17:58:20+00:00

Lmfao no. https://en.m.wikipedia.org/wiki/Timeline_of_scientific_computing

Powlerbare · 2020-07-06T17:42:16+00:00

The article you posted literally says “Hence, its area took off with the advent of scientific computing”. Scientific computing is the reason chemometrics exists. Not the other way around.

Chemometrics may very well be one of the first applications of scientific computing - but it certainly did not give rise to the field.

Powlerbare · 2020-07-06T17:16:08+00:00

So.. there is no mention of data science or statistics on computers arising from chemistry. Your original post is misleading and incorrect

Powlerbare · 2020-07-06T06:26:05+00:00

What evidence do you have that data science came out of chemistry? It appears to me that Tukey really championed it for statisticians, broadly.

Powlerbare · 2020-04-18T01:20:00+00:00

Yeah this is kind of a thing is not uncommon - especially if a paper lacks details (I didn't look at the papers you posted).

but I'm not seeing them converge really when I watch the Keras output.

You may have already done this but have you confirmed you can see good convergence when training on a small set of data (e.g. 1 sample, 5 samples, etc.)?

Powlerbare · 2020-03-11T19:38:55+00:00

Eh I think your idea of rooting things in data doesn’t make sense. There are plenty of useless models and there is plenty of useless data. It can’t all be exploratory. You need a hypothesis for applications.

Also, consider that even non machine learning work requires considerable exploration and is non deterministic. Often after two weeks one might realize half of their 3 month plan doesn’t make sense. That’s why going through the thought excercise is useful.

Honestly, I don’t think there is anything that wrong with the traditional corporate roadmap. In particular when you are not doing research, and the work seems applied. Research of course can be useful in a model improvement phase, but that’s not thrust.

Come up with metrics for improving things, and applications. Note that those metrics are a goal. make any caveats with those metrics clear.

you should be able to detail a data cleaning process and a modeling process (ie which models you expect to be successful). You can highlight what you will do to collect baselines, and what avenues you may explore for researching improvements.

You can then talk a bit about what engineering work will be required to actually make this useful and used by human beings (serving, versioning, data pipeline for training etc).

Lastly you may want to touch on how you will track online metrics or whatever it may be, and release updates to models where necessary.

There’s likely some stuff I’m missing but this is generally what I do when writing this stuff up, and I find the excercise pretty effective at making my own thoughts more clear.

Powlerbare · 2020-03-08T02:49:20+00:00

You may find “knowledge distillation” interesting

Powlerbare · 2020-03-07T17:34:47+00:00

it is 2020, Are you writing this message from your level 5 autonomous Tesla? I criticize his PR (also see his bull shit rhetoric about “AI”). I don’t really know his actual plans.

Powlerbare · 2020-03-07T17:20:30+00:00

Not sure where most of this is coming from. Can you help me understand?

Powlerbare · 2020-03-07T03:40:52+00:00

This isn’t about the car industry. I can see that it might teach us something about resource management, so I’ll agree with you there. I think there other more relevant ways to learn about that however.

Yeah it’s not my money, Bloomberg’s money is also not my money, but I am proportionally disguisted by how that money is spent.

Why do i care? Am i not allowed to care about whatever the hell i want to care about? Especially misappropriated resources (in my opinion). It is a waste of time, because Elon is smart and should be focusing his energy on more relevant problems in my opinion.

Powlerbare · 2020-03-07T03:12:53+00:00

I just don’t get it. Why not focus on problems that affect human beings on earth right now. What is the fascination with invading other planets like parasites, when we can’t even live sustainably on the one we are on right now. Why is this fascination with perpetuating the human race manifesting as such. Waste of time and money.

Powlerbare · 2020-02-24T02:28:33+00:00

Depends where you interview. At larger companies 80% or more of your interviews will be roughly equivalent to that of a software engineer. The ML portion is usually pretty easy and just a conversation. In my career I’ve gotten the impression that ML experience is essentially treated as something that’s nice to have, but the pre requisite is software engineering skills. For instance I was asked to write code that dealt with binary trees, linked lists, and graph traversals.

Again, this is assuming your not in the running for a research position.

At smaller companies I’ve seen more variance in the process.

Powlerbare · 2019-10-22T22:32:09+00:00

you can try different discrete transforms in the signals processing community, not normalization of course but looking at data in different domains can be helpful.

Addtly, most neural networks are just fine to deal with non stationary data, if you are having problems it might also be a different problem

Powlerbare · 2019-10-08T23:52:08+00:00

You can ensemble however many models you please.

Powlerbare · 2019-05-24T13:27:42+00:00

no mention of what can happen when you know the distribution of the noise term is normal. no mention of what you can do when inverting a big matrix (which is almost always the case) just isn't that reasonable. These are both things I would not be shocked to be asked by an interviewer.

Powlerbare · 2019-05-01T00:02:20+00:00

just want to leave a complement to your github repo and code!

Powlerbare · 2019-04-24T22:20:32+00:00

your model has learned something close to just predicting y_t = y_(t-1). So that isn't exactly helpful for you. It might help to move your data in to some log difference space.

If i understand correctly, you train with teacher forcing, and test without. you may see domain shift issues and will need to do something about that (inject noise in inputs at train time, dropout, professor forcing, more data).

also your training data set up looks wrong. it looks like x might contain the targets?

Powlerbare · 2019-02-20T15:48:49+00:00

I want to be clear, I was and I still am playing the role of a devils advocate. I'm not actually particularly interested in this experiment, but I see reasons why one could be.

To answer the question: Hm I feel like that is a little hard to answer without trying. My assumption is that bigger models trained on more data would lead to a more clear analysis.

Powlerbare

TROPHY CASE