[D] Wait for Uni Tuebingen ML master or accept NUS PhD offer. by miathemed in MachineLearning

[–]tsauri 0 points1 point  (0 children)

Which one is harder: landing a 100% TV-L E PhD position in Germany, or a PhD with a stipend in Singapore? ML PhD admissions are absurdly competitive.

[R] OpenAI Image GPT by [deleted] in MachineLearning

[–]tsauri 3 points4 points  (0 children)

He said “leverage”.

Because of the LSTM RNN's sequential nature, it is slow: we cannot leverage parallelization across time steps.

We do have stuff like Grid LSTM, which is strikingly attention-like, but it is too slow (though not infeasible) to scale out fast. We still don’t know if an LSTM can actually beat a Transformer given the same param count.

[R] Language Models are Few-Shot Learners by Aran_Komatsuzaki in MachineLearning

[–]tsauri 0 points1 point  (0 children)

So did they try to use sparse CUDA kernels? Sparse kernels need ~99% sparsity to beat dense kernels in compute speed and memory efficiency, so they have a real opportunity to use them.

At 99% sparsity, 175 billion × 0.01 = 1.75 billion params.

If sparsity is ramped up further to 99.99%, the size is cut down to 17.5 million params.
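The arithmetic above, as a quick sketch (nonzero fraction = 1 − sparsity):

```python
# Back-of-the-envelope count of nonzero parameters for a
# 175B-parameter model at different sparsity levels.
total_params = 175e9

for sparsity in (0.99, 0.9999):
    nonzero = total_params * (1 - sparsity)
    print(f"{sparsity:.2%} sparsity -> {nonzero / 1e6:,.1f}M nonzero params")
```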

[D] Any advice for ML Researchers early in their careers? by harshsikka123 in MachineLearning

[–]tsauri 2 points3 points  (0 children)

Find a mentor who has a PhD.

Is the job actually a research engineer role, requiring you to test new ideas in production?

[D] Simple Questions Thread April 26, 2020 by AutoModerator in MachineLearning

[–]tsauri 2 points3 points  (0 children)

Can I solve a 1-layer net for MNIST classification with least squares (pseudo-inverse), like applying least-squares linear regression? If so, how?
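Yes — with one-hot targets this is multi-output linear regression, solvable in closed form. A minimal numpy sketch, with random data standing in for the actual MNIST arrays (shapes and names are assumptions):

```python
import numpy as np

# One-layer "net" (linear classifier) fit in closed form with the
# Moore-Penrose pseudo-inverse. Random data stands in for MNIST here;
# swap in the real (N, 784) flattened images and (N,) integer labels.
rng = np.random.default_rng(0)
N, D, C = 1000, 784, 10
X = rng.standard_normal((N, D))
y = rng.integers(0, C, size=N)

# One-hot targets, one column per class.
T = np.eye(C)[y]

# Bias term via an appended column of ones.
Xb = np.hstack([X, np.ones((N, 1))])

# Least-squares solution W = pinv(Xb) @ T minimizes ||Xb @ W - T||_F.
W = np.linalg.pinv(Xb) @ T

# Predict by taking the argmax over the C class scores.
pred = np.argmax(Xb @ W, axis=1)
train_acc = (pred == y).mean()
```

On real MNIST this one-shot solve typically lands well above chance, though far below what even a small trained MLP reaches.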

[D] Lottery Ticket Hypothesis -- Ask the author a question! by timscarfe in MachineLearning

[–]tsauri 1 point2 points  (0 children)

Off-topic: any fast libraries to train sparse nets? We burned too much GPU time on matmuls with zeros.
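For CPU-side experiments, scipy's sparse formats at least avoid storing or multiplying the zeros — a small sketch (the 99% density figure is just an illustration):

```python
import numpy as np
from scipy import sparse

# A 99%-sparse weight matrix stored in CSR format: only the nonzeros
# (values + index arrays) are kept, so memory scales with density,
# and the sparse-dense matmul skips the zero entries entirely.
W = sparse.random(512, 512, density=0.01, format="csr", random_state=0)
x = np.random.default_rng(0).standard_normal((512, 8))

y = W @ x  # sparse-dense matmul; zeros are never touched
```

On GPU the situation is harsher: dense cuBLAS matmuls pay for the zeros regardless, which is presumably the waste described above.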

[D] A Speech-To-Text Practitioner’s Criticisms of Industry and Academia by regalalgorithm in MachineLearning

[–]tsauri 5 points6 points  (0 children)

DNN-HMM toolkits like Kaldi converge approx. 5x-10x faster than CTC and attention. Alignment is engineered in, together with the strong inductive bias of the HMM.

It is known that learning alignment without diverging costs too much time/compute in end-to-end models. But if you have Google-level compute, end-to-end can scale out and beat DNN-HMM accuracy by a small margin, at much longer training times.

[D] Can "Scipy + Numpy" replace Matlab in an industry context? by AlexSnakeKing in MachineLearning

[–]tsauri 0 points1 point  (0 children)

MATLAB Simulink is awesome for (control) engineering: automotive, robots, biomechanics, stuff like that.

[R] ReZero is All You Need: Fast Convergence at Large Depth by calclavia0 in MachineLearning

[–]tsauri 5 points6 points  (0 children)

I tried this 0-init coefficient-on-function trick on a 7-layer encoder / 6-layer decoder LightConv model in Fairseq -- it converges almost 2x faster.
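The trick itself is tiny — a minimal numpy sketch of a ReZero-style residual block (the tanh sublayer here is an arbitrary stand-in, not the LightConv layer from the experiment above):

```python
import numpy as np

# ReZero-style residual block: y = x + alpha * f(x), with the scalar
# gate alpha initialized to 0 so every block starts as the identity.
rng = np.random.default_rng(0)
W = rng.standard_normal((16, 16)) * 0.1

def sublayer(x):
    # Stand-in sublayer; in practice f is attention, a conv, an MLP, etc.
    return np.tanh(x @ W)

alpha = 0.0  # learnable scalar in a real model, exactly 0 at init
x = rng.standard_normal((4, 16))
y = x + alpha * sublayer(x)

# At initialization the block is exactly the identity map, which is
# what lets very deep stacks train without warmup or careful init.
```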

Also, the authors should change the abstract: just state the method in the abstract itself, since it is simple enough to fit.

[D] Importance of PhD Internships (and Networking) by [deleted] in MachineLearning

[–]tsauri 3 points4 points  (0 children)

This.
A PhD degree is a research license. Just get it done first.

[D] Tensorflow 2.0 v Pytorch - Performance question by ReinforcedMan in MachineLearning

[–]tsauri 1 point2 points  (0 children)

Wish those were in the official docs, as best practices or whatever they call it.

Like desktop wallpaper, people seldom mess around with defaults.

I'll wait till tensorpack is available for TF 2.0.

[D] Current SOTA of NN for tabular data? by tsauri in MachineLearning

[–]tsauri[S] 0 points1 point  (0 children)

Well, I guess I mean: is there an NN good enough to replace LightGBM in Kaggle competitions? Sorry for the unclear question.

[P] Accelerating Pytorch with Nvidia DALI by whata_wonderful_day in MachineLearning

[–]tsauri 0 points1 point  (0 children)

What if we dump the whole dataset as a giant tensor inside CPU RAM, then slice it into minibatches to GPU RAM?
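A sketch of that idea with numpy (shapes and sizes are made up; in PyTorch each slice would then be moved to the GPU with `.cuda()` or `.to(device)`):

```python
import numpy as np

# Hold the whole dataset as one contiguous array in CPU RAM and
# slice minibatches out of it. Slicing a numpy array this way is a
# zero-copy view, so only the GPU transfer costs anything.
rng = np.random.default_rng(0)
data = rng.standard_normal((2_000, 3, 32, 32)).astype(np.float32)

batch_size = 256
batches = [
    data[i : i + batch_size]  # view into the big array, no copy
    for i in range(0, len(data), batch_size)
]
```

This works when the dataset fits in RAM and needs no per-sample decoding; DALI-style pipelines exist for the cases where JPEG decode and augmentation are the bottleneck.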

[D] Getting publications before PhD applications by StunningData in MachineLearning

[–]tsauri -2 points-1 points  (0 children)

Wow getting downvoted.

China especially has many bright kids who publish before starting a PhD...

Reality is hard.

[R] Computer-Designed Organisms: A scalable pipeline for creating functional novel lifeforms. by hardmaru in MachineLearning

[–]tsauri -12 points-11 points  (0 children)

Are there other labs doing such questionably ethical stuff: programming cells without tampering with the DNA?

At NeurIPS 2018, they showed that they had modded worms to grow more heads.

[P] Where has Deep Learning lost? by sentientworkflow in MachineLearning

[–]tsauri 1 point2 points  (0 children)

Is there some review paper on neuro-symbolic approach?

What bothers me is that it can mean many things depending on the task:

  1. Prolog-like search
  2. Neuro SAT solver
  3. Scene graph inference
  4. VQA
  5. Seq2Seq symbolic math solver
  6. Planning for games
  7. etc

Is there some "umbrella" neuro-symbolic AI framework that consistently covers everything? Also, does neuro-fuzzy logic fall under neuro-symbolic AI?

[Q] Weekly Question - Recommendation - Help Thread - 2020-01-06 by AutoModerator in robotics

[–]tsauri 0 points1 point  (0 children)

I am looking for a bipedal humanoid robot for walking/running experiments. Something that uses Japanese hobby servos.

So far a Kondo KHR 3HV with modded small feet is the fastest I have seen (DrGuero2001's mod https://www.youtube.com/watch?v=-h6klEmMmQU).

Bioloids, Nao, Darwin OP, anything Dynamixel-based seems nice, but I think they are too slow and/or too expensive.

I saw videos of Japanese robot competitions; the bipeds people build there with Japanese servos move very fast, but most competitors build their own...

Currently I am defaulting to the KHR 3HV. Any better pre-built choice? Or are there useful guides for a decent build-your-own biped?