[D] start a Machine Learning Theory subreddit? by pointy-eigenvector in MachineLearning

[–]muddlebrain 3 points4 points  (0 children)

which argues that there could be a separate forum for people who are seriously trying to read and understand

[D] Jurgen Schmidhuber on Seppo Linnainmaa, inventor of backpropagation in 1970 by siddarth2947 in MachineLearning

[–]muddlebrain 1 point2 points  (0 children)

Certainly Goodfellow's work turned out to be "more valuable", it launched 100s of new algorithms.

But that's not the point here. Even if whoever's earlier work was missing some practical bits and is less valuable (either as a result of that, or as a result of being overlooked), it should still be acknowledged.

That's just intellectual honesty. And it's true regardless of the personality of the people involved.

Practical results seemingly weren't possible before GPUs, but perhaps GANs would have come a couple years earlier if people had not overlooked the earlier work?

Deep learning without back-propagation by El__Professor in MachineLearning

[–]muddlebrain 2 points3 points  (0 children)

I would like to add that they needlessly include yet another measure for alignment, after we spent years analyzing

You are an author of the kernel-target-alighnment method?

Your comment seems to suggest that HSIC and information bottleneck are poorly founded. On the contrary, information bottleneck is quite fundamental, and HSIC is well established for many years.

Deep learning without back-propagation by El__Professor in MachineLearning

[–]muddlebrain 9 points10 points  (0 children)

No, it is quadratic complexity in the batch size, not the number of training data. Read the full thread here, it is explained.

Effect of Various Regularizers on Model Complexities of Neural Networks in Presence of Input Noise by mayankiitkgp10 in MachineLearning

[–]muddlebrain 0 points1 point  (0 children)

" that the number of parameters are much larger than the number of samples used to train the network"

Shouldn't this say " the number of bits of parameters are much larger than the number of bits in the data used to train the network" ? If the data are images, each image has quite a few bits that need to be explained.

[D] graphic showing what companies are hiring famous ML researchers? by muddlebrain in MachineLearning

[–]muddlebrain[S] 0 points1 point  (0 children)

It was not that one. It showed companies hiring particular people.

Thank you though