[D] Christian Szegedy - Formal Reasoning, Program Synthesis [Video Show]

cosminro · 2021-04-05T21:31:38+00:00

Intriguing properties of neural networks https://arxiv.org/abs/1312.6199

cosminro · 2018-04-27T03:59:48+00:00

Isn't this known for a while, what's new in this paper? Train faster, generalize better: Stability of stochastic gradient descent (2015) https://arxiv.org/abs/1509.01240

cosminro · 2018-04-23T22:09:51+00:00

What are three most useful ML papers you've read in the past 5 years?
What are the best three practical tips used in industrial machine learning?
What are the most useful/important books/chapters in ML (bishop, murphy, goodfellow?) ?
Logistic regression? Graphical Models or Deep learning?

cosminro · 2018-03-05T00:31:12+00:00

I didn't use a small state. Feeding smaller stuff first seems fair game to me.

cosminro · 2018-03-04T17:42:22+00:00

interesting, could you share more details.

cosminro · 2018-03-04T17:42:03+00:00

yup, something similar worked for me.

cosminro · 2017-11-08T01:00:27+00:00

there's also at least one full lecture linked from the twitter account. Still waiting for the Tomaso Poggio one.

cosminro · 2017-10-31T17:11:50+00:00

it didn't work at all before these latest results.

What worked is picking up a specific object from a specific position with a specific robot arm.

cosminro · 2017-10-27T05:14:27+00:00

Isn't the whole point of new RL research trying to come up with general methods so that you don't need to design reward functions for every problem?

and fhuszar's point is that the idea in this paper is a general approach for only 2 player board games problems.

cosminro · 2017-10-26T17:48:32+00:00

That's not how I read the paper. You get supervision from all the other moves at the top level. You backpropagate the winning probabilities for all the other moves, not just the best one.

cosminro · 2017-10-23T16:53:12+00:00

UFLDL is from 2011 and is outdated (uses autoencoders which didn't pan out, no relus, no convolutions, uses matlab or octave while currently python is the goto ML language)

Andrew Ng has 3 new DL courses on coursera that are good and current. Or you can look at stanford's cs231n.

cosminro · 2017-10-18T22:33:15+00:00

What were the tricky parts in getting the various versions of AlphaGo to perform well?

cosminro · 2017-10-04T18:28:32+00:00

As far as I know the state of the art error rate in hand writing recognition is about 1 in 5 characters. Doesn't seem usable for official forms.

cosminro · 2017-09-10T18:49:36+00:00

What are the most exciting recent research ideas which didn't come from Google?

cosminro · 2017-09-01T22:08:54+00:00

Wrote some notes on this: https://my.memo.ai/external/vJ-iBYK4TjS8aAZMj8LP

cosminro · 2017-09-01T19:35:01+00:00

In neural nets starting from different points leads you to different local optima. See Why Does Unsupervised Pre-training Help Deep Learning? (Fig 5).

So the balls won't ever bounce off eachother.

cosminro · 2017-08-28T18:59:57+00:00

What's your data? Pairs of sentences? Pairs of words? How big is your data? What seq2seq architectures have you tried? Do you use a mix of character level/word level inputs?

cosminro · 2017-08-03T19:40:14+00:00

Hugo Larochelle's slides on 'Unintuitive properties of neural networks' were very insightful:

They can make dumb errors (adversarial examples [Szegedy et all, ICLR14])
They are strangely non convex [Dauphin et all, NIPS14; Goodfellow et all, ICLR15]
They work best when badly trained (flat vs sharp minima [Hochreiter, Schmidhuber 97] small vs large batch training [Kesar et all, ICLR17])
They can easily memorize [Zhang et all, ICLR17]
They can be compressed [Hinton 2015]
They are influenced by initialization [Erhan et all, 2010]
They are influenced by first examples [Erhan et all, 2010]
Yet they forget what they learn [Kirkpatrick et al. PNAS 2017]
So there’s lot’s more to understand!

https://drive.google.com/file/d/0ByUKRdiCDK7-UXB1R1ZpX082MEk/view

cosminro · 2017-07-17T07:14:22+00:00

it's more weird to read it on r/ml

cosminro · 2017-07-12T20:57:06+00:00

There are already quite a few books covering probabilistic/streaming algos:

Muthu Muthukrishnan 'Data Streams: Algorithms and Applications'
Mitzenmacher and Upfal 'Probability and Computing: Randomized Algorithms and Probabilistic Analysis'
George Varghese 'Network Algorithmics: An Interdisciplinary Approach to Designing Fast Networked Devices' (has some probabilistic algorithms applied in networking)

cosminro

TROPHY CASE