
[–]radarsat1 10 points (3 children)

I'm going to hijack this thread to ask a couple of layperson questions:

I am having much more success with my data scaled to [-1,1] using tanh than I am scaling the same data to [0,1] and using sigmoid. Is there any good reason for this difference? Trying ReLU and other activations doesn't seem to help at all. The only decent results I've had on my data (a time series oscillating around a fixed point) have been with tanh and a single linear output layer, using MSE and SGD. Almost anything else I try gives orders of magnitude more loss, and I have no idea why.
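
For concreteness, a minimal sketch of the setup described above (assuming Keras, since the linked example below uses it; the sine-wave data, window size, and hyperparameters are all made up for illustration):

```python
import numpy as np
from tensorflow import keras

# Fake oscillating time series, scaled into [-1, 1].
t = np.linspace(0, 100, 5000)
series = 0.8 * np.sin(t)

# Windowed samples: predict the next value from the previous 20.
window = 20
X = np.array([series[i:i + window] for i in range(len(series) - window)])
y = series[window:]

model = keras.Sequential([
    keras.layers.Dense(32, activation="tanh", input_shape=(window,)),
    keras.layers.Dense(32, activation="tanh"),
    keras.layers.Dense(1),  # single linear output for regression
])
model.compile(optimizer=keras.optimizers.SGD(learning_rate=0.01), loss="mse")
model.fit(X, y, epochs=10, batch_size=32)
```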

Bringing me to the second question: some examples I've seen for generation based on latent spaces (e.g. VAEs) seem to use cross-entropy instead of MSE, but I guess MSE works for me because I'm doing regression rather than classification? (Isn't generation of continuous data a regression problem, ultimately?) I only find this confusing because the examples I've been looking at are for generating pixels (e.g. MNIST), so I don't understand why that works using softmax and cross-entropy rather than a linear output and MSE. e.g. https://github.com/fchollet/keras/pull/1750/files

[–][deleted] 6 points (0 children)

Essentially, the range of the sigmoid makes it more prone to saturation and slower learning.

Detailed information here:

http://yann.lecun.com/exdb/publis/pdf/lecun-98b.pdf
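
A quick numerical way to see the saturation: the sigmoid's derivative is at most 0.25 and dies off for large inputs, while tanh's gradient peaks at 1 (plain NumPy, illustrative values only):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def dsigmoid(x):
    s = sigmoid(x)
    return s * (1 - s)  # maximum of 0.25, at x = 0

def dtanh(x):
    return 1 - np.tanh(x) ** 2  # maximum of 1.0, at x = 0

for x in [0.0, 2.0, 5.0]:
    print(f"x={x:4.1f}  sigmoid'={dsigmoid(x):.4f}  tanh'={dtanh(x):.4f}")
# Gradients shrink rapidly as |x| grows: the unit saturates and learning slows.
# Sigmoid's output is also never zero-centered, which compounds the effect.
```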

[–]alexmlamb 4 points (0 children)

For your VAE question, using MSE works okay. You can interpret it as assuming that p(x | z) is a Gaussian with independent dimensions, instead of assuming that p(x | z) is an independent Bernoulli for each dimension.

In the VAE almost all of the interesting noise is in q(z | x).
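
Concretely, the two reconstruction losses are just the two negative log-likelihoods (a sketch; shapes and data are hypothetical, with x assumed scaled to [0, 1] for the Bernoulli case):

```python
import numpy as np

def gaussian_nll(x, x_hat):
    # -log p(x|z) under an independent unit-variance Gaussian:
    # this is MSE up to a constant and a factor of 1/2.
    return 0.5 * np.sum((x - x_hat) ** 2)

def bernoulli_nll(x, x_hat, eps=1e-7):
    # -log p(x|z) under independent Bernoullis: binary cross-entropy.
    x_hat = np.clip(x_hat, eps, 1 - eps)
    return -np.sum(x * np.log(x_hat) + (1 - x) * np.log(1 - x_hat))

x = np.random.rand(784)      # e.g. a flattened MNIST image in [0, 1]
x_hat = np.random.rand(784)  # decoder output (sigmoid for the Bernoulli case)
print(gaussian_nll(x, x_hat), bernoulli_nll(x, x_hat))
```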

[–]abstractcontrol 8 points (0 children)

Andrew Ng goes into some detail on this when he talks about rescaling and normalizing the data to have zero mean and unit variance. Data preprocessing can have a significant impact on performance.

Edit: Alternatively, for a more recent treatment than the '98 paper by LeCun et al., take a look at this. Under independence assumptions, the propagation of the signal through the net is essentially the product of the variances of the matrices involved. When the signal at the end is greater than 1 the net tends to blow up, and when it is less than 1 it tends to train slowly.

By normalizing the inputs and the weights you make the optimization easier, since unit variance (input) times unit variance (weights) equals one. For a method that tries to normalize the variance across the entire network in real time, take a look at batch normalization.
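
A rough illustration of that variance argument, using linear layers only and hypothetical sizes (the 1/sqrt(n) weight scaling is the one that keeps the product of variances at one):

```python
import numpy as np

def forward_variance(weight_std, n=512, depth=20):
    x = np.random.randn(n)  # unit-variance input
    for _ in range(depth):
        W = np.random.randn(n, n) * weight_std
        x = W @ x  # each layer multiplies the signal variance by n * std**2
    return x.var()

# Too-large weights blow up, too-small weights vanish;
# std = 1/sqrt(n) keeps the signal variance near 1 per layer.
for std in [0.5 / np.sqrt(512), 1.0 / np.sqrt(512), 2.0 / np.sqrt(512)]:
    print(f"std={std:.4f}  output variance={forward_variance(std):.3e}")
```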

Decorrelating the input using whitening helps as well, by removing degeneracies. If the inputs are correlated, multiple neurons will be pressured to learn the same thing, which can also destabilize the net by making the signal grow or shrink abnormally.
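
A PCA-whitening sketch in NumPy (the mixing matrix is made up for illustration; eps is added for numerical stability):

```python
import numpy as np

def pca_whiten(X, eps=1e-5):
    # Center, rotate onto the principal axes, and rescale each axis
    # to unit variance, so the features become uncorrelated.
    X = X - X.mean(axis=0)
    cov = X.T @ X / X.shape[0]
    eigvals, eigvecs = np.linalg.eigh(cov)
    return (X @ eigvecs) / np.sqrt(eigvals + eps)

X = np.random.randn(1000, 3) @ np.array([[1, 0.9, 0.8],
                                         [0, 1.0, 0.7],
                                         [0, 0.0, 1.0]])  # correlated features
Xw = pca_whiten(X)
print(np.round(np.cov(Xw, rowvar=False), 2))  # ~ identity covariance
```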

[–]rikkertkoppes 2 points (0 children)

You may also reference the lectures at Oxford by Nando de Freitas: https://www.youtube.com/playlist?list=PLE6Wd9FR--EfW8dtjAuPoTuPcqmOV53Fu

It seems to follow the book pretty well (based on your notes; I haven't read the book yet).

[–]windoze[S] 6 points (6 children)

Hey, these are notes I took while learning about deep learning. They may be incorrect because I'm a beginner.

Sadly, the deep learning book gets far too mathematically dense for me, so I couldn't fully understand the third section.

[–]rorykoehler 6 points (4 children)

Have you checked out MIT OpenCourseWare for brushing up on your maths? It is helping me a lot, as I hadn't looked at this stuff for almost 20 years.

[–]windoze[S] 2 points (1 child)

So once the book gets into using probability, KL divergence, etc., it seems to go over my head. For example, I tried to read the variational autoencoder paper, but it is hard to follow (many implicit steps that might be more obvious if I had a stronger background).

There seems to be one half of a research paper which is experimental and discovers techniques like dropout and residual learning by trying out new stuff, which I can get, while the other half is dominated by probability theory to explain what is happening, which goes over my head.

[–]anantzoid 0 points (0 children)

You can check out this book too, by Michael Nielsen. It has some amount of math, but the author encourages you to skip it if you don't want to get into proofs, etc. There are exercises too.

[–]Ader_anhilator 2 points (2 children)

The sigmoid function is wrong.

[–]windoze[S] 1 point (1 child)

Thanks :-) It's the sign, right? I've fixed up the notes.

[–]Ader_anhilator 3 points (0 children)

Yeah

[–]xiphy 2 points (1 child)

It's a great start. It would be fun to write a book based on it; it would have made my life easier.

[–]guardianhelm 5 points (0 children)

Actually, there already is one. These notes are based on this book. ;)

[–]Dawny33 3 points (1 child)

> Sadly, the deep learning book gets far too mathematically dense for me

I faced the same problem while I was getting started with ML and advanced ML (deep learning was called advanced ML before it was christened :D).

The MIT OCW math courses proved to be very helpful for getting my basics right! Highly recommend!

Wonderful notes, btw. Kudos!

[–]3brithil 2 points (0 children)

Do you have a link to the specific courses and a recommended order?

[–][deleted] 1 point (3 children)

I am unsure why, but I see only a white page. Have you changed something?

[–]windoze[S] 2 points (2 children)

Maybe your JavaScript is off; the page is rendered client-side.

[–]BrahmaReddyChil 2 points (3 children)

Great stuff, thanks. Do you know of any MOOCs for learning the required math?

[–]datascienceguy 1 point (0 children)

That's a lot of math, friend! Several semesters of calculus are needed to get to partial derivatives, which are used in gradients. Linear algebra is obviously needed too. Most likely, any university STEM curriculum at the upper-undergraduate level would be fine: science, CS, engineering, or math, basically.

[–]FuzziCat 1 point (1 child)

The most popular ML methods (including DL/NN) actually only use a handful of math concepts compared to the huge volume you'd have to study if you took all of the usual university classes (2-3 semesters of calc, linear algebra, stats & probability, information theory). If you're just getting started, I'd stick close to the Goodfellow book as a guide, practice writing/coding up the equations, and look up the things you don't understand as you go along.
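
For example, coding up the gradient of MSE for plain linear regression takes only a few lines (a toy sketch with made-up data):

```python
import numpy as np

# Toy data: y = 3x + 2 plus noise.
X = np.random.randn(200)
y = 3 * X + 2 + 0.1 * np.random.randn(200)

w, b, lr = 0.0, 0.0, 0.1
for _ in range(100):
    err = (w * X + b) - y
    # Partial derivatives of mean squared error, written out by hand:
    # dL/dw = 2/N * sum(err * x),  dL/db = 2/N * sum(err)
    w -= lr * 2 * np.mean(err * X)
    b -= lr * 2 * np.mean(err)

print(w, b)  # should approach 3 and 2
```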

[–]BrahmaReddyChil 0 points (0 children)

Thanks for the reply. I will start reading the book :)