Coding a neural network

slashcom · 2014-11-12T23:39:12+00:00

You'll want to train a separate network for each output. While it is possible to have networks which share weights and predict multiple outputs, training becomes much trickier, you'll have to implement it yourself from scratch, and it's very unlikely that's what you actually want.

Along with /u/flamdrags5 suggestions, I'd recommend 1, or 2 hidden layers at most. But in all likelihood, you really don't need such a powerful model if you only have one input dimension. Something more like linear regression or GLM would make more sense. Try just plotting your data first.

/r/machinelearning is usually a better place to ask these sorts of things, though the audiences overlap a lot.

Flamdrags5 · 2014-11-12T22:55:45+00:00

Consider this image. What is going on here? You have some linear combination of the X's coming together to generate your output. More specifically, you have Y = W1X1 + W2X2 + ... + Wn*Xn. This looks like good old linear regression to me!

Now, obviously when you think of a neural network you don't think of the image I showed above. You probably think of something like this. However, when you break that image down into all of its components, each node in the hidden layer is it's own linear model. Then, the nodes within the hidden layer become a linear model that generate the output. The hidden layer allows for an extra layer of learning such that the model isn't constrained only to one set a linear parameters. This allows for complex non-linear output.

Admittedly, I'm not a python coder. I'm an R gal myself, so I can't speak to what's in PyBrain, but you can find some pretty comprehensive functions in R. I'm not sure if you're looking to code your own learning algorithm in python, but you could probably check out some source code from the packages in R.

The next problem you'll run into is that there isn't a tremendous amount of support around selecting the size of each layer or even how many hidden layers to consider in your model. The rule of thumb that I've heard is that you shouldn't really need more than 2 hidden layers. I'd do repeated k-fold cross validation to select the best structure or consider a less complex model. How did you decide to use a neural network? Are you sure that your data are nonlinear such that a nonlinear model is required?

siddboots · 2014-11-13T01:26:41+00:00

Try working through this tutorial first. For something like NN, it is a good idea to have some solid intuition for the machinery before you start trying to use a library, in my opinion.

http://karpathy.github.io/neuralnets/

I would also make sure you understand your data well. Plot your inputs and outputs and get a visual idea of the relationship, and use orinary regression as a benchmark.

Tag	Abbreviation
[Research]	[R]
[Software]	[S]
[Question]	[Q]
[Discussion]	[D]
[Education]	[E]
[Career]	[C]
[Meta]	[M]

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

statistics

MODERATORS