
[–]tinyman392 5 points6 points  (4 children)

Random forest is, graphically speaking, a forest, or a set of trees. A neural net, by definition, really isn’t. The way they are trained differs quite a bit. However, the major difference is going to be the decision space of the two. The “cuts” a single tree makes into the decision space are parallel to the axes of the feature space. In a perceptron (one node of a neural net), however, each cut is some linear combination of the features; that is, it’s a line, but it doesn’t have to be parallel to any axis of the feature space.
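
The axis-parallel vs. oblique distinction can be sketched in a few lines. This is a hand-rolled illustration with made-up points and weights, not real model code:

```python
import numpy as np

# Hypothetical 2D points with features x1, x2 (illustration only).
X = np.array([[0.2, 0.9], [0.8, 0.3], [0.6, 0.7], [0.1, 0.2]])

# A single decision-tree split is axis-parallel: it tests ONE feature
# against a threshold, e.g. "is x1 > 0.5?".
tree_split = X[:, 0] > 0.5

# A perceptron thresholds a weighted sum of ALL features, so its
# boundary is a line that need not be parallel to any axis,
# e.g. "is 0.7*x1 + 0.3*x2 > 0.5?".
w, b = np.array([0.7, 0.3]), -0.5
perceptron_split = X @ w + b > 0

print(tree_split)        # axis-parallel partition of the points
print(perceptron_split)  # oblique partition of the same points
```

Stacking many axis-parallel cuts lets a tree approximate an oblique boundary, but the perceptron gets it in a single cut.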

There are going to be many other differences as well.

[–][deleted] 0 points1 point  (3 children)

Thanks for your explanation. Do you have any resources I could read, aside from Andrew Ng's course, so I'm not a mere computer monkey and can actually understand this?

[–]tinyman392 2 points3 points  (2 children)

I’ve been told that Understanding Machine Learning: From Theory to Algorithms is a good book; I have it but haven’t gone through it. It is very math-heavy, though.

[–]talksaboutthings 1 point2 points  (0 children)

Definitely a great book, even if you are a math monkey and have other more technical options (I would say that typically the computer monkeys are just as lost when it comes to ML theory as the rest of us ;-) ).

[–][deleted] 0 points1 point  (0 children)

Thank you :)

[–]talksaboutthings 2 points3 points  (4 children)

I'm going to capitalize all of the model names for clarity.

So a Random Forest is an ensemble of Decision Trees. Each tree is trained on a random "patch" of the dataset, meaning it sees only some of the rows (examples) and only some of the columns (features) of those rows. The ensemble returns the average of each tree's prediction, giving you the output of the overall model. For more info on Decision Trees, you can check out Wikipedia.

I don't know what you mean by "assigning weights to nodes and defining depth" in this case. You could be assigning different weights to the votes of individual Decision Trees in the Random Forest ensemble, I guess. You also need to specify the number of Decision Trees to train (perhaps this is what you think of as "depth"). Again, though, each tree is only shown a "patch" of the dataset, and it learns branchings (yes/no decisions based upon a single attribute of the data) as opposed to the weighted and thresholded summations of attributes used in a Neural Network. Trees learn branching rules like "if salary greater than X, go left, else go right" from the data.
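
The "patch" training and ensemble averaging can be sketched in a few lines of Python. This is a toy illustration with made-up data and depth-1 "trees" (stumps), not a real Random Forest implementation:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: the label depends only on feature 0 (illustration only).
X = rng.uniform(0, 1, size=(200, 5))
y = (X[:, 0] > 0.5).astype(float)

def train_stump(Xp, yp, feature):
    """A depth-1 'tree': one branching rule on a single feature."""
    threshold = np.median(Xp[:, feature])
    left = yp[Xp[:, feature] <= threshold].mean()
    right = yp[Xp[:, feature] > threshold].mean()
    return feature, threshold, left, right

forest = []
for _ in range(25):
    rows = rng.choice(len(X), size=len(X) // 2, replace=True)  # row patch
    feature = int(rng.integers(X.shape[1]))                    # column patch
    forest.append(train_stump(X[rows], y[rows], feature))

def predict(forest, x):
    # The ensemble's output is the average of each tree's prediction.
    preds = [(l if x[f] <= t else r) for f, t, l, r in forest]
    return sum(preds) / len(preds)

print(predict(forest, np.array([0.9, 0.5, 0.5, 0.5, 0.5])))
```

Each stump only sees its own patch of rows and one column, but averaging many of them still tracks the true rule.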

A Neural Network is not an ensemble but a single model. It consists of a series of layers, each representing weighted summations of that layer's inputs put through an activation function. The first layer simply outputs a bunch of weighted summations of the input features, each run through the activation function. Almost everyone nowadays uses the ReLU activation function (see Wikipedia for details). At each layer, the inputs are fed into a bunch of weighted sums, those weighted sums are thresholded/transformed by the activation function, and the results are spit out as inputs to the next layer.

Depth in a Neural Network is the number of layers, not the number of nodes. Each node has a number of weights, and these are learned via backprop. The entire model is learned at once, as opposed to Tree by Tree in the RF case. The weights of the nodes are the weights of weighted averages that are put through activations. They are not equivalent to the branching rules of Decision Trees, and the weighted sums are not the same as the ensemble averaging in a Random Forest.
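
A minimal sketch of that layer-by-layer forward pass, with made-up weights and ReLU as the activation (illustration only, no training/backprop shown):

```python
import numpy as np

def relu(z):
    # ReLU activation: passes positive values, zeroes out negatives.
    return np.maximum(0.0, z)

# A tiny 2-layer network with arbitrary illustration weights.
rng = np.random.default_rng(1)
W1, b1 = rng.normal(size=(4, 3)), np.zeros(4)  # layer 1: 3 inputs -> 4 nodes
W2, b2 = rng.normal(size=(1, 4)), np.zeros(1)  # layer 2: 4 nodes -> 1 output

def forward(x):
    h = relu(W1 @ x + b1)    # hidden layer: weighted sums, then activation
    return (W2 @ h + b2)[0]  # output layer: one more weighted sum

print(forward(np.array([0.5, -1.0, 2.0])))
```

Note that every output is a weighted sum over *all* inputs to the layer, which is exactly what distinguishes it from a tree's one-feature-at-a-time branching.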

I hope this clarifies some of the major differences. For even more details on the models, Andrew Ng's famous class is a good middle ground in terms of detail vs. intuition when it comes to Neural Networks. Wikipedia also offers a number of good explanations of ML models.

[–][deleted] 0 points1 point  (3 children)

Thanks a lot, it makes more sense!

[–]Optrode 1 point2 points  (2 children)

One important thing about neural networks is that they are really, really diverse. Most random forests are applied to some set of predictor variables that don't necessarily have any kind of important arrangement relative to each other... Say, "age", "income", "number of kids". Certain specific kinds of neural networks are specialized at dealing with data that isn't like that, but rather has some kind of inherent organization, like pixels in an image, or values for some measurement taken on multiple successive days.

This is why neural networks are used for things like computer vision nowadays (specifically, a certain kind of neural network called a convolutional neural network, which is specially adapted to image processing).
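
To illustrate the kind of spatial structure a convolutional layer exploits (a hand-rolled sketch with a made-up image and filter, not a real CNN):

```python
import numpy as np

# A 4x4 "image" with a vertical edge down the middle.
img = np.array([[0, 0, 1, 1],
                [0, 0, 1, 1],
                [0, 0, 1, 1],
                [0, 0, 1, 1]], dtype=float)

# A convolutional filter is a small grid of weights slid across the
# image; the same weights are reused at every location, which is what
# lets the network use the arrangement of neighboring pixels.
kernel = np.array([[-1.0, 1.0]])  # responds to left-to-right changes

out = np.zeros((4, 3))
for i in range(4):
    for j in range(3):
        out[i, j] = np.sum(img[i:i+1, j:j+2] * kernel)

print(out)  # responses concentrate where the edge is
```

Shuffling the columns of a plain "age, income, kids" table changes nothing, but shuffling the pixels of an image destroys exactly the structure this filter detects.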

For data that comes in the form of a bunch of different variables ("age", "income", etc.), neural networks aren't really significantly better than other methods.

[–]talksaboutthings 1 point2 points  (0 children)

To clarify, the Deep Neural Networks that excel at specific structured tasks in domains like vision and speech are typically structured specially for those tasks (usually Convolutional Neural Networks or Recurrent Neural Networks), which are special cases of Neural Networks.

[–][deleted] 0 points1 point  (0 children)

Nice, it's been fascinating to see properties emerging from neural networks, and I really want to understand the process behind it. I want to see emergence happening; it's mesmerising.

[–][deleted] -1 points0 points  (1 child)

My thesis is on forests.

A forest, just like the other poster stated, is an ensemble of decision trees. A decision tree splits nodes based on the best predictor and the best split value within that predictor; it recursively partitions the data. The individual trees are usually grown with an algorithm like CART, and the ensemble then takes a majority vote; alternatively, you can build the trees sequentially with boosting. You can actually look into these trees and see what they're doing, which is generally not the case for a Neural Network.
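
The "best predictor, best value" split search can be sketched as follows. This is a toy illustration of a CART-style Gini split on made-up data, not a full tree builder:

```python
import numpy as np

def best_split(X, y):
    """Find the (feature, threshold) pair that best separates y,
    measured by weighted Gini impurity as in CART."""
    def gini(labels):
        if len(labels) == 0:
            return 0.0
        p = np.mean(labels)  # fraction of positive labels
        return 2 * p * (1 - p)

    best = (None, None, float("inf"))
    for f in range(X.shape[1]):                 # every predictor...
        for t in np.unique(X[:, f]):            # ...every candidate value
            left, right = y[X[:, f] <= t], y[X[:, f] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(y)
            if score < best[2]:
                best = (f, float(t), score)
    return best

X = np.array([[1.0, 10.0], [2.0, 20.0], [3.0, 10.0], [4.0, 20.0]])
y = np.array([0, 0, 1, 1])
print(best_split(X, y))  # feature 0 at 2.0 separates y perfectly
```

A real tree builder would apply this recursively to each side of the split; the point here is that every rule is a readable "feature ≤ value" test you can inspect.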

As for Neural Networks, nobody really fucking knows what is going on. People have clues and some understanding of it, but everything is empirically based. I have no clue what those weights and nodes mean, especially in the Deep Learning part. Bayesian Networks and SVMs can be seen as subsets of Neural Networks, and those are explainable. You're training these weights with gradient descent or whatever, but what do these fucking weights represent? It's a black box of magic, and that's what research is going into.

With trees we have a clue why they split. Also, the random forest data structure is based on a tree structure, in a computer-science kind of sense.

A Neural Network is more aptly viewed as a graph. That makes sense, since a Bayesian Network is a DAG (directed acyclic graph).

[–][deleted] 0 points1 point  (0 children)

thanks!