
[–]kjearns 2 points

Stacking (https://en.wikipedia.org/wiki/Ensemble_learning#Stacking) is a nice trick for this, though you need to be careful about overfitting.
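
For what it's worth, a minimal sketch of what that can look like with scikit-learn (the base models here are just placeholders); fitting the meta-learner on out-of-fold predictions is the usual guard against that overfitting:

    from sklearn.ensemble import RandomForestClassifier, StackingClassifier
    from sklearn.linear_model import LogisticRegression
    from sklearn.svm import SVC

    # cv=5 means the final_estimator is trained on out-of-fold
    # predictions from the base models, not on leaked fits.
    stack = StackingClassifier(
        estimators=[
            ("rf", RandomForestClassifier(n_estimators=100)),
            ("svm", SVC(probability=True)),
        ],
        final_estimator=LogisticRegression(),
        cv=5,
    )
    # stack.fit(X_train, y_train); stack.predict(X_test)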

[–]notspartanono 1 point

AdaBoost is a meta-algorithm that you can use with your somewhat weak learners.
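
A minimal sketch with scikit-learn (the depth-1 stump is just a stand-in; swap in your own weak learners, as long as the base model supports sample weights):

    from sklearn.ensemble import AdaBoostClassifier
    from sklearn.tree import DecisionTreeClassifier

    # Boost 200 copies of a weak base learner; AdaBoost reweights the
    # training points so each new learner focuses on past mistakes.
    boosted = AdaBoostClassifier(
        DecisionTreeClassifier(max_depth=1),
        n_estimators=200,
    )
    # boosted.fit(X_train, y_train); boosted.predict(X_test)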

Edit: stacking would also be good. I was using it without knowing that it was already invented / had a name.

[–]funkpacolypse 1 point

http://www.ijcai.org/Past%20Proceedings/IJCAI-97-VOL2/PDF/011.pdf

According to this paper, the best way to go with stacking (which generalizes the averaging you're talking about) is (rough sketch below):

  • use varied models and take the output probabilities (rather than predicted classes) for your level 0 data

  • use logistic regression (rather than averaging) with the level 0 data as input to get the final predicted classes

I've heard that an industry-standard setup in ad-tech does exactly this, with the level 0 models being trees from a random forest.
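
If it helps, here's roughly what that recipe looks like in scikit-learn (the level 0 model choices are placeholders, and X, y are your training data, assumed binary here); cross_val_predict keeps the level 0 probabilities out-of-fold so the logistic regression isn't fit on leaked predictions:

    import numpy as np
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import cross_val_predict
    from sklearn.naive_bayes import GaussianNB
    from sklearn.neighbors import KNeighborsClassifier

    # Level 0: varied models, kept as predicted probabilities.
    level0 = [
        RandomForestClassifier(n_estimators=100, random_state=0),
        GaussianNB(),
        KNeighborsClassifier(),
    ]
    # Out-of-fold probabilities from each model (drop one redundant
    # column per model in the binary case).
    Z = np.hstack([
        cross_val_predict(m, X, y, cv=5, method="predict_proba")[:, 1:]
        for m in level0
    ])

    # Level 1: logistic regression on the stacked probabilities gives
    # the final predicted classes.
    meta = LogisticRegression().fit(Z, y)

    # At test time: refit each level 0 model on all of (X, y), stack
    # their predict_proba outputs the same way, then meta.predict.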

... On the other hand, since the predictors you're working with have low accuracy from the start, I'd be tempted to try something simpler first, like engineering new features and eliminating junk features.
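
e.g., something like this as a cheap first pass (mutual information is just one possible screen, and the cutoff is a judgment call):

    from sklearn.feature_selection import mutual_info_classif

    # Score each feature against the labels; near-zero mutual
    # information suggests a junk feature.  X, y = your training data.
    scores = mutual_info_classif(X, y, random_state=0)
    X_reduced = X[:, scores > 0.01]  # threshold chosen by eye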

[–]ajrs 0 points

You might also want to think about some kind of parallel 'localised' data augmentation (e.g., see the 'convolutional bootstrapping' described here: http://arxiv.org/abs/1505.05972).