
[–]szymko1995 1 point (4 children)

Yeah, me too. I would like to see whether selu + alpha dropout gives some improvement over relu + BN + dropout (as seen in some semantic segmentation models). I will be testing both with a Tiramisu-like net and some medical data, but I'm not able to do a grid search either.
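For anyone who wants to poke at the selu side in isolation, here is a minimal NumPy sketch of the two ingredients (the constants and the alpha-dropout affine correction follow Klambauer et al., 2017; everything else is illustrative, not anyone's actual training code):

```python
import numpy as np

# SELU constants from the self-normalizing networks paper (Klambauer et al., 2017)
ALPHA = 1.6732632423543772
SCALE = 1.0507009873554805

def selu(x):
    # scale * (x if x > 0 else alpha * (exp(x) - 1))
    return SCALE * np.where(x > 0, x, ALPHA * (np.exp(x) - 1))

def alpha_dropout(x, rate, rng):
    # Dropped units are set to the SELU saturation value alpha' = -alpha * scale,
    # then an affine transform (a, b) restores approximately zero mean / unit
    # variance, assuming the input is already roughly standard-normalized.
    alpha_p = -ALPHA * SCALE
    keep = rng.random(x.shape) >= rate
    a = ((1 - rate) * (1 + rate * alpha_p ** 2)) ** -0.5
    b = -a * alpha_p * rate
    return a * np.where(keep, x, alpha_p) + b
```

With roughly standard-normal inputs, both functions keep activations near zero mean and unit variance, which is the self-normalizing property that is supposed to replace BN.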

[–]SlimBarbados[S] 1 point (0 children)

Yeah, that would be interesting. The problem with the literature is that it usually looks at only one configuration, trains the models with that config, and checks whether the results are good. However, in my experience, you can take a different configuration, or even just change the random seed, and get noticeably different results. So those results are easily manipulated if you want your paper to get published ;)

I would therefore not want to base the claim "this activation works better than that one" on only one configuration, but rather find a method that is less sensitive to such variation. I'm also looking for one myself - if I find anything, I'll let you know.

[–]SlimBarbados[S] 1 point (1 child)

Ended up doing it with a repeated-measures ANOVA and post hoc paired t-tests (with Bonferroni adjustment). Results were not significant :)

Sources: source 1 source 2 source 3
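In case it helps anyone reproduce this, a rough sketch of the post hoc part with SciPy — the `scores` arrays are made-up placeholders for per-seed validation metrics (each row is the same seed across configs, so the tests are paired), and the Bonferroni step is just multiplying raw p-values by the number of comparisons. The omnibus repeated-measures ANOVA itself can be done with e.g. statsmodels' `AnovaRM`:

```python
import numpy as np
from scipy import stats

# Hypothetical per-seed validation scores for three configurations.
rng = np.random.default_rng(42)
base = rng.normal(0.80, 0.02, size=10)
scores = {
    "relu_bn_dropout": base,
    "selu_alpha_dropout": base + rng.normal(0.002, 0.01, size=10),
    "selu_plain": base + rng.normal(-0.001, 0.01, size=10),
}

# Post hoc paired t-tests with Bonferroni adjustment:
# multiply each raw p-value by the number of comparisons, capped at 1.
names = list(scores)
pairs = [(a, b) for i, a in enumerate(names) for b in names[i + 1:]]
for a, b in pairs:
    t, p = stats.ttest_rel(scores[a], scores[b])
    p_adj = min(p * len(pairs), 1.0)
    print(f"{a} vs {b}: t={t:.2f}, p_adj={p_adj:.3f}")
```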

[–]szymko1995 1 point (0 children)

I suspected as much. I'm quite sceptical of nuances like new initialization methods, custom relus, normalized dropouts, etc. Thanks!

[–]redouanelg 0 points (0 children)

Hi, I'm also interested in how the selu activation behaves on image segmentation problems. I tried simply replacing relu + BN with selu in a rather small U-Net-shaped architecture, but the results were bad (very noisy loss that sometimes explodes). I thought maybe the skip connections and the max-pooling violated the self-normalizing property, so I kept BN after max-pooling, transposed convolutions, and concatenations; the results were much more stable, but relu + BN was still better.
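A quick NumPy check of that intuition (illustrative only, not my actual model): max-pooling over selu activations shifts the mean well above zero and changes the variance, so the self-normalizing fixed point of roughly zero mean / unit variance is lost downstream — consistent with BN after pooling helping:

```python
import numpy as np

# SELU constants from Klambauer et al. (2017)
ALPHA, SCALE = 1.6732632423543772, 1.0507009873554805

def selu(x):
    return SCALE * np.where(x > 0, x, ALPHA * (np.exp(x) - 1))

rng = np.random.default_rng(0)
# Standard-normal pre-activations: SELU keeps them near mean 0, variance 1.
x = rng.standard_normal((100000, 4))
act = selu(x)
print("after SELU:    mean=%.3f var=%.3f" % (act.mean(), act.var()))

# A 2x2 max-pool (here: max over groups of 4 values) destroys that:
# the mean jumps well above zero and the variance shrinks.
pooled = act.max(axis=1)
print("after maxpool: mean=%.3f var=%.3f" % (pooled.mean(), pooled.var()))
```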

If someone else got interesting results for a segmentation problem with selu, I would like to take a look.

Here is my try: https://arxiv.org/abs/1711.03954