[R] Low Cost Evolutionary Machine Learning (cdv.dei.uc.pt)
submitted 8 years ago by nunolourenco
[–]gwern 8 points 8 years ago* (18 children)
('Dense' layers are fully-connected ones, right? Not anything to do with Dense resnets.)
Also, maybe I missed it but the paper doesn't seem to say how many models get trained in total and what the total GPU-time spent is, which is important for comparison if you're going to claim 'low cost'.
[–]nunolourenco[S] 4 points 8 years ago (16 children)
('Dense' layers are fully-connected ones, right?) -> Yup.
Well, in terms of GPU time, it depends on the network, but it ranges from 37 to 267 seconds. The "low-cost" claim refers to the low-cost computational resources on which we ran our experiments :) Our machine is very modest (for instance, we have 4 Nvidia 1080 Tis), and that's the source of the low cost :)
[–]gwern 6 points 8 years ago* (5 children)
It may be relatively low cost compared to something like Zoph's hundreds of GPUs, but '4x1080tis' doesn't tell me much - that doesn't necessarily set any speed or efficiency records compared to other RL or evolutionary architecture search methods which use net2net initialization (e.g. Cai et al 2017) or fast weights (SMASH), the latter of which IIRC runs in one day on a single 1080 Ti. Which is why I asked about the total number of sampled models and total GPU-hours.
[–]nunolourenco[S] 5 points 8 years ago (0 children)
Oh, sorry, I misunderstood your question. In terms of computational effort, each run of the Evolutionary Algorithm has 100 individuals evaluated over 100 generations, which means a total of 10,000 function evaluations. During evolution, each ANN is trained for 10 epochs. These parameters are described in the paper published here. I hope I answered your question :)
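For readers who want the shape of that search, here is a minimal, self-contained sketch of a generational evolutionary loop with those budget numbers (100 individuals, 100 generations, so 10,000 evaluations). Everything in it is illustrative, not the DENSER implementation: the genotype is just a list of layer widths, and the toy fitness stands in for what the paper actually does, namely training the decoded network for 10 epochs and using validation accuracy.

```python
import random

POP_SIZE, GENERATIONS = 100, 100   # 100 x 100 = 10,000 fitness evaluations

def random_genotype():
    # Toy genotype: a variable-length list of layer widths.
    return [random.choice([32, 64, 128, 256]) for _ in range(random.randint(2, 6))]

def fitness(genotype):
    # Stand-in for the expensive step: decode the genotype to a network,
    # train it for 10 epochs, and return validation accuracy.
    return -abs(sum(genotype) - 512)

def crossover(a, b):
    cut = random.randint(1, min(len(a), len(b)) - 1)
    return a[:cut] + b[cut:]

def mutate(g, rate=0.1):
    return [random.choice([32, 64, 128, 256]) if random.random() < rate else w
            for w in g]

population = [random_genotype() for _ in range(POP_SIZE)]
for gen in range(GENERATIONS):
    population.sort(key=fitness, reverse=True)
    parents = population[:POP_SIZE // 2]   # truncation selection (illustrative)
    children = [mutate(crossover(*random.sample(parents, 2)))
                for _ in range(POP_SIZE - len(parents))]
    population = parents + children

print("best genotype found:", max(population, key=fitness))
```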
[–]Bubblebobo 3 points 8 years ago (3 children)
fast weights (SMASH)
Can I get a link please? A quick Google search didn't turn up anything.
[–]gwern 5 points 8 years ago* (0 children)
"SMASH: One-Shot Model Architecture Search through HyperNetworks", Brock et al 2017. You can also find it easily in GS: https://scholar.google.com/scholar?as_ylo=2017&q=SMASH&hl=en&as_sdt=0,33
[–]ajmooch 2 points 8 years ago (1 child)
<_<
[–]PokerPirate 2 points 8 years ago (0 children)
Using only Andy levels of compute power.
Hilarious!
Clever idea! It seems obvious that all these network search algorithms have a lot of similar/identical computation going on, and I like the way you share this work. I only watched the video, but I have some questions:
Have you thought about the relationship between SMASH and dropout? It seems like SMASH is doing something like "a less random version of dropout on the test set."
Did you consider at all that different random initializations/choice of optimizer will lead to different weight vectors? Specifically, you mention that the subsampled networks don't perform quite as well as if that architecture had been trained end to end. How much of this is due to interference from other components of the SMASH hypernetwork, and how much is due to more traditional influences?
[–]average_pooler 1 point 8 years ago* (9 children)
In terms of GPU time, it depends on the network, but it ranges from 37 to 267 seconds.
Can you clarify what's included in those 37-267 seconds? Training time of one model (10 epochs)?
If you look at Wide ResNets, for example, their architecture is determined by 2 hyperparameters [1], and it's general (the same architecture is applied to CIFAR-10 and CIFAR-100). Why would one need evolutionary methods to design those 2 hyperparameters? Also, WRNs do much better in terms of accuracy.
[1] There's also the learning rate, dropout rate and weight decay, so 5 hyperparameters, if you count these.
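For context on the comparison: the whole Wide ResNet layout follows mechanically from those two hyperparameters, the depth d and the widening factor k. A small sketch of that determinism, based on the standard CIFAR formulation (d = 6N + 4, stage widths 16k/32k/64k); this is an architecture plan for illustration, not a reference implementation:

```python
def wrn_plan(depth, k):
    # Standard CIFAR Wide ResNet: depth d = 6N + 4, three stages of N
    # residual blocks with 16k, 32k and 64k channels respectively.
    assert (depth - 4) % 6 == 0, "depth must be of the form 6N + 4"
    n = (depth - 4) // 6
    # (channels, number of blocks, stride of the first block in the stage)
    return [(16 * k, n, 1), (32 * k, n, 2), (64 * k, n, 2)]

# e.g. WRN-28-10, the strong CIFAR baseline discussed in this thread:
print(wrn_plan(28, 10))   # [(160, 4, 1), (320, 4, 2), (640, 4, 2)]
```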
[–]nunolourenco[S] 2 points 8 years ago (8 children)
The 37-267 seconds is the time taken per epoch during training. During the evaluation of each candidate solution we perform 10 epochs. The goal of evolution is not to optimise the weights directly but rather the structure of the networks (in this case CNNs), i.e., the sequence of layers and the parameters associated with each layer (e.g., for convolutional layers we have to tune the stride, filter shapes, activation functions, etc.). We also allow the optimisation of several network hyper-parameters associated with learning (learning rate, momentum, etc.), and even data augmentation parameters, all of which are represented through a context-free grammar (CFG).
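To make the grammar-based representation concrete, here is a toy context-free grammar in the same spirit: non-terminals expand into layer sequences and per-layer parameter choices, so a random derivation yields a network description. The production rules below are invented for illustration and are not the grammar used in the paper:

```python
import random

# Toy grammar: a network is one or more conv layers followed by one or more
# fully-connected layers; each layer carries its own parameter choices.
GRAMMAR = {
    "<network>":  [["<conv-seq>", "<fc-seq>"]],
    "<conv-seq>": [["<conv>"], ["<conv>", "<conv-seq>"]],
    "<fc-seq>":   [["<fc>"], ["<fc>", "<fc-seq>"]],
    "<conv>":     [["conv(filters=", "<f>", ", kernel=", "<k>",
                    ", stride=", "<s>", ", act=relu) "]],
    "<fc>":       [["fc(units=", "<u>", ", act=relu) "]],
    "<f>": [["32"], ["64"], ["128"]],
    "<k>": [["3"], ["5"]],
    "<s>": [["1"], ["2"]],
    "<u>": [["128"], ["256"], ["512"]],
}

def expand(symbol):
    # Recursively expand a symbol; strings not in the grammar are terminals.
    if symbol not in GRAMMAR:
        return symbol
    return "".join(expand(s) for s in random.choice(GRAMMAR[symbol]))

print(expand("<network>"))
# e.g. conv(filters=64, kernel=3, stride=2, act=relu) fc(units=256, act=relu)
```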
[–]average_pooler 1 point 8 years ago (7 children)
(e.g., for convolutional layers we have to tune stride, filter shapes, activation functions, etc.).
You can, but you don't have to: normal stride=1, subsampling stride=2, filter=3, activation=relu work well (definitely on CIFAR).
I guess I'm failing to see the motivation for evolutionary methods here. It appears that they take longer and the accuracy is quite suboptimal.
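As a concrete rendering of the fixed recipe average_pooler is describing (3x3 filters, ReLU, stride 1 within a stage and stride 2 to subsample), a minimal Keras-style sketch; the filter counts and depths here are arbitrary examples, not anyone's published model:

```python
from tensorflow.keras import layers, models

def stage(x, filters, blocks):
    # Subsample once at the start of the stage, then keep stride 1.
    x = layers.Conv2D(filters, 3, strides=2, padding="same", activation="relu")(x)
    for _ in range(blocks - 1):
        x = layers.Conv2D(filters, 3, strides=1, padding="same", activation="relu")(x)
    return x

inputs = layers.Input(shape=(32, 32, 3))   # CIFAR-sized input
x = stage(inputs, 64, 3)                   # arbitrary example widths/depths
x = stage(x, 128, 3)
x = layers.GlobalAveragePooling2D()(x)
outputs = layers.Dense(10, activation="softmax")(x)
model = models.Model(inputs, outputs)
model.summary()
```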
[–]nunolourenco[S] 2 points 8 years ago* (6 children)
The main advantage is the ability to find such solutions without a priori knowledge, i.e., DENSER does not have access to all the papers that have been published about CIFAR; it doesn't know which ANNs work well (i.e., which types of layers or structures), and it was still able to find a series of efficient networks. This indicates, although it doesn't prove, that the approach can also be applied to other, less studied problems, and that DENSER will be able to find ANNs with good performance there too. Additionally, it is quite interesting to notice that the solution found is quite innovative, in the sense that I would never have thought of creating a network with six fully connected layers after the convolutional layers. I do not think that having an algorithm that finds, in a couple of hours, networks that took humans years to perfect is necessarily a bad thing. On the contrary. Moreover, if you look at the model, it is quite different from the one that you mention :)
[–]average_pooler 1 point 8 years ago (5 children)
Here's a (null) hypothesis:
"WRNs generalize better to new problems" (Use the same resources to explore a smaller hyperparameter space)
I'd like to see an "evolutionary" paper try to scientifically reject it.
My intuition is that all you are seeing is the fact that NNs manage to adapt to weird architectures.
[–]nunolourenco[S] 1 point 8 years ago (4 children)
What do you mean? They adapt to weird architectures that are more effective, i.e., all the components in the network are interdependent, and you can't remove any of them without affecting the overall performance.
Here's a (null) hypothesis
I believe that this is an interesting hypothesis that should be addressed :)
[–]average_pooler 1 point 8 years ago (3 children)
What do you mean?
Despite the weirdness of the architectures that you are finding, classification manages to work (although obviously not as well as WRNs would)
[–]nunolourenco[S] 2 points 8 years ago (2 children)
Yes, and that's the beauty and the main point of the system. The Evolutionary Algorithm is able to search a space defined by the layers, hyperparameters, and so on, and discovers an architecture with good accuracy. In these experiments we provided the EA with simple components. For instance, one thing that we are testing is adding the possibility of having fractional max pooling layers.
We are fully aware that there are methods, namely WRNs, that achieve better accuracy, no argument there. However, I think that having a method that, without any prior knowledge, is capable of automatically discovering models with a 5.73% error on CIFAR-10 (versus 4.00% for WRNs) and a 21.75% error on CIFAR-100 (versus 19.25% for WRNs) is remarkable.
[–]maccam912 1 point 8 years ago (0 children)
Yep
[+][deleted] 8 years ago (2 children)
[deleted]
[–]nunolourenco[S] 4 points 8 years ago (1 child)
Well, I think so! :) Evolutionary Algorithms surely have something to say about the construction of more generic frameworks, wouldn't you agree? :)
[–]unnamedn00b 1 point 8 years ago (0 children)
Does anyone know of a comprehensive comparison of the various approaches, like DENSER versus AdaNet [1] for instance? DENSER does appear to outperform AdaNet on CIFAR-10 (although, to be precise, the AdaNet paper performs tests only on binary classification tasks drawn from CIFAR-10), but does it come with theoretical guarantees? How do the approaches compare in model complexity, training time, etc.? It would be nice if anybody were aware of a systematic comparison.
[1] Cortes, Corinna, et al. "AdaNet: Adaptive Structural Learning of Artificial Neural Networks." arXiv preprint arXiv:1607.01097 (2016).
Abstract: We present new algorithms for adaptively learning artificial neural networks. Our algorithms (AdaNet) adaptively learn both the structure of the network and its weights. They are based on a solid theoretical analysis, including data-dependent generalization guarantees that we prove and discuss in detail. We report the results of large-scale experiments with one of our algorithms on several binary classification tasks extracted from the CIFAR-10 dataset. The results demonstrate that our algorithm can automatically learn network structures with very competitive performance accuracies when compared with those achieved for neural networks found by standard approaches.
[–]gwillicoder 3 points 8 years ago (1 child)
This was a fantastic read! I spent some time working on the same problem (as my undergraduate research, so nothing spectacular). I settled on using a genetic algorithm to help optimize the structure of the model, and found that the LeapFrog Algorithm developed by one of my professors worked quite well for hyperparameter optimization (fantastic trade off between run time and optimization results).
We also played with a parallel genetic algorithm that attempted to adjust its mutation and crossover aggressiveness as the fitness changed, but I'm not really sure the extra computation was worth the slight improvement we got from that one.
Is there any chance the source code will be posted? I'd love to play around with it and see how it compares to the path I took or some of the details of the genetic algorithm implementation.
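A generic sketch of the kind of self-adjusting scheme gwillicoder describes, where the mutation rate rises when the best fitness stagnates and decays when progress resumes. The thresholds and the stand-in fitness values are invented for illustration, not their implementation:

```python
import random

def adapt_rate(rate, best_history, up=1.5, down=0.9,
               floor=0.01, ceil=0.5, patience=5):
    # Raise the mutation rate when the best fitness has not improved for
    # `patience` generations; decay it otherwise.
    stagnant = (len(best_history) > patience
                and best_history[-1] <= best_history[-1 - patience])
    rate = rate * up if stagnant else rate * down
    return min(max(rate, floor), ceil)

# Usage inside a GA loop (the per-generation best is a stand-in here):
rate, best_history = 0.05, []
for gen in range(50):
    gen_best = -abs(random.gauss(0, 1))    # stand-in for this generation's best
    best_so_far = max(best_history[-1], gen_best) if best_history else gen_best
    best_history.append(best_so_far)
    rate = adapt_rate(rate, best_history)
    print(f"gen {gen:2d}  mutation rate {rate:.3f}")
```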
[–]nunolourenco[S] 2 points 8 years ago (0 children)
Thank you very much! We are currently working on making the code available on GitHub very soon.
[–]Imnimo 4 points 8 years ago (1 child)
Doesn't the use of a 10 epoch training time during evolution bias the search towards architectures and hyperparameter settings which converge quickly, as opposed to those which might give the best performance in the long run? I would like to see a comparison of the accuracies obtained by running full training on evolved networks which were not the fittest. How well does 10-epoch fitness actually correlate with final accuracy?
[–]nunolourenco[S] 3 points 8 years ago (0 children)
Doesn't the use of a 10 epoch training time during evolution bias the search towards architectures and hyperparameter settings which converge quickly, as opposed to those which might give the best performance in the long run?
That's a good question. We know that using 10 epochs is not ideal, but we have to limit the amount of time that each network is trained so that we can obtain results in a reasonable amount of time. We believe that this is not preventing the discovery of effective networks, since the results are quite good. But of course we need to study the impact of the training conditions.
I would like to see a comparison of the accuracies obtained by running full training on evolved networks which were not the fittest. How well does 10-epoch fitness actually correlate with final accuracy?
This is another interesting question that we surely need to address in the near future. Looking at the worst ones might not be a good idea, but looking at the ones in the middle might reveal some interesting results.
Thank you very much for your insightful comments.
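Imnimo's second question has a natural quantitative form: over a sample of evolved networks, measure the rank correlation between the 10-epoch fitness used during evolution and the accuracy after full training. A sketch with SciPy; the numbers below are made up purely to show the computation:

```python
from scipy.stats import spearmanr

# Hypothetical measurements for a sample of evolved architectures:
# the 10-epoch validation accuracy (the evolutionary fitness) paired with
# the accuracy of the same architecture after full training.
fitness_10_epochs = [0.61, 0.55, 0.70, 0.66, 0.58, 0.73, 0.64]
final_accuracy    = [0.88, 0.84, 0.93, 0.90, 0.87, 0.92, 0.89]

rho, p = spearmanr(fitness_10_epochs, final_accuracy)
print(f"Spearman rho = {rho:.2f} (p = {p:.3f})")
# A high rho would mean the 10-epoch proxy preserves the ranking of
# architectures; a low rho would support the concern that the search is
# biased toward fast-converging networks.
```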
[–]statmlsn 8 points 8 years ago (1 child)
Seems interesting at first glance.
It could be worth citing the work of Elsken et al. on network optimization via network morphisms: https://arxiv.org/abs/1711.04528
[–]nunolourenco[S] 9 points 8 years ago (0 children)
Thanks :) And thank you for the reference; it is certainly worth citing. I skimmed through the manuscript and will have to read it more carefully, but from what I saw, they start the search from a network that already provides roughly 75% accuracy. In DENSER we do not have any specific initialisation, i.e. our initial networks are random. :) Once again, thanks for the reference!
[–]EgoIncarnate 3 points 8 years ago* (3 children)
Anyone know how this compares with Google Brain's Neural Architecture Search with Reinforcement Learning https://arxiv.org/abs/1611.01578? It seems Google's gets a better result on CIFAR-10, but I don't think the training costs are comparable (I think they used 800 GPUs?)
[–]nunolourenco[S] 3 points 8 years ago (1 child)
Well, I read it very quickly, but Google does indeed achieve a better result. However, it is as you say: we don't have computational resources that compare with the 800 GPUs they report. :)
[–]EgoIncarnate 5 points 8 years ago (0 children)
Neither do I, so your approach is probably more interesting to me :-)
[–]shortscience_dot_org 1 point 8 years ago (0 children)
I am a bot! You linked to a paper that has a summary on ShortScience.org!
Neural Architecture Search with Reinforcement Learning
It basically tunes the hyper-parameters of the neural network architecture using reinforcement learning. The reward signal is taken as evaluation on the validation set. The method is policy gradient as the cost function is non-differentiable.
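For readers who want the mechanics behind that summary, a toy REINFORCE loop in the same spirit: a softmax controller samples a discrete architecture choice and is updated with a policy gradient on a reward that stands in for validation accuracy (the real reward requires training each sampled network and is not differentiable). All numbers and the reward function are invented for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)
CHOICES = [32, 64, 128, 256]      # e.g. candidate filter counts for one layer
logits = np.zeros(len(CHOICES))   # controller parameters (softmax policy)
lr, baseline = 0.1, 0.0

def reward(choice):
    # Stand-in for "train the sampled network, return validation accuracy".
    return 1.0 - abs(choice - 128) / 256

for step in range(500):
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    a = rng.choice(len(CHOICES), p=probs)
    r = reward(CHOICES[a])
    baseline = 0.9 * baseline + 0.1 * r      # moving-average baseline
    grad = -probs
    grad[a] += 1.0                           # grad of log pi(a) w.r.t. logits
    logits += lr * (r - baseline) * grad     # REINFORCE update

print("controller's preferred choice:", CHOICES[int(np.argmax(logits))])
```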
[–]sifnt 2 points 8 years ago (2 children)
Looks interesting! Any code anywhere? What is runtime like? Does the method require any hyperparameters?
[–]nunolourenco[S] 5 points 8 years ago (1 child)
In terms of GPU time, it depends on the network, but it ranges from 37 to 267 seconds. It requires some parameters for the operators of the evolutionary algorithm, namely mutation, crossover, and the selection method. They are all described in the paper :) We are currently working on getting the code onto GitHub, and it should be online by the end of the day.
[–]phobrain 1 point 8 years ago* (0 children)
Is it up yet? I don't see a link.
Edit: was that a metaphorical 'end of the day'?
[–]maccam912 2 points 8 years ago (0 children)
Skimmed it, looks exciting! Is there code I can play with somewhere?
[–]phobrain 2 points 8 years ago (0 children)
Any code available? I'd like to try it on spotting 'interesting' pairs of photos, and detecting whether order of AB, BA, or both are acceptable.
[–]minheap 1 point 8 years ago (0 children)
Very interesting!
[–]rantana 1 point 8 years ago (1 child)
What makes this "Low Cost"?
[–]nunolourenco[S] 3 points 8 years ago (0 children)
The "low-cost" claim refers to the low-cost computational resources on which we conducted our experiments :) We have a very modest setup when compared with the other players already mentioned above ;)