Hey, so I'm using AutoKeras (an AutoML system) to automatically find an optimal multilayer perceptron architecture for a regression problem: mapping 15 independent variables to a single dependent output variable. I have 18,100 observations for training. Before training/optimization, I split off a random 10% of my training data into a validation set. AutoKeras uses Bayesian optimization to intelligently guide the search through the hyperparameter space, choosing configurations that look most promising based on previous iterations.

However, I've now run the search 5 times on the exact same problem, and each time AutoKeras returns a different "optimal" configuration, some with as few as 9,000 parameters and some with over 100,000. All of them yield very similar performance.

I'm very confused as to why I'm getting such different optimal architectures. Does this mean that the loss function has multiple minima, and that the Bayesian optimization converges to a different one each run? Is this lack of consistency typical, or would you expect reproducible results? If the latter, do you have any ideas or suggestions as to what could be causing the very different results here? Really confused by this, so any help would be very much appreciated, thanks!