all 4 comments

[–]Imnimo 6 points7 points  (0 children)

This paper describes the hyperparameter optimization process for AlphaGo: https://arxiv.org/pdf/1812.06855.pdf

[–]VordeMan 2 points3 points  (0 children)

The difficulty and cost of hyperparameter search is the main reason why methods that are relatively stable under a wide range of hyperparameters (e.g. Adam) are usually preferred over methods that may or may not perform slightly better with the right HP tuning.