Life In 19x19

Posted: **Tue Dec 18, 2018 4:41 am**

A new paper from DeepMind about how they trained the hyper-parameters of AlphaGo:

https://arxiv.org/abs/1812.06855

I just skimmed it, but something that jumped out to me was it was this automated tuning process that suggested to them to stop using rollouts and just use the value network.

Life In 19x19

Paper: Bayesian Optimization in AlphaGo

Paper: Bayesian Optimization in AlphaGo