Paper: Bayesian Optimization in AlphaGo
Posted: Tue Dec 18, 2018 4:41 am
A new paper from DeepMind about how they tuned the hyper-parameters of AlphaGo:
https://arxiv.org/abs/1812.06855
I only skimmed it, but one thing that jumped out at me: it was this automated tuning process that suggested they stop using rollouts and rely on the value network alone.