I volunteered to try what I suggested earlier, and I let the best 15-blocks LZ network run until it reached about 1.4M total visits.sorin wrote:The relative difference in number of visits may be actually "noise" since the total number of visits is still quite small in my analysis earlier in this thread. Maybe someone with a stronger computer or more time can let LeelaZero do a deeper analysis and, with many more total visits, the pattern that you expect may be true, as in relatively close winrate moves would also have relatively close number of visits...Bill Spight wrote:One thing that bothers me about the highly unbalanced comparison between plays is that, when the winrate estimates are close between two plays, shouldn't the number of visits for second place be roughly the same as the number of visits for the eventual winner of the comparison? That is, in general the more visits a play gets, the smaller the margin of error of its evaluation should be. So it seems that we have a case where the margins of error of two competing evaluations overlap, and instead of reducing the larger margin of error, much more effort is spent reducing the smaller margin of error. You could get it down to a point and there would still be overlap.
I was surprised to see that around the 500K total visits point, the traditional human move moved up from #2 to #1, and it kept the place. Yay humans!!
Also, as the number of visits increases, more and more weird moves are being explored, so basically any possible move has some non-zero probability of eventually being explored by MCTS.
It does seem to follow Bill's intuition, namely moves that are close winrate-wise get more attention and eventually get more visits; on the other hand, since new candidate moves pop up on the radar all the time, there will always be gaps, until a really huge number of visits happened (which I cannot experiment with on my laptop).
For instance, the low approach in the lower right corner is in the same winrate ballparck as top 2 moves on the left; it has way more visits than all other lower winrate candidates, but still way fewer than top two; I expect that with more visits, this 3rd candidate would also get more attention from MCTS.