Simple Sure Win Strategy for White Human Player vs GnuGo 3.8

For discussing go computing, software announcements, etc.
hyperpape
Tengen
Posts: 4382
Joined: Thu May 06, 2010 3:24 pm
Rank: AGA 3k
GD Posts: 65
OGS: Hyperpape 4k
Location: Caldas da Rainha, Portugal
Has thanked: 499 times
Been thanked: 727 times

Re: Simple Sure Win Strategy for White Human Player vs GnuGo

Post by hyperpape »

Mike Novack wrote:
jts wrote: Glofish may be 1d, but the instructions he came up with are not 1d instructions.


Who suggested that they were? (1 dan instructions)

But are you willing to agree that they are perhaps 6-8 kyu instructions? Gnugo isn't supposed to be anything like as strong as 6 kyu.

The instructions are (in effect) to coordinate one's separate positions so that they have a combined effect; gnugo's weakness is that it will not attempt to contest this. But isn't that sort of coordination precisely the sort of thing human players as weak as gnugo have trouble with?

To demonstrate that this is a method that human players significantly weaker than gnugo could use to easily defeat it, we would need games between players of those strengths and gnugo. Since a number of bots use gnugo, all we should have to do is wait and see if there is a noticeable decline in the rankings of those bots.


1) I think a substantially weaker player could execute these instructions. They require judgment about when cuts can happen that might cause problems for a 15 kyu, but I think a 10 kyu could do it. It would be nice to have someone try.

2) Your point at the end depends on there being ratings arbitrage on KGS. It is easy to imagine that this would not happen (I wrote about this a tiny bit: viewtopic.php?f=18&t=2646&p=43981&hilit=arbitrage#p43981).

3) What is remarkable about GnuGo's errors, as opposed to those of a human, is that they lend themselves to an easily taught strategy to get a win. Every human makes strategic errors. But I don't think you can typically play a 10 kyu, diagnose their errors, then give another 10 kyu a recipe to beat them. Glofish has shown that you can do that with GnuGo.

I think this point goes a bit beyond deterministic play. What you have to teach the 10 kyu in order to beat GnuGo is not general principles of Go, nor do you need to show them a game tree. You can teach them a few tricks, and they can use those to beat the bot (assuming that my claim that you could do this is right).

4) A question out of curiosity: how consistent are current MCTS systems' evaluations of a position/move? That is, if you give them reasonable time controls & hardware, do they tend to give the same moves high ratings? I suppose the most interesting case is a game that is fairly even, and not in the endgame, but the other cases might be interesting.
Mike Novack
Lives in sente
Posts: 1045
Joined: Mon Aug 09, 2010 9:36 am
GD Posts: 0
Been thanked: 182 times

Re: Simple Sure Win Strategy for White Human Player vs GnuGo

Post by Mike Novack »

hyperpape wrote:
4) A question out of curiosity: how consistent are current MCTS systems' evaluations of a position/move? That is, if you give them reasonable time controls & hardware, do they tend to give the same moves high ratings? I suppose the most interesting case is a game that is fairly even, and not in the endgame, but the other cases might be interesting.


They make fundamental use of probability in the evaluation. Even though they may be playing out a great many random moves, so that large numbers are involved, probability doesn't work the way most people think it will*.

1) The greater the difference between the (top) moves they are evaluating, the greater the probability that they will select the same (better) move. It is like when a human expert looking at a game situation sees some particular move as "the only move": they are extremely likely to select that same move again if you repeat the situation. The greater the amount by which this move is better than any of the others being considered, the more likely it will be chosen the next time around. But it is not certain.

2) But if the difference between the top moves is smaller, one only somewhat better than the other -- the sort of situation where an expert evaluating the position says "I give this move 100%, but these other moves, perhaps playable, only 80%" -- then the next time around the program might or might not select one of the other moves.
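The two cases above can be illustrated with a toy simulation (plain Python; the winrates and playout counts are invented for illustration, not taken from any real engine): estimate each of two candidate moves from a fixed number of random playouts and see how often the truly better move wins the comparison.

```python
import random

def pick_better(p_a, p_b, playouts, rng):
    """Estimate each move's winrate from `playouts` random playouts
    (true winrates p_a, p_b) and return True if move A is selected."""
    wins_a = sum(rng.random() < p_a for _ in range(playouts))
    wins_b = sum(rng.random() < p_b for _ in range(playouts))
    return wins_a >= wins_b

rng = random.Random(0)
trials = 2000
# Clear gap ("the only move"): A is much better, so it is almost always chosen.
clear = sum(pick_better(0.80, 0.50, 200, rng) for _ in range(trials)) / trials
# Narrow gap: A is only slightly better, so the choice flips fairly often.
narrow = sum(pick_better(0.52, 0.50, 200, rng) for _ in range(trials)) / trials
print(clear, narrow)
```

With a clear gap the "better" move wins essentially every repetition; with a narrow gap it still wins more than half the time, but far from always -- which is the repeatability pattern described above.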

I would be suspicious of the "adjustment" or "randomness" of an MCTS program if it seemed always to select the same move when you could see some alternate way to play. These programs have only pseudorandom generators, but it would be possible to work in truly random factors as the game progresses (how many microseconds the opponent took to make the last move, for example -- reseed the generator every turn, and that wouldn't be the same when the game repeated).
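That reseeding idea could be sketched roughly as follows (the helper name and interface here are hypothetical, not taken from any real engine; the point is just mixing opponent think time into the generator state each turn):

```python
import random
import time

def reseed_from_opponent(rng, move_start_ns):
    """Mix the microseconds the opponent spent on the last move into
    the generator state, so a replayed game would not repeat exactly.
    Illustrative sketch only; a real engine would fold this into its
    own PRNG rather than Python's."""
    elapsed_us = (time.monotonic_ns() - move_start_ns) // 1000
    rng.seed((rng.getrandbits(64) << 20) ^ elapsed_us)

rng = random.Random()
start = time.monotonic_ns()
# ... opponent thinks and plays a move here ...
reseed_from_opponent(rng, start)
print(rng.random())  # subsequent "random" playout choices now depend on wall-clock timing
```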

3) The game has to be relatively even, or the bot ahead, for the MCTS algorithm to work properly. As they are currently designed, if hopelessly behind they have no choice between "resign" and "suicidal counterattack" (and so will make some rather bad moves). I believe it would be possible** to modify these programs to play in a more humanlike way in this situation (to play as if "resigned to a modest loss"), but it is probably too early for developers to consider work on that option unless there is strong user demand for it.

* example --- if we flip an honest coin 10,000 times we should not expect to get exactly 5,000 heads and 5,000 tails. This is indeed the most likely SINGLE outcome, but there are a large number of possible outcomes, each gradually less likely as we get farther and farther from 50-50. The standard deviation here is sqrt(10,000 x 0.5 x 0.5) = 50, so we can expect our outcome to be within about 4,966 - 5,034 (either heads or tails in the lead) about half the time, and outside that range (a less even split) the other half.
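A quick simulation (plain Python, fair coin) shows how the spread actually behaves: essentially every run of 10,000 flips lands within 300 of an even split, while only about half land within 34 of it (0.674 standard deviations, with sigma = sqrt(10,000 x 0.25) = 50).

```python
import math
import random

rng = random.Random(42)

def heads(n):
    """Count heads in n fair coin flips."""
    return sum(rng.random() < 0.5 for _ in range(n))

n, trials = 10_000, 500
sigma = math.sqrt(n * 0.5 * 0.5)  # = 50.0 for a fair coin
counts = [heads(n) for _ in range(trials)]
within_300 = sum(abs(c - 5_000) <= 300 for c in counts) / trials  # ~6 sigma: nearly all
within_34 = sum(abs(c - 5_000) <= 34 for c in counts) / trials    # ~0.674 sigma: about half
print(sigma, within_300, within_34)
```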

** What I would try is a split "consider resigning" routine. If the program estimates that its chances of winning are too low for the algorithm to work properly (none of the moves considered has more than a low probability) and this option is in effect, then the first time it makes a dynamic adjustment to the evaluation komi (it will lose by at least that amount, but "thinks" it can win, so it keeps playing normally); if it considers resigning a second time, it resigns. I think that would get the machine to play out losing games more normally.
daal
Oza
Posts: 2508
Joined: Wed Apr 21, 2010 1:30 am
GD Posts: 0
Has thanked: 1304 times
Been thanked: 1128 times

Re: Simple Sure Win Strategy for White Human Player vs GnuGo

Post by daal »

Ok, I spent three minutes teaching the strategy to a 16k. Here is the game:

Patience, grasshopper.
GloFish
Beginner
Posts: 8
Joined: Thu Jan 27, 2011 1:09 am
Rank: 1d
GD Posts: 0
Been thanked: 1 time

Re: Simple Sure Win Strategy for White Human Player vs GnuGo

Post by GloFish »

daal wrote:Ok, I spent three minutes teaching the strategy to a 16k. Here is the game:


Heh, thank you for that nice experiment, it exceeds my expectations. :bow:

I hope that convinces Mike ;-)
daal
Oza
Posts: 2508
Joined: Wed Apr 21, 2010 1:30 am
GD Posts: 0
Has thanked: 1304 times
Been thanked: 1128 times

Re: Simple Sure Win Strategy for White Human Player vs GnuGo

Post by daal »

Just to be fair, I also taught it to a 9k, who lost when gnugo played 5-4 corners. The player said that the territory he got in the center was just too small. :shock: I suspect he goofed somewhere, but unfortunately he couldn't find the game. He did however try again and won easily.
karaklis
Lives in sente
Posts: 797
Joined: Tue Apr 20, 2010 2:14 pm
GD Posts: 600
Has thanked: 93 times
Been thanked: 105 times

Re: Simple Sure Win Strategy for White Human Player vs GnuGo

Post by karaklis »

I taught it to my son (around 14-16k), and he made it on the second try. On the first try GnuGo attempted an invasion (I don't know what caused the engine to try one; maybe the reason is that I used GnuGo 3.9). After I taught my son how to kill the invasion, he won.
abartos
Beginner
Posts: 1
Joined: Thu Feb 10, 2011 3:27 am
Rank: 16k
GD Posts: 0

Re: Simple Sure Win Strategy for White Human Player vs GnuGo

Post by abartos »

I'm 16k, and it was easy for me to implement this strategy against gnugo, even on level 16. The strategy also works in pretty much the same way against Many Faces of Go version 11, on level 10, and against Aya, on the strong setting.
daniel_the_smith
Gosei
Posts: 2116
Joined: Wed Apr 21, 2010 8:51 am
Rank: 2d AGA
GD Posts: 1193
KGS: lavalamp
Tygem: imapenguin
IGS: lavalamp
OGS: daniel_the_smith
Location: Silicon Valley
Has thanked: 152 times
Been thanked: 330 times

Re: Simple Sure Win Strategy for White Human Player vs GnuGo

Post by daniel_the_smith »

Wow, abartos, that's crazy!

Do monte carlo engines fall for this too? I would assume that they never would, but...
That which can be destroyed by the truth should be.
--
My (sadly neglected, but not forgotten) project: http://dailyjoseki.com
Mike Novack
Lives in sente
Posts: 1045
Joined: Mon Aug 09, 2010 9:36 am
GD Posts: 0
Been thanked: 182 times

Re: Simple Sure Win Strategy for White Human Player vs GnuGo

Post by Mike Novack »

I don't think so (that any of the MCTS evaluators would fall for this).

But first it might be interesting to see if a current AI evaluator does. Folks really need to keep in mind that "just a couple years old" is a long time in this state of the art. And being dependent on the availability of volunteer developer time, the gnugo versions available now have to be considered on a par with somewhat older commercial programs. Don't get me wrong, I am a big fan/user of "free software", but mainly for uses where being a couple years behind the state of the art means little/nothing.

MFOG does use an AI move generator and below the top two levels also uses an AI evaluator to choose the move (from the generated set of plausible moves).

So try MFOG 12.022 at the 6 kyu level to see if its evaluator is still as bad at deciding between local and global considerations (this "simple method to defeat" is the result of the evaluator not adjusting for the total cumulative effect of either territory or influence*).

To settle the question of whether an MCTS evaluator can be tricked this way, try the top two levels of MFOG 12.022 or any other MCTS program you prefer. I am not trying to tout some particular program -- I simply know better for some than for others how they function. I don't know if any of the other MCTS programs even have an AI move generator**, let alone an AI move evaluator for the lower levels.

But really! Don't be telling us what MFOG 10, or even MFOG 11, can't do unless you clearly spell out for folks that "this is only informational about what a very badly outdated/superseded program can or cannot do". The problem is that too many people assume that their experience of how a program behaved a couple years ago remains valid as an expectation of how it would behave today. We have to keep reminding them that in this game a couple years is a long time.



* Take gnugo for example. Version 3.8 offers a choice between evaluation based on local territory and global evaluation (the "cosmic" setting), but not an option where it will sometimes use one or the other method depending on the state of the game, or even randomly. Were I doing gnugo development for the AI-only version, that's what I would try first: a parameter that sets the probability of using either regular or cosmic evaluation for the next move.
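The suggested parameter amounts to something like the following sketch (the names and the two "evaluators" are placeholders -- gnugo has no such option; this only shows the per-move random choice being proposed):

```python
import random

def pick_evaluator(p_cosmic, rng, cosmic_eval, territorial_eval):
    """With probability p_cosmic use the global ('cosmic') evaluation
    for the next move, otherwise the regular territorial one."""
    return cosmic_eval if rng.random() < p_cosmic else territorial_eval

rng = random.Random(7)
# Simulate 1,000 move decisions with the parameter set to 0.3.
chosen = [pick_evaluator(0.3, rng, "cosmic", "territorial") for _ in range(1000)]
print(chosen.count("cosmic"))  # roughly 300 of the 1,000 moves use cosmic evaluation
```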

** The effect of using an AI move generator is to make the machine player more humanlike. It won't be making super strange moves, moves with no apparent purpose (though of course these may be good moves down the road). The reason to use MFOG for the experiment is that in this case we do know it has an AI move generator, so if this "defeat the program" method works against the lower levels (AI evaluator) but not the two highest levels (MCTS evaluator), then we have determined that the problem is with evaluation, not move generation.