Life In 19x19 http://www.lifein19x19.com/ |
|
AlphaGo Zero Patterns http://www.lifein19x19.com/viewtopic.php?f=18&t=15134 |
Page 1 of 1 |
Author: | sorin [ Mon Oct 23, 2017 10:44 pm ] |
Post subject: | AlphaGo Zero Patterns |
I wrote an article about a few patterns I noticed in AlphaGo Zero's published games: http://361points.com/articles/alphago_zero_patterns/ |
Author: | Uberdude [ Mon Oct 23, 2017 11:01 pm ] |
Post subject: | Re: AlphaGo Zero Patterns |
Nice article, but I don't think not playing the wedge in double approach is a blind spot for humans: I've had a general preference for corner hane for some years now and that's based on my understanding that plenty of human pros (particularly Chinese?) have preferred it since quite some time before AlphaGo existed. Also 7 in pattern 1 rather than direct capture surprised me. Does it fear Black tenuki if capture so loses a little extra points to be sure of sente? I suppose p14 is also a possible shape followup and it creates more ko threats. |
Author: | RobertJasiek [ Tue Oct 24, 2017 12:40 am ] |
Post subject: | Re: AlphaGo Zero Patterns |
I am only surprised by one move: At first, I thought that the following provides better eye shape and better endgame: But maybe AlphaGo Zero jugdes that this later attack is more relevant: EDITS |
Author: | Uberdude [ Tue Oct 24, 2017 2:29 am ] |
Post subject: | Re: AlphaGo Zero Patterns |
Robert, for interest, there was a somewhat similar shape in AG Master self-play #38 (see viewtopic.php?p=221574#p221574) and in that case it connected rather than capture on move 33, I think because it didn't like the 2-1 throw-in ko. |
Author: | sorin [ Tue Oct 24, 2017 7:11 pm ] |
Post subject: | Re: AlphaGo Zero Patterns |
Uberdude wrote: Nice article, but I don't think not playing the wedge in double approach is a blind spot for humans: I've had a general preference for corner hane for some years now and that's based on my understanding that plenty of human pros (particularly Chinese?) have preferred it since quite some time before AlphaGo existed. Indeed, the popularity of the wedge vs hane declined in time, but it still appears in pro games even in 2016. AlphaGo seems to a have much stronger opinion about it ![]() Uberdude wrote: Also 7 in pattern 1 rather than direct capture surprised me. Does it fear Black tenuki if capture so loses a little extra points to be sure of sente? I suppose p14 is also a possible shape followup and it creates more ko threats. Maybe it likes that it gets some forcing moves from the outside - like P15 - or just in general the fact that black ends up with one extra group (in theory)? (By the way, moves 5 and 7 are interchangeable - in one of the games it plays at 7 first). I looked this up in my SGF collection and found only one human game with a similar starting pattern, except that white's approach move at R12 is missing (there is one two spaces down instead); the one "in the corner" is some Inoue Ichiro, while "outside" is Go Seigen, who simply captured the stone like you suggested. |
Author: | sorin [ Tue Oct 24, 2017 7:13 pm ] |
Post subject: | Re: AlphaGo Zero Patterns |
RobertJasiek wrote: But maybe AlphaGo Zero jugdes that this later attack is more relevant: This last diagram looks very convincing for why the capture is better! |
Author: | Bill Spight [ Tue Oct 24, 2017 7:35 pm ] |
Post subject: | Re: AlphaGo Zero Patterns |
sorin wrote: Uberdude wrote: Nice article, but I don't think not playing the wedge in double approach is a blind spot for humans: I've had a general preference for corner hane for some years now and that's based on my understanding that plenty of human pros (particularly Chinese?) have preferred it since quite some time before AlphaGo existed. Indeed, the popularity of the wedge vs hane declined in time, but it still appears in pro games even in 2016. AlphaGo seems to a have much stronger opinion about it ![]() I suspect that AlphaGo Zero has a path dependency. That is, even though it trains against a previous version, and ought to correct errors eventually, one version and the next share many assumptions, not all of which are correct. The main focus of the AlphaGo team is machine learning, not go learning. That is, their aim was to produce a strong go program, not to answer questions about go. It would be quite interesting to see AlphaGo Zero.Two, an independent program which follows its own path. They might well play different styles and favor different plays. My guess is that they would. ![]() |
Author: | sorin [ Tue Oct 24, 2017 9:25 pm ] |
Post subject: | Re: AlphaGo Zero Patterns |
Bill Spight wrote: I suspect that AlphaGo Zero has a path dependency. That is, even though it trains against a previous version, and ought to correct errors eventually, one version and the next share many assumptions, not all of which are correct. The main focus of the AlphaGo team is machine learning, not go learning. That is, their aim was to produce a strong go program, not to answer questions about go. It would be quite interesting to see AlphaGo Zero.Two, an independent program which follows its own path. They might well play different styles and favor different plays. My guess is that they would. ![]() That is a fascinating topic, it occurred to me too - the question "would the learning from scratch always converge to the same point/style". Given that AlphaGo Zero ended up playing pretty similarly to the human style though (favoring after all 3rd and 4th lines mostly, in the beginning of the game) in both of 20-block and 40-block versions, seems to suggest that there is "style convergence" (to some extent). I would like very much to see the "trajectory" it took during learning; the published paper showed the frequency of a few patterns, but I would like to see in more detail how does the style change between amateur 1-dan and pro 9-dan stages... |
Author: | moha [ Wed Oct 25, 2017 6:10 am ] |
Post subject: | Re: AlphaGo Zero Patterns |
sorin wrote: Bill Spight wrote: It would be quite interesting to see AlphaGo Zero.Two, an independent program which follows its own path. They might well play different styles and favor different plays. My guess is that they would. That is a fascinating topic, it occurred to me too - the question "would the learning from scratch always converge to the same point/style".![]() Given that AlphaGo Zero ended up playing pretty similarly to the human style though (favoring after all 3rd and 4th lines mostly, in the beginning of the game) in both of 20-block and 40-block versions, seems to suggest that there is "style convergence" (to some extent). |
Page 1 of 1 | All times are UTC - 8 hours [ DST ] |
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group http://www.phpbb.com/ |