Life In 19x19 :: How LZ reads out ladders

Carrying on from the other thread, now I'm using my modified LZ version to explore how LZ "understands" ladders. It's really interesting to look at older and newer LZ nets and see how they treat the same position.

In theory, there should be three things going on:

Policy network: does LZ think the next move in the ladder is an obvious move to explore? Can it "see at a glance" whether a ladder capture is good or bad?
Playouts: it can take about 60 moves to read a ladder that goes all the way across the board. Does that mean LZ needs a minimum of 60 playouts to read out a ladder? Or more playouts if it's reading out other variations along the way?
Net evaluation: Once a few moves of a ladder appear on the board, can LZ recognise the position as good for white or good for black? Can it accurately measure the cost of playing out a bad ladder?

These factors interact with each other. If a ladder move is a low policy move then it won't get many playouts. If the value net can recognise that ten moves of a ladder turn into a disaster, then it also shouldn't need to play out the full ladder.
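To make the interaction between policy and playouts concrete, here is a toy sketch of AlphaZero-style PUCT selection at a single node. It is not LZ's actual search code: the function name, the constant c_puct = 1.5, the fixed per-move values, and the choice to score unvisited moves as 0 (a crude stand-in for first-play urgency) are all assumptions made for illustration.

```python
import math

def simulate_puct(priors, values, n_playouts, c_puct=1.5, fpu=0.0):
    """Toy single-node PUCT: return the visit count each move ends up with.

    priors -- policy prior per move (should sum to about 1)
    values -- fixed win rate returned every time a move is evaluated
    fpu    -- value assumed for a move that has never been visited
    """
    visits = [0] * len(priors)
    for _ in range(n_playouts):
        parent = sum(visits) + 1  # count the parent node's own visit
        best, best_score = 0, -math.inf
        for i, (p, v) in enumerate(zip(priors, values)):
            q = v if visits[i] > 0 else fpu
            u = c_puct * p * math.sqrt(parent) / (1 + visits[i])
            if q + u > best_score:
                best, best_score = i, q + u
        visits[best] += 1
    return visits

# Hypothetical numbers: move 2 is really the best (0.80) but has a 1% prior.
priors = [0.90, 0.09, 0.01]
values = [0.50, 0.50, 0.80]
for n in (100, 1000, 10000):
    print(n, simulate_puct(priors, values, n))
```

With these made-up numbers the low-prior move gets no playouts at all in the first thousand, because its exploration bonus has not yet grown past the value of the moves already explored. Once it is finally tried, its better evaluation lets it take over the visit count.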

I'd expect that smaller networks (5 or 6 blocks) will need to play out pretty much the whole ladder, because they can't "see all the way across the board", while a 20 or 40 block network should be able to "take in the position at a glance" and understand the ladder status without playing out the moves.

So, on to some tests. Below are some taisha positions where both sides have made mistakes, and now white has the chance to start a ladder. I want to look at four scenarios:

Test position 1A: good ladder, attacker's perspective. White's best move is to atari the black stone and start the ladder.
Test position 1B: good ladder, defender's perspective. Black shouldn't pull out of atari, but should play elsewhere.
...

[go]$$Wc Test position 1.
$$ ---------------------------------------
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . O . . . . |
$$ | . . . X . . . . . , . . . . . , O d . |
$$ | . . . . . . . . . . . . . . . . e . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . , . . . . . , . . . . . , . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . X O O b . . . . . . . . . . . . . |
$$ | . . X X O X a . . , . . . . . X . . . |
$$ | . . . O X O . . . . . . . c . . . . . |
$$ | . . . X X O . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ ---------------------------------------[/go]

Test position 1A (above): white must play at a; any other move is a mistake.
Test position 1B (above): after white a, black should tenuki -- there are several possible moves, for example c, d, e, but b would be a bad mistake.

...

Author:	Uberdude [ Sun Mar 01, 2020 3:15 am ]
Post subject:	Re: How LZ reads out ladders
Interesting, xela, thanks. Back around the LZ 157 days I remember noticing that LZ tended to assume ladders work, so it made mistakes in positions where they didn't, whilst Elf tended to assume ladders don't work, so it made mistakes in positions where they did. Also, to test whether LZ 40 block can really "read the ladder at a glance", rather than having "ladder from lower left to top right is good for black with a black stone in top right" baked into the policy, I would suggest moving the stone(s) left one space at a time until they stop being ladder breakers, and seeing whether LZ actually notices and how sharply.
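One way to set up that experiment is to generate the test positions mechanically. Here is a minimal sketch that takes diagram rows in the same text notation as the [go] diagrams in this thread (X, O, and . or , for empty points) and produces one variant per step with a chosen stone moved left. The function names, the tiny 5x5 demo board, and the stone coordinates are hypothetical; the resulting positions would still need to be loaded into LZ by hand or via SGF.

```python
EMPTY = {".", ","}  # "," marks an empty star point in the diagram notation

def shift_stone_left(rows, row, col, steps):
    """Return new rows with the stone at (row, col) moved `steps` points left.

    rows are strings like ". . X O ."; row and col are 0-based.
    """
    grid = [r.split() for r in rows]
    stone = grid[row][col]
    if stone not in ("X", "O"):
        raise ValueError("no stone at the given point")
    target = col - steps
    if target < 0 or grid[row][target] not in EMPTY:
        raise ValueError("target point is off the board or occupied")
    grid[row][col] = "."   # note: a star point under the stone is not restored
    grid[row][target] = stone
    return [" ".join(r) for r in grid]

def breaker_variants(rows, row, col, max_steps):
    """Original position followed by variants shifted 1..max_steps points left."""
    variants = [list(rows)]
    for k in range(1, max_steps + 1):
        variants.append(shift_stone_left(rows, row, col, k))
    return variants

# Hypothetical 5x5 demo: move the top-right stone left, one point per variant.
demo = [
    ". . . . O",
    ". . . . .",
    ". . . . .",
    ". . . . .",
    "X . . . .",
]
for board in breaker_variants(demo, 0, 4, 3):
    print("\n".join(board), end="\n\n")
```

Running each variant for the same number of playouts and noting where the evaluation flips would show how sharply the net reacts to the stone leaving the ladder path.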

Author:	xela [ Sun Mar 01, 2020 3:48 am ]
Post subject:	Re: How LZ reads out ladders
Uberdude wrote:
Also, to test that LZ 40 block can really "read the ladder at a glance" rather than "ladder from lower left to top right is good for black with a black stone in top right" being baked into the policy I would suggest moving the stone(s) left by one space at a time until they stop being ladder breakers and see if LZ actually notices and how sharply.

Good idea! First I'll post the things I've already looked at (finding time to write things down is a bit of a challenge right now, but I'm gradually getting there). Then I'll try this. And I haven't forgotten the other ladder game you suggested...

Author:	ez4u [ Sun Mar 01, 2020 5:29 am ]
Post subject:	Re: How LZ reads out ladders
xela wrote:
...
Test position 1A (above): white must play at a, any other move is a mistake.
Test position 1B (above): after white a, black should tenuki -- there are several possible moves, for example c, d, e, but b would be a bad mistake.
...

With the current 266 net, White's blue move is immediately the ladder play at 1. After 1, pulling out the Black stone is not rejected; it is never tested (at least within the first 100K playouts). Most tested replies are in the upper right. The 2 shown below is not heavily tested (again, at least not up to 100K playouts when I ran this). It has a low policy number compared to nearby points. However, note that it is on the line of White's laddering stones (the line of "a"s) and is therefore a strong ladder breaker that cannot easily be countered from behind.

If we play this 2 on the board, blue becomes 3 shown below. This move does not reestablish the ladder. LZ only considers local replies by Black in calculating its results. I ran Lizzie over dinner and some Sunday night television. That gave me 1.3 million playouts. In the early going blue switched back and forth between 3 and 4 below. However, by about 100K playouts 3 dominates. See the three screenshots below the diagram for the rest of the story.

[go]$$Wc Test position 1.
$$ ---------------------------------------
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . O . . . . |
$$ | . . . X . . . . . , . . . . . , O . . |
$$ | . . . . . . . . . . . . . . 2 . . . . |
$$ | . . . . . . . . . . . . . . . 3 . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . , . . . . . , . . . . . , . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . a . . . . . . . . . . . |
$$ | . . . . . . a . . . . . . . . . . . . |
$$ | . . . . . a . . . . . . . . . . . . . |
$$ | . . X O O 4 . . . . . . . . . . . . . |
$$ | . . X X O X 1 . . , . . . . . X . . . |
$$ | . . . O X O . . . . . . . . . . . . . |
$$ | . . . X X O . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ ---------------------------------------[/go]

Below is a screenshot after 1.3 million playouts. Blue is 3 above. LZ only calculates using local replies to this move.
Attachment: Blind spot LZ 266.jpg [ 261.64 KiB ]

Checking the bottom left after 1.3 million playouts for 3, we can see that only three playouts test Black pulling out the laddered stone.
Attachment: Blind spot LZ 266 2.jpg [ 199.18 KiB ]

But when we add , we see LZ finally making the calculations and White's win rate dropping. This is after 2K playouts.
Attachment: Blind spot LZ 266 3.jpg [ 261.54 KiB ]

Author:	ez4u [ Sun Mar 01, 2020 5:58 am ]
Post subject:	Re: How LZ reads out ladders
Following up on my previous post. Here is how KataGo 1.3.3 with the b15 net handled the same situation in 252 playouts! What to do after in my original diagram.
Attachment: Blind spot Katago b15 1.jpg [ 198 KiB ]
What it calculated for with 3(!) playouts.
Attachment: Blind spot Katago b15 2.jpg [ 193.76 KiB ]

Author:	xela [ Sun Mar 01, 2020 6:17 am ]
Post subject:	Re: How LZ reads out ladders
Summary: so far it looks as though the policy net overrides everything else. If there's a blind spot in the policy, then in theory a large enough number of playouts together with accurate evaluations should fix it, but it really does take a massive number of playouts. To be continued...
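A rough back-of-envelope calculation suggests why the number of playouts has to be so large. Under the textbook PUCT rule (an assumption here; LZ's actual search has extra terms such as first-play urgency reduction), an unvisited move with prior p scores about fpu + c_puct * p * sqrt(N) after N playouts at that node, while any already-visited move scores more than its own value q. So the unvisited move cannot be tried until c_puct * p * sqrt(N) exceeds q - fpu. The function name and the constants below are hypothetical.

```python
import math

def min_playouts_before_first_visit(prior, q_visited=0.5, fpu=0.0, c_puct=1.5):
    """Lower bound on node playouts before an unvisited move can be selected.

    Textbook PUCT assumption: the unvisited move scores
    fpu + c_puct * prior * sqrt(N), and a visited rival scores more than
    q_visited, so we need sqrt(N) > (q_visited - fpu) / (c_puct * prior).
    """
    if prior <= 0:
        return math.inf
    gap = max(q_visited - fpu, 0.0)
    return (gap / (c_puct * prior)) ** 2

for prior in (0.01, 0.001, 0.0001):
    bound = min_playouts_before_first_visit(prior)
    print(f"prior {prior}: at least {bound:,.0f} playouts")
```

The bound grows as 1/p squared, so a move with a tenth of the prior needs roughly a hundred times as many playouts at that node before it is even looked at. That fits the picture of a policy blind spot being fixable in theory but very expensive in practice.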

Life In 19x19 http://www.lifein19x19.com/

How LZ reads out ladders http://www.lifein19x19.com/viewtopic.php?f=18&t=17298

Author:	Bill Spight [ Tue Mar 03, 2020 7:00 pm ]
Post subject:	Re: How LZ reads out ladders
FWIW, I found two examples of this position on Waltheri, 1941-00-00e, Sekiyama Riichi, 6 dan (W) vs. Nabeshima Ichiro, 4 dan, and 2001-09-16g, Zhu Songli, 5 dan (W) vs. Zhou Heyang, 9 dan.

[go]$$Wcm16 Test position 1a.
$$ ---------------------------------------
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . O . . . . |
$$ | . . . X . . . . . , . . . . . , O . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . , . . . . . , . . . . . , . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . 2 . . . . . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ | . . X O O 3 . . . . . . . . . . . . . |
$$ | . . X X O X 1 . . , . . . . . X . . . |
$$ | . . . O X O . . . . . . . . . . . . . |
$$ | . . . , X a b . . . . . . . . . . . . |
$$ | . . . . . . . . . . . . . . . . . . . |
$$ ---------------------------------------[/go]

Play continued as above in both games. In the Elf commentaries, for Elf recommends Ba - , Bb.

Author:	xela [ Tue Mar 03, 2020 7:32 pm ]
Post subject:	Re: How LZ reads out ladders
Bill Spight wrote:
FWIW, I found two examples of this position on Waltheri, 1941-00-00e, Sekiyama Riichi, 6 dan (W) vs. Nabeshima Ichiro, 4 dan, and 2001-09-16g, Zhu Songli, 5 dan (W) vs. Zhou Heyang, 9 dan.

Nice! For "research purposes", I've been adding a white stone at a (and a corresponding black stone at D2), because otherwise LZ keeps thinking about playing a as a forcing move in the middle of reading out the ladder. It doesn't seem to change the overall conclusions, but it makes the process of tracing the variations much messier.

Author:	Bill Spight [ Wed Mar 04, 2020 4:16 pm ]
Post subject:	Re: How LZ reads out ladders
Your mission, Mr. Phelps, should you choose to accept it. From Common Sense in Go (Kubomatsu, 1929, in Japanese).

All times are UTC - 8 hours [ DST ]