It is currently Fri Apr 19, 2024 2:39 pm

All times are UTC - 8 hours [ DST ]




Post new topic Reply to topic  [ 11 posts ] 
Author Message
Offline
 Post subject: Bot strength
Post #1 Posted: Sun Sep 06, 2020 3:28 pm 
Lives in sente
User avatar

Posts: 866
Liked others: 318
Was liked: 345
Has anyone found a convincing way to put AlphaGo Lee, AlphaGo Master, and AlphaGo Zero in a hierarchy with KataGo, Golaxy, and Fineart?

I remember reading last year that Golaxy's developers were very confident they and a FineArt had surpassed AlphaGo, though I’m not sure how they’d know. I guess they could review AlphaGo's self-plays and try to find positive surprises and mistakes.

How confident are we that KataGo has or has not surpassed AlphaGo?

_________________
- Brady
Want to see videos of low-dan mistakes and what to learn from them? Brady's Blunders

Top
 Profile  
 
Offline
 Post subject: Re: Bot strength
Post #2 Posted: Sun Sep 06, 2020 11:55 pm 
Lives in gote

Posts: 486
Location: Netherlands
Liked others: 270
Was liked: 147
Rank: EGF 3d
Universal go server handle: gennan
KataGo may have surpassed AlphaGo under equal conditions (millions of playouts per move). But a vast majority of KataGo users don't have the hardware to support such high number of playouts.
If we say that KataGo is stronger than AlphaGo, many may assume that KataGo on their mediocre laptop with only 1000 playouts per move is stronger than AlphaGo with millions of playouts per move and this may not be true.

Top
 Profile  
 
Offline
 Post subject: Re: Bot strength
Post #3 Posted: Mon Sep 07, 2020 12:24 am 
Judan

Posts: 6725
Location: Cambridge, UK
Liked others: 436
Was liked: 3719
Rank: UK 4 dan
KGS: Uberdude 4d
OGS: Uberdude 7d
Even with only tens of thousands of playouts, I think LeelaZero and KataGo are stronger than AlphaGo Lee during its match. I say this without solid proof, but my evidence is reviewing those games and where they differ the given sequence seem convincing reasons. Also stronger versions of AlphaGo identified AGLee making mistakes e.g the joseki shock peep in game 2 was an overplay and bad if Lee resisted which both AG teaching tool and LZ agree on. Also there are similarities in the preferences of the bots as they evolved and LZ used to like hanging connection in high approach to 3-4 but doesn't anymore (because it's not sente, solid is) and AG Zero has same preference so the fact AG Lee plays it is further evidence it's weaker and not so far along the evolution path.


This post by Uberdude was liked by: Gomoto
Top
 Profile  
 
Offline
 Post subject: Re: Bot strength
Post #4 Posted: Mon Sep 07, 2020 12:33 am 
Judan

Posts: 6145
Liked others: 0
Was liked: 788
gennan wrote:
KataGo may have surpassed AlphaGo under equal conditions (millions of playouts per move).


Roughly what hardware and how much thinking time per move do allow millions of playouts per move, IYO?

Top
 Profile  
 
Offline
 Post subject: Re: Bot strength
Post #5 Posted: Mon Sep 07, 2020 6:44 am 
Gosei
User avatar

Posts: 1349
Liked others: 202
Was liked: 203
gennan wrote:
KataGo may have surpassed AlphaGo under equal conditions (millions of playouts per move). But a vast majority of KataGo users don't have the hardware to support such high number of playouts.
If we say that KataGo is stronger than AlphaGo, many may assume that KataGo on their mediocre laptop with only 1000 playouts per move is stronger than AlphaGo with millions of playouts per move and this may not be true.

also "vast majority users don't have" the opportunity to test the AlphaGo! what do you compare with what? :)

Top
 Profile  
 
Offline
 Post subject: Re: Bot strength
Post #6 Posted: Mon Sep 07, 2020 7:11 am 
Lives in gote

Posts: 445
Liked others: 0
Was liked: 37
According to DeepMind the strongest version of AlphaGo was AlphaGo Zero 40b.

It is very likely that even KataGo surpassed its strength by now (on hw parity), since AGZ worked without liberty and ladder input, which should definitely amount to a noticeable bonus (effective net size increase) when present. Not being score-blind should also give strength increase (training with aux input-output gives stronger results even when the aux part is not used later).

FineArt and Golaxy is likely even further ahead at the moment. OC, the practical question is hardware.

Top
 Profile  
 
Offline
 Post subject: Re: Bot strength
Post #7 Posted: Mon Sep 07, 2020 9:08 am 
Lives in gote

Posts: 486
Location: Netherlands
Liked others: 270
Was liked: 147
Rank: EGF 3d
Universal go server handle: gennan
RobertJasiek wrote:
gennan wrote:
KataGo may have surpassed AlphaGo under equal conditions (millions of playouts per move).


Roughly what hardware and how much thinking time per move do allow millions of playouts per move, IYO?


I'm no expert. I only know some anecdotes:

In August 2020 @goame reported he got roughly 100k playouts per minute with KataGo 40-block 384 channel network running on 2x RTX2080 Ti and 64 GB RAM.

In 2017 DeepMind made their AlphaGo teaching tool (an opening database) and it seems they got roughly 1M playouts per minute with AlphaGo Master running on their hardware. I don't know what that was, perhaps 4 TPUs? It must have been pretty powerful.

Top
 Profile  
 
Offline
 Post subject: Re: Bot strength
Post #8 Posted: Tue Sep 08, 2020 3:54 pm 
Lives in sente
User avatar

Posts: 866
Liked others: 318
Was liked: 345
Attachment:
7594EA93-8927-4F49-9FEE-7ECE2D6BB862.jpeg
7594EA93-8927-4F49-9FEE-7ECE2D6BB862.jpeg [ 77.44 KiB | Viewed 7109 times ]


I see that somebody on reddit tried to answer this (not rigorously) a couple of months ago.

https://www.reddit.com/r/baduk/comments ... g_for_ais/

_________________
- Brady
Want to see videos of low-dan mistakes and what to learn from them? Brady's Blunders

Top
 Profile  
 
Offline
 Post subject: Re: Bot strength
Post #9 Posted: Wed Sep 09, 2020 12:23 am 
Lives in gote

Posts: 486
Location: Netherlands
Liked others: 270
Was liked: 147
Rank: EGF 3d
Universal go server handle: gennan
I saw that post too, but it looks like the absolute Elo ratings used there have no relation to other go rating systems. Only the relative Elo ratings may have some meaning, but the meaning is not much more than a simple ranking IMO (ordering the list by strength).

Top
 Profile  
 
Offline
 Post subject: Re: Bot strength
Post #10 Posted: Wed Sep 09, 2020 5:32 am 
Lives in sente

Posts: 1037
Liked others: 0
Was liked: 180
Not only failing to report at what "number of visits" but also "real time" (time control)

It is not just equality of visits that matters (if that measure used) because the number chosen might be before a "knee" fdor one but not another.

And of course "real time" is the true/correct measure since that can change number of visits and not necessarily equally. I consider "equal real time" to be the correct measure, since go is played with time controls. If we want to compare to human players, that must be a speeds used for human go. If asking whether a program is up the strength of a top 9p that time control should be what might be used for a top pro title challenge game. Say a minute/move.

Top
 Profile  
 
Offline
 Post subject: Re: Bot strength
Post #11 Posted: Wed Sep 09, 2020 10:30 am 
Lives in sente
User avatar

Posts: 866
Liked others: 318
Was liked: 345
Regarding the table above, the poster did make clear it was just for fun. He did the best he could, given the lack of direct comparison. I was shocked when lightvector published his final ELO rating comparisons to prior versions and to LZ and Elf. The table incorporates these real world comparisons. Where do you think AG fits in?

I get the equal time argument, but it would be easier to defend equal time and equivalent hardware. After all, AG Lee used tons of TPU's (playouts) to overcome its relative weakness. Even after AG-Lee, Deepmind used 4TPU's which is super-fast. FineArt supposedly uses hundreds of gpu's.

IMHO, to understand the true strength of the bot, you shouldn’t handicap with time. Let FineArt use all their GPU's but let fat-Katago do so too. Or give KataGo all the time it wants.

Finally and separately, the closed bots have a big advantage. They have access to the best open bots. I remember rumors that within weeks of the recent g-170 katago release, at least one closed bot started playing a sequence it hadn’t played before, one that katago favored. It makes sense they would be strongest. Please note that I haven’t confirmed these rumors. I’d love to see some evidence!

_________________
- Brady
Want to see videos of low-dan mistakes and what to learn from them? Brady's Blunders

Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 11 posts ] 

All times are UTC - 8 hours [ DST ]


Who is online

Users browsing this forum: No registered users and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Jump to:  
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group