AlphaZero paper discussion (Mastering Go, Chess, and Shogi)

Uberdude · **#21**

dfan wrote:

Uberdude wrote:

[re contempt] I don't know if other engines do anything clever like evaluating opponent's previous moves to work out they are weaker so they can get away with overplays and go for a win.

This would be hard unless you had multiple games to go on (and no chess engines that I know of have a "meta-game" like that), since by the problem setup your opponent's moves so far in this game have been about as good as your own.

Yeah, from one game would be hard, but if you could tell the bot who the opponent was and its known weaknesses or at least general level maybe. But look what I found wandering the internet:

https://en.wikipedia.org/wiki/Junior_(chess) wrote:

Another approach its designers claim to use is 'opponent modeling'; Junior might play moves that are not objectively the strongest but that exploit the weaknesses of the opponent. According to Don Dailey "It has some evaluation that can sting if it's in the right situation—that no other program has."[2]

Uberdude · **#22**

A few comments from a low-dan Go and somewhat better chess playing friend:

Me wrote:

Boris, as a chess and go player can you give your impression of how much AlphaZero plays standard chess opening book lines versus going off into uncharted opening territory? I always thought human (helped by bots recently) knowledge of chess openings was closer to perfect play than for go and there was less chance to play novel early moves which weren't suboptimal (chess openings being whole board rather than just a corner like in go, and sharper). So in go AlphaGo Zero is playing 3-3 invasions before move 10 and surprising us (starting in corners and approaching with knight's moves we got right though!) and not playing many of our long josekis. Is it similar in chess, making innovations in the first few moves, or does it do the normal book opening for the first 10 (or however many) moves (ply) and then start winning with new things after that? https://www.reddit.com/r/chess/comments ... a/dqubdxz/ suggests maybe some of the former but I don't understand chess enough.

Boris wrote:

That's a great question and I think your intuition is right. There doesn't seem to be any innovation in early moves. I'd argue that's a property of chess being a simpler game and of the ability to analyse much further than in go (all 3-3-like moves would've been tried by strong players at some point). I've only looked through openings without a board - I'll update my answer once I've seen how it slowly grinds a win. On average the games seem to be longer than human games.

My impression of the style is that it has an extremely strong positional understanding. It evaluates and handles the most complex positions of unbalanced material very well. The way in which it keeps activity and slowly converts the superior placement of pieces in gambit situations (it sacrifices pawns happily) is impressive and reminiscent of the most memorable games by top players from the past. Botvinnik vs Tal match comes to mind.

John Fairbairn · **#23**

Quote:

He [Habu] also got to play with AlphaGo!

Not really - they only played 20 moves.

Quote:

But I'm not sure how strong he is in Go.

Unless he's improved recently (and why would he even try?), he plays pros on 6 stones or more.

John Fairbairn · **#24**

If you allow yourself to ponder on this, it just gets worse Kirby, I'm afraid. Huxley once said an intellectual is someone who has found something more interesting than sex. One might be tempted now, therefore, to say "sod it" and go back to prioritising sex. But David Levy, one of the big names in computer chess, recently did a late-in-life PhD at the Sorbonne on computers and sex, and this is now in book form as "Love and Sex with Robots." He even goes so far as to say that continuing advances in computers and robotics "will make legal marriages between Homo and Robo feasible by mid-century."

Best to take a detached Zen-like view, and recall that life, like go, is a game of co-existence.

dfan · **#25**

Uberdude wrote:

But look what I found wandering the internet:

https://en.wikipedia.org/wiki/Junior_(chess) wrote:

Another approach its designers claim to use is 'opponent modeling'; Junior might play moves that are not objectively the strongest but that exploit the weaknesses of the opponent.

Yeah, it seems I was wrong that people aren't looking at the metagame: more information here. It's difficult to implement in the current environment, though, because the standard interface for talking to a chess engine (UCI) doesn't tell it who it's playing. So any between-game hyperparameter tuning has to be done manually by an operator.

(Of course this sort of opponent modeling is totally essential for exploitative play in poker, say.)

Perhaps in the future there will be computer matches in an environment where the engines have the ability to modify themselves between games. That could be pretty cool.

pookpooi · **#26**

John Fairbairn wrote:

Unless he's improved recently (and why would he even try?)

He's likely the best human candidate against AlphaZero, strongest Shogi professional ever, International Chess Master, Amateur dan Go player.

johnsmith · **#27**

One thing in modern chess engines is that, to know who is better at the current position, you need to look at the estimate that shows how many pawns you are ahead. It's not percentages like in go.

For example: in all of these games Stockfish at one point thought it is 1.10 ahead, but in reality, Alpha Zero may have thought it is winning by 75% which is much more convenient. Isn't it? Maybe chess players were thinking in the wrong direction. People are saying that Alpha Zero is making a lot of long-term sacrifices.
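To make the pawns-versus-percentages comparison concrete, here is a minimal sketch of one rule-of-thumb conversion between the two scales. The logistic shape and the `scale=4.0` constant are assumptions taken from a commonly quoted heuristic; this is not how Stockfish or AlphaZero compute anything internally, and the function names are made up for illustration.

```python
import math

def win_probability(pawns, scale=4.0):
    """Map a pawn advantage to a rough expected score.

    Assumption: a logistic curve with base 10 and scale 4, a
    rule of thumb only, not an engine's real evaluation.
    """
    return 1.0 / (1.0 + 10.0 ** (-pawns / scale))

def pawns_for(prob, scale=4.0):
    """Inverse mapping: pawn advantage matching a given expected score."""
    return scale * math.log10(prob / (1.0 - prob))

if __name__ == "__main__":
    # The two figures mentioned in the post above.
    print(f"+1.10 pawns -> {win_probability(1.10):.1%} expected score")
    print(f"75% expected score -> {pawns_for(0.75):+.2f} pawns")
```

Under this assumed mapping the two numbers in the post are not that far apart, so the difference between the engines may lie more in how the estimate is produced than in which unit it is reported in.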

moha · **#28**

The chess part seems very interesting. Silver himself quoted chess and checkers as examples where search is absolutely essential (because of the tactical nature), and knowledge-based approaches are not appropriate. This is still about search of course, but I would have bet blindly that NNs are not a good deal here for their gain / slowdown ratio (I assumed 100 times, but the paper says 1000 times - and I thought the handcrafted evaluations are already reasonably good).

But in retrospect, there is some logic in this. The other day I peeked at the Houdini-Komodo match, and it seemed the current state of the art is roughly 50M pos/s, for about 5G positions per move selection, with a typical depth or PV length of 30 plies. This would mean a searched branching factor of 2 with full minimax, so on average maybe 4 or more with perfect alphabeta (even more actually, because of selective depth). In any case, the 1000 times slowdown would be regained at a successful branching factor reduction of about 1.25 times, which is not unrealistic.
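The back-of-the-envelope numbers above can be checked with a few lines. This is only a sketch of the arithmetic in the post: it assumes a uniform tree, takes the quoted 5G positions, 30 plies and 1000x slowdown at face value, and uses the textbook approximation that perfect alpha-beta visits about b^(d/2) nodes. The variable names are illustrative.

```python
nodes_per_move = 5e9   # "about 5G per move selection"
depth = 30             # "typical depth or PV length of 30 plies"
slowdown = 1000        # slowdown factor quoted from the paper

# Effective (searched) branching factor: b_eff ** depth == nodes.
b_eff = nodes_per_move ** (1.0 / depth)

# Perfect alpha-beta searches roughly b ** (depth / 2) nodes, so the
# underlying branching factor is about the square of the effective one.
b_underlying = b_eff ** 2

# Reduction r in the effective branching factor that pays back the
# slowdown at the same depth: (b_eff / r) ** depth == nodes / slowdown.
break_even = slowdown ** (1.0 / depth)

if __name__ == "__main__":
    print(f"effective branching factor: {b_eff:.2f}")
    print(f"underlying factor with perfect alpha-beta: {b_underlying:.2f}")
    print(f"break-even reduction: {break_even:.3f}x")
```

The results come out close to the figures in the post (about 2, a little over 4, and roughly 1.26), under the stated simplifications.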

Kirby wrote:

it makes me question the meaning in life for humans. If it's intelligence, then computers are beginning to have more meaning than humans.

That is still in the future. IMO intelligence means ability to solve previously unseen tasks, so I'm not sure if these examples qualify, even as early birds. Unless matchboxes are intelligent.

Bill Spight · **#29**

johnsmith wrote:

One thing in modern chess engines is that, to know who is better at the current position, you need to look at the estimate that shows how many pawns you are ahead. It's not percentages like in go.

For example: in all of these games Stockfish at one point thought it is 1.10 ahead, but in reality, Alpha Zero may have thought it is winning by 75% which is much more convenient. Isn't it? Maybe chess players were thinking in the wrong direction. People are saying that Alpha Zero is making a lot of long-term sacrifices.

In traditional go evaluation, if you are ahead on points and have the move, you will win with correct play, unless there is a ko that enables your opponent to win. Otherwise, even knowing the exact point value of the position is not enough to say whether you are winning or not. That is one reason that the win rate of quasi-random rollouts was more effective for MCTS bots than the average winning point margin. In go, if you know not only the point value of the position but also the temperature, then you can make a pretty good estimate of who is ahead. For instance, if your opponent is 1½ pts. ahead, you have the move, and the temperature is 3, the game is very close; if the temperature is 1, you are a goner without a ko or the opponent's error; if the temperature is 5, you are the favorite. These temperatures might translate to an estimated win rate of less than 5% for temperature 1, 50% for temperature 3, and 66% for temperature 5. Note the non-linearity. These are my estimates of actual win rates, not win rates with quasi-random play. Monte Carlo win rates might be more like 30%, 50%, and 60%. Quien sabe?

Anyway, it may be that go playing programs that made use of estimates of both point value and temperature would perform better than programs that make use of ill defined "win rates". AFAIK, no strong program has utilized both estimates, so who knows? In any event, unlike win rates, point values and temperatures are well defined in go, even if they may be difficult to calculate in many positions.

In chess engines, point evaluations in terms of hundredths of a pawn have been very effective. OTOH, knowledge of temperature is quite crude. Is there any more accuracy than quiescent vs. non-quiescent? It is interesting that AlphaZero's use of "win rate" has proved more effective than the use of point evaluations by Stockfish. Chess seems to have no exact theory of temperature, so is pretty well stuck with point evaluations or, now, win rates, neither of which is well defined.

Edit: "Is there any more accuracy than quiescent vs. non-quiescent?" Well, yes. Checkmate and stalemate are precisely as cool as you can get.

Bill Spight · **#30**

moha wrote:

Kirby wrote:

it makes me question the meaning in life for humans. If it's intelligence, then computers are beginning to have more meaning than humans.

That is still in the future. IMO intelligence means ability to solve previously unseen tasks, so I'm not sure if these examples qualify, even as early birds. Unless matchboxes are intelligent.

Yes, I regard intelligence as the ability to do something well that you have never done before.

Fairly obviously, I think that there are different kinds of intelligence.

moha · **#31**

Bill Spight wrote:

moha wrote:

IMO intelligence means ability to solve previously unseen tasks, so I'm not sure if these examples qualify, even as early birds. Unless matchboxes are intelligent.

Yes, I regard intelligence as the ability to do something well that you have never done before.

Do you know where this definition comes from? I have used it since my childhood, I don't remember the origin but I doubt I came up with it myself. A few years ago I tried to look up the source but failed.

Most contemporary definitions seem to include things like lexical knowledge and its application ability, so very different. And there is this one too: "intelligence is what intelligence tests measure".

Uberdude · **#32**

How about intelligence is knowing there are better things to do than join Mensa

Bill Spight · **#33**

moha wrote:

Bill Spight wrote:

moha wrote:

IMO intelligence means ability to solve previously unseen tasks, so I'm not sure if these examples qualify, even as early birds. Unless matchboxes are intelligent.

Yes, I regard intelligence as the ability to do something well that you have never done before.

Do you know where this definition comes from? I have used it since my childhood, I don't remember the origin but I doubt I came up with it myself. A few years ago I tried to look up the source but failed.

Most contemporary definitions seem to include things like lexical knowledge and its application ability, so very different. And there is this one too: "intelligence is what intelligence tests measure".

Well, it is not exactly a definition, but I came to it when I heard a real story about someone doing something well that they had not done before. What primed me to think that that was intelligence, I don't know.

OC, I had long doubted whether IQ tests test anything, factor analysis be damned.

Kirby · **#34**

Bill Spight wrote:

moha wrote:

Kirby wrote:

it makes me question the meaning in life for humans. If it's intelligence, then computers are beginning to have more meaning than humans.

That is still in the future. IMO intelligence means ability to solve previously unseen tasks, so I'm not sure if these examples qualify, even as early birds. Unless matchboxes are intelligent.

Yes, I regard intelligence as the ability to do something well that you have never done before.

Fairly obviously, I think that there are different kinds of intelligence.

@moha: Nice video. That matchbox experiment is pretty cool.

@Bill: I'd say that I'm not really intelligent at anything, then. I rarely do something well the first time I try it. Only after practice can I get any sort of competency.

John Fairbairn wrote:

Best to take a detached Zen-like view, and recall that life, like go, is a game of co-existence.

Good advice, John.

Monadology wrote:

An often overlooked implication of rooting the meaning of human life or the moral status of humans in our special intellectual capacities is that human beings lacking them have less meaningful (or non-meaningful) lives or do not have the moral status of unimpaired human beings. This is an implication that I certainly find problematic, and so it tends to undermine the idea that we should base our understanding of these concepts in the special intellectual capacities of human beings.

You bring up a good point, Monadology. I also find it problematic to equate moral status with particular intellectual capacities, even for unimpaired human beings. My perspective was more reflective in terms of finding self-worth, perhaps (e.g. what makes me valuable as a human being?). But maybe the implications are similar. If I try to equate my value as a human being, I suppose there is still the aspect of being a father to my children, and a husband to my wife. I'd say that I bring some value from that dimension, which can't currently be replaced by machines. Maybe that status could be replaced by another person, but I hope it won't be.

Maybe people are like go stones - no inherent value, but useful at the right place and the right time, under the right circumstances, working in coordination with the rest of the universe...

Or maybe I'm just pushing an analogy too far :-p

sorin · **#35**

pookpooi wrote:

EdLee wrote:

Quote:

I'm wondering why DeepMind choose Shogi out of many board games beside Go

Another usual suspect: Xiangqi ; ...Chaturanga.

For diplomatic reasons, choosing Xiangqi to conquer by (non-Chinese) AI is a really bad choice.

Really? How about Go then? :-)

pookpooi · **#36**

sorin wrote:

Really? How about Go then? :-)

See the 'Chinese coverage' part of Future of Go Summit article in Wikipedia

Uberdude · **#37**

pookpooi wrote:

sorin wrote:

Really? How about Go then? :-)

See the 'Chinese coverage' part of Future of Go Summit article in Wikipedia

I thought that was more of the Chinese government not wanting to give free PR to a foreign tech company which doesn't play by their censorship rules than being upset about foreigners doing part of their culture better with computers. I presume the best Chinese chess players are Chinese though (and the bots too? how good are they?) so if you wanted another exhibition match that'd be in China and face the same PR problems.

Bill Spight · **#38**

Kirby wrote:

I rarely do something well the first time I try it. Only after practice can I get any sort of competency.

Well, that's how to get to Carnegie Hall.

djhbrown · **#39**

Kirby wrote:

I've been quite interested in machine learning, and artificial intelligence in general, since college.

Me too; in my case, that's since 1971; but i don't think that during that time AI has advanced much compared to advances in computer hardware, despite the current hullabaloo. For sure, Alfie's impressive achievements are remarkable, but could it be that they tell us more about the nature of games like chess and Go than they do about the nature of intelligence?

To examine that question deeper than handwaving, we need to be aware that Alfie's method is basically a statistical number-crunching approach, which requires hardware able to read (probabilistically) all the way to the end of the game and back again gerzillions of times each time she has to make a move.

She can do this because although the game tree of Go is kind of big (sic), it's not that deep - at most 361. To be fair, her probabilistic navigator is sufficiently better than random to weed out enough nonsense that she can find moves good enough to beat people at the game they invented.

Alfie is as impressive at playing Go as a mobile phone is at recording and transmitting information - and considering the state of the telephony art 50 years ago, that's pretty impressive.

But is Alfie as smart as, say, an acacia tree that responds to overgrazing by releasing poison gas to ward off kudu?
https://www.youtube.com/watch?v=Q-4w5xYLwiU
Or a Douglas Fir that nurtures its young?
https://www.youtube.com/watch?v=CrrSAc-vjG4

The robot football world cup maybe gives us a better idea about how smart today's machines are.
https://www.youtube.com/watch?v=R2S9x0gUkpM

PS Needless to say, i think Swim is more intelligent than Alfie, but i suppose you could justifiably argue that i am a bit biased...

PPS Don't fret Kirby, your existence is not as meaningless as that of a Go stone, albeit no more meaningful than that of an acacia tree:
https://www.youtube.com/watch?v=wq3B5prBsK0

John Fairbairn · **#40**

Quote:

I thought that was more of the Chinese government not wanting to give free PR to a foreign tech company which doesn't play by their censorship rules than being upset about foreigners doing part of their culture better with computers

I think this is right. When I was in China a couple of months ago, talking to politicians, the people who run Chinese go and pros, AlphaGo was mentioned approvingly over and over again (and FineArt not once, I think). DeepMind got some mentions, but Google was never mentioned, except in a couple of conversations where I was reminded (by ordinary punters, not officials) about censorship. But actually it's similar in Japan, except that lack of mentions (though not total) of Google (but plenty of AlphaGo and DeepMind and Hassabis mentions) is due to self-censorship. Although there are far, far fewer people who talk about go in Britain, I'm certain it's close to that situation here, too (plus DeepMind is British, Google isn't). And throughout the world, although billions of people happily use Google/Amazon/Facebook etc, if you did a word association test for each company, I'm sure you'd get lots of words in the "evil" nexus.

Quote:

Me too; in my case, that's since 1971; but i don't think that during that time AI has advanced much compared to advances in computer hardware, despite the current hullabaloo. For sure, Alfie's impressive achievements are remarkable, but could it be that they tell us more about the nature of games like chess and Go than they do about the nature of intelligence?

I have tried to warn against treating AlphaGo as a cargo cult, so welcome this reminder from an AI professional. I'm also constantly reminded of something that Michael Redmond said: go is a very simple game. It may be hard for humans but change the tool and Redmond's proven right. Inserting a screw in a brick wall by hand is not just hard for humans, but impossible. Yet with the right tools it's a doddle. (And we don't get any Angst about that, do we?)
