OK. And in this case why You had begun this discussion, after I had written, that Your tests are "synthetic" due to unreal low number of playouts in them? (In anticipation: the answer on this question isn't quite simple for me...)as0770 wrote: If you use little time/playouts, you can determine the strength with little time/playouts. If you want to know the strength with much time/playouts you have to play with much time/playouts. In both cases you need the same amount of games to get a statistical significant result. Quite simple, isn't it?
Engine Tournament
-
q30
- Lives with ko
- Posts: 145
- Joined: Sat Aug 13, 2016 8:23 am
- Rank: 30 kyu
- GD Posts: 0
- Has thanked: 1 time
- Been thanked: 1 time
Re: Engine Tournament
-
as0770
- Lives with ko
- Posts: 180
- Joined: Sun Jun 26, 2016 8:07 am
- Rank: Beginner
- GD Posts: 0
- Has thanked: 15 times
- Been thanked: 23 times
Re: Engine Tournament
You can start reading at #227.jann wrote:I haven't seen such claimsas0770 wrote:Some mean that you can measure the strength with a few games as long as the quality is good enough.
What do you mean with "in this case"? I neither wrote a novelty, nor any contradiction. Seriously I am not sure if you just don't understand anything or if you are trying to fool us.q30 wrote:OK. And in this case why You had begun this discussion, after I had written, that Your tests are "synthetic" due to unreal low number of playouts in them? (In anticipation: the answer on this question isn't quite simple for me...)as0770 wrote: If you use little time/playouts, you can determine the strength with little time/playouts. If you want to know the strength with much time/playouts you have to play with much time/playouts. In both cases you need the same amount of games to get a statistical significant result. Quite simple, isn't it?
I won't start this argument once again. I played 1h and 2h games on 1-4 cores. This is not such a low number of playouts especially since you later quote games with 3000 visits. Your comment's where just disrespectful and not well-founded.
-
q30
- Lives with ko
- Posts: 145
- Joined: Sat Aug 13, 2016 8:23 am
- Rank: 30 kyu
- GD Posts: 0
- Has thanked: 1 time
- Been thanked: 1 time
Re: Engine Tournament
In case You understand, that You wrote...as0770 wrote:What do you mean with "in this case"? I neither wrote a novelty, nor any contradiction. Seriously I am not sure if you just don't understand anything or if you are trying to fool us.q30 wrote:OK. And in this case why You had begun this discussion, after I had written, that Your tests are "synthetic" due to unreal low number of playouts in them? (In anticipation: the answer on this question isn't quite simple for me...)as0770 wrote: If you use little time/playouts, you can determine the strength with little time/playouts. If you want to know the strength with much time/playouts you have to play with much time/playouts. In both cases you need the same amount of games to get a statistical significant result. Quite simple, isn't it?
I won't start this argument once again. I played 1h and 2h games on 1-4 cores. This is not such a low number of playouts especially since you later quote games with 3000 visits. Your comment's where just disrespectful and not well-founded.
On Your once again contradiction (that You never wrote)... The game longitude must depend on number of moves: it must not be the same for 91 moves and 291 moves games. I never used visit limitation in my tests. May be I had quoted other test in some context... I limit only move time (except one additional match with limited playouts number). I never try to found on shell visits number, but due to continuously playouts output on shell, it's more conveniently to be guided on playouts number. You can see my tests playouts number for LeelaZero with different neuronets weights categories here.
-
Mike Novack
- Lives in sente
- Posts: 1045
- Joined: Mon Aug 09, 2010 9:36 am
- GD Posts: 0
- Been thanked: 182 times
Re: Engine Tournament
Please lets calm this down. I think it is clear to some of us that at least part of the problem is with language. I'm pretty sure not all are native English speakers.
-
as0770
- Lives with ko
- Posts: 180
- Joined: Sun Jun 26, 2016 8:07 am
- Rank: Beginner
- GD Posts: 0
- Has thanked: 15 times
- Been thanked: 23 times
Re: Engine Tournament
If one prefers a time setting of "games in 120 minutes" or "1s/move" is a matter of taste. Time in X means that an engine usually uses most of the time for the first 200 moves, and plays faster when the game is already decided. There are good reasons for both options. No need to offend someone if he does it either way.q30 wrote:On Your once again contradiction (that You never wrote)... The game longitude must depend on number of moves: it must not be the same for 91 moves and 291 moves games. I never used visit limitation in my tests. May be I had quoted other test in some context... I limit only move time (except one additional match with limited playouts number). I never try to found on shell visits number, but due to continuously playouts output on shell, it's more conveniently to be guided on playouts number. You can see my tests playouts number for LeelaZero with different neuronets weights categories here.
Still I have no clue what contradiction you are talking about...
-
q30
- Lives with ko
- Posts: 145
- Joined: Sat Aug 13, 2016 8:23 am
- Rank: 30 kyu
- GD Posts: 0
- Has thanked: 1 time
- Been thanked: 1 time
Re: Engine Tournament
About time:as0770 wrote:If one prefers a time setting of "games in 120 minutes" or "1s/move" is a matter of taste. Time in X means that an engine usually uses most of the time for the first 200 moves, and plays faster when the game is already decided. There are good reasons for both options. No need to offend someone if he does it either way.q30 wrote:On Your once again contradiction (that You never wrote)... The game longitude must depend on number of moves: it must not be the same for 91 moves and 291 moves games. I never used visit limitation in my tests. May be I had quoted other test in some context... I limit only move time (except one additional match with limited playouts number). I never try to found on shell visits number, but due to continuously playouts output on shell, it's more conveniently to be guided on playouts number. You can see my tests playouts number for LeelaZero with different neuronets weights categories here.
Still I have no clue what contradiction you are talking about...
1) "Engine usually uses" - doesn't mean engines tests equivalency...
2) On 200 move the game may not be decided...
So one option may be good for play with human, and other - for engines tests.
Contradiction is that 1h game on 1 core and 2h game on 4 cores couldn't be the tests for determining one rating of engines, because:
If you use little time/playouts, you can determine the strength with little time/playouts. If you want to know the strength with much time/playouts you have to play with much time/playouts.
-
as0770
- Lives with ko
- Posts: 180
- Joined: Sun Jun 26, 2016 8:07 am
- Rank: Beginner
- GD Posts: 0
- Has thanked: 15 times
- Been thanked: 23 times
Re: Engine Tournament
Your understanding of equivalency without variety would mean you can only play two games.q30 wrote:"Engine usually uses" - doesn't mean engines tests equivalency...
This has nothing to do with what I wrote. Once again, this kind of discussion don't work with online translators.q30 wrote:Contradiction is that 1h game on 1 core and 2h game on 4 cores couldn't be the tests for determining one rating of engines,
-
q30
- Lives with ko
- Posts: 145
- Joined: Sat Aug 13, 2016 8:23 am
- Rank: 30 kyu
- GD Posts: 0
- Has thanked: 1 time
- Been thanked: 1 time
Re: Engine Tournament
I use translator only for translation to English of some words while I'm posting message.
So, if You want, that most of people there (who have already written that they aren't native English speakers) understand You right, try to use simple unambiguous terminology without any beautiful, but superfluous words and phrases, please.
So, if You want, that most of people there (who have already written that they aren't native English speakers) understand You right, try to use simple unambiguous terminology without any beautiful, but superfluous words and phrases, please.
-
Bill Spight
- Honinbo
- Posts: 10905
- Joined: Wed Apr 21, 2010 1:24 pm
- Has thanked: 3651 times
- Been thanked: 3373 times
Re: Engine Tournament
Since language appears to be a problem, perhaps it would help to restate what you think the other person said before offering a reply. This can help even when everybody speaks the same language.q30 wrote:I use translator only for translation to English of some words while I'm posting message.
So, if You want, that most of people there (who have already written that they aren't native English speakers) understand You right, try to use simple unambiguous terminology without any beautiful, but superfluous words and phrases, please.
The Adkins Principle:
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
At some point, doesn't thinking have to go on?
— Winona Adkins
Visualize whirled peas.
Everything with love. Stay safe.
-
q30
- Lives with ko
- Posts: 145
- Joined: Sat Aug 13, 2016 8:23 am
- Rank: 30 kyu
- GD Posts: 0
- Has thanked: 1 time
- Been thanked: 1 time
Re: Engine Tournament
OK, but if restate will be permitted in native language...Bill Spight wrote:Since language appears to be a problem, perhaps it would help to restate what you think the other person said before offering a reply. This can help even when everybody speaks the same language. :)q30 wrote:I use translator only for translation to English of some words while I'm posting message.
So, if You want, that most of people there (who have already written that they aren't native English speakers) understand You right, try to use simple unambiguous terminology without any beautiful, but superfluous words and phrases, please.
Last edited by q30 on Sat Jan 11, 2020 2:17 am, edited 1 time in total.
-
q30
- Lives with ko
- Posts: 145
- Joined: Sat Aug 13, 2016 8:23 am
- Rank: 30 kyu
- GD Posts: 0
- Has thanked: 1 time
- Been thanked: 1 time
Re: Engine Tournament
The best "middleweight" neuronet in 2019 year is 20b_254_784k_q, but it's obviously weaken, than "welterweight" best nets.
Moreover, the 2019 year winner of "welterweight class" 15b_249a_296k_q overpowered the best of "heavyweight" net (details)...
Moreover, the 2019 year winner of "welterweight class" 15b_249a_296k_q overpowered the best of "heavyweight" net (details)...