That was one of my concerns. I wasn't sure how those players with rating 100 came to be in the database; my intuition (based on the exact 100 rating) was that these players were registered but did not play a single game and therefore their rating couldn't be trusted. If this was the case, then removing them removes noise; but I don't think that is an unbiased removal, because even if that assumption is correct, they are likely to have below average rating so they shouldn't simply be removed, but rather accounted for somehow..HermanHiddema wrote:If you remove the bottom 14% of go players, you should also remove the bottom 14% of chess players, IMO, to get a fair comparison. The EGF system has an artificial bottom, most of those rating 100 players are somewhere between 20k and 35k, and should have negative ratings.uPWarrior wrote: (I had to remove everyone with a EGF rating of 100, as 5500 players clustered at this exact rating)
Regarding the rest of the discussion, I don't know if FIDE and EGF reach a similar set of players.
I don't play chess, so I don't know of many chess online servers. Some of them might have this data public or obtainable through an API - I wouldn't mind comparing it with OGS or some other Go server.