It is currently Wed May 07, 2025 5:54 am

All times are UTC - 8 hours [ DST ]




Post new topic Reply to topic  [ 3 posts ] 
Author Message
Offline
 Post subject: How to properly compile a Go database - a case study
Post #1 Posted: Sat Jul 19, 2014 11:13 am 
Lives in sente

Posts: 928
Location: UK
Liked others: 72
Was liked: 480
Rank: 5 dan
KGS: macelee
Compiling a Go game record database is a hard job! Old games before the Internet age are almost always transcribed from printing materials, such as Go yearbooks and other game collections published in oriental countries. Before the wide adoption of computer-based typesetting systems, Go games in these books had to be prepared manually, which means human errors were inevitable. Good editors of game collections need to be aware of this problem, and spend time to fix such human errors wherever possible.

Here is a real example - the game in question was played between Yan An 7-dan and Wang Hui 6-dan during the 1992 Chinese Individual Championship. The game record can be found on page 1488 of a set of Chinese books published in early 1990's - the set widely known as '4000 games'. The printed game records clearly contain errors. Because the players are not exactly famous and the tournament not very popular, it's unlikely that we can find an independent source. So some hard work is required to fix it.

Here I will show three versions of this game record:



The first version was taken from an early version of GoGoD CD (game ID 1992-09-02d). The highly respected T Mark Hall apparently spot the problem too. He managed to fixed move 44 which was printed one cell to the left in the book. But he had to stop at move 169, and left a note saying "the printed record is defective from (at least) move 168, which probably needs to be transposed with 169 because a ko fight is shown there, but the subsequent moves do not make sense and are omitted here. 235 moves were shown".



The second version can be found on 'gobase.org'. The gobase owner also attempted to fix the game, in a different way. As we can see, his fix was not correct either, because in the end there's a big ko at H6 which is not settled.



When I compiled this game for Go4Go, I first fixed Yan An's rank as there's clear evidence that he became 7-dan 3 years earlier. Then I spent several hours on the move sequence and finally found a sensible way to fix it. I realised that move 101, on the same row as move 44, was also shifted one cell to the left - this isn't obvious because move 101 as printed on the book doesn't look like an incorrect move. But as you can see once I shifted this stone, the game can progress natually to the end. Solving such problem is like working on jigsaw puzzles, it is hard to do, but once you find the solution you know it is correct.

Over the years I have fixed about 100 games like the one above. It was immensely satisfying to be able to solve such puzzles. I hope you agree with me that when I say that Go4Go collection is of the best quality there is a very good reason to make that claim.

Unfortunately, there are always people there not respecting these hard works. Recently I found a big 'collection' of games that apparatently contain most games I manually fixed. A while ago, another collection appeared online and claimed to be in 'public domain' apparently contains GoGoD's version of the game above. Ask yourself this question: what is the probability for an independent author to use exactly the following result field as GoGoD does and also use GoGoD's signature way for Chinese Komi:

"RE[B+R (moves beyond 169 not given)]"
"KM[2.75]"

It takes a lot of more work than downloading a large quantity of games online and processing them with computer programs to create a Go game collection. When a new game collection is out there from nowhere, there's almost certainly something fishy in it. Please think twice before giving your support to those new 'collections'.


This post by macelee was liked by 8 people: betterlife, Bonobo, emeraldemon, ez4u, gasana, Marcus, RBerenguel, SoDesuNe
Top
 Profile  
 
Offline
 Post subject: Re: How to properly compile a Go database - a case study
Post #2 Posted: Sat Jul 19, 2014 11:33 am 
Oza
User avatar

Posts: 2221
Location: Germany
Liked others: 8268
Was liked: 924
Rank: OGS 9k
OGS: trohde
Universal go server handle: trohde
Forgive me, please, you are a MUCH stronger player than I am, but I don’t understand this:
macelee wrote:
[..]

The second version [..] As we can see, his fix was not correct either, because in the end there's a big ko at H6 which is not settled. [..]
What I don’t understand is this: why should the unsettled Ko be an indication for incorrect transcription when “B+R”? As said, I am a weak player (~13-15k) … does what you say mean that W would in any case play the Ko fight to the end before resigning?

TIA, Tom

_________________
“The only difference between me and a madman is that I’m not mad.” — Salvador Dali ★ Play a slooooow correspondence game with me on OGS? :)

Top
 Profile  
 
Offline
 Post subject: Re: How to properly compile a Go database - a case study
Post #3 Posted: Sat Jul 19, 2014 11:56 am 
Lives in sente

Posts: 928
Location: UK
Liked others: 72
Was liked: 480
Rank: 5 dan
KGS: macelee
Bonobo wrote:
Forgive me, please, you are a MUCH stronger player than I am, but I don’t understand this:
macelee wrote:
[..]

The second version [..] As we can see, his fix was not correct either, because in the end there's a big ko at H6 which is not settled. [..]
What I don’t understand is this: why should the unsettled Ko be an indication for incorrect transcription when “B+R”? As said, I am a weak player (~13-15k) … does what you say mean that W would in any case play the Ko fight to the end before resigning?

TIA, Tom


Hi Tom, the ko at H6 is about the life/death of 7 stones, having a value of at least 19 points. This is significantly larger than the yose moves they played (for example move 235 only has a value of 2 points). So clearly the ko should not be there.


This post by macelee was liked by: Bonobo
Top
 Profile  
 
Display posts from previous:  Sort by  
Post new topic Reply to topic  [ 3 posts ] 

All times are UTC - 8 hours [ DST ]


Who is online

Users browsing this forum: No registered users and 1 guest


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot post attachments in this forum

Search for:
Jump to:  
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group