More questions about supporting Daoqi in KataGo
Posted: Fri Jun 17, 2022 6:53 am
I've been training Daoqi model from my KataGo fork/branch. It seems working fine. However I noticed a few issues:
In some games, I noticed suicide moves are generated in some games. multiStoneSuicideLegals is set to "false,true". Not sure what that means but I suppose it won't allow suicide. Could this be an issue or this is normal?
In the middle of some games, pass is generated. Is this result of training? Can it be prevented?
In some games, some stones are added to the board and they are not handicapes. Is this normal?
Can I tweak or change the config file or the model kind and re-run the command to continue? Will there be a compatibility issue with the generated model etc? Which options can be changed and which can not?
Other than what I have done with wrapping around the board, what else should be modified (that is not applicable to Daoqi because of the board difference). It seems there is special logic for mirroring, center of the board etc. I feel they should be handled differently but haven't looked closely yet.
The training plateaued before reaching D level (the new model loses or wins in a small margin, e.g. 50~60% win rate against the old model). What can I do when it plateaued?
References
An introduction to toroidal Go: http://goplayerjuggler.blogspot.com/201 ... al-go.html
My KataGo branch for Daoqi: https://github.com/gcao/KataGo/tree/cao/daoqi
My PR for Daoqi: https://github.com/gcao/KataGo/pull/6/files
My training command: python/selfplay/synchronous_loop.sh DAOQI ../daoqi-opencl daoqi b20c256 1
My self play config: https://github.com/gcao/KataGo/blob/cao ... fplay1.cfg
Model kind used in the training: b20c256
In some games, I noticed suicide moves are generated in some games. multiStoneSuicideLegals is set to "false,true". Not sure what that means but I suppose it won't allow suicide. Could this be an issue or this is normal?
In the middle of some games, pass is generated. Is this result of training? Can it be prevented?
In some games, some stones are added to the board and they are not handicapes. Is this normal?
Can I tweak or change the config file or the model kind and re-run the command to continue? Will there be a compatibility issue with the generated model etc? Which options can be changed and which can not?
Other than what I have done with wrapping around the board, what else should be modified (that is not applicable to Daoqi because of the board difference). It seems there is special logic for mirroring, center of the board etc. I feel they should be handled differently but haven't looked closely yet.
The training plateaued before reaching D level (the new model loses or wins in a small margin, e.g. 50~60% win rate against the old model). What can I do when it plateaued?
References
An introduction to toroidal Go: http://goplayerjuggler.blogspot.com/201 ... al-go.html
My KataGo branch for Daoqi: https://github.com/gcao/KataGo/tree/cao/daoqi
My PR for Daoqi: https://github.com/gcao/KataGo/pull/6/files
My training command: python/selfplay/synchronous_loop.sh DAOQI ../daoqi-opencl daoqi b20c256 1
My self play config: https://github.com/gcao/KataGo/blob/cao ... fplay1.cfg
Model kind used in the training: b20c256