ThatSpysASpy t1_iuprye0 wrote on November 2, 2022 at 3:04 AM

Reply to comment by dojoteef in [N] Adversarial Policies Beat Professional-Level Go AIs by xutw21

Oh cool, that does make it more interesting. Do you know whether it was trained to take the rules as an input somehow?

ThatSpysASpy t1_iupr7f5 wrote on November 2, 2022 at 2:58 AM

Reply to comment by dojoteef in [N] Adversarial Policies Beat Professional-Level Go AIs by xutw21

But I don't even think it's a weird edge case! The pass is correct, and Katago wins this game with the maximum score possible. Saying "okay we're actually using this other scoring method which it wasn't designed for" seems pretty vacuous. (Unless I'm wrong and it was in fact trained for this rule set).

ThatSpysASpy t1_iupkljr wrote on November 2, 2022 at 2:07 AM

Reply to [N] Adversarial Policies Beat Professional-Level Go AIs by xutw21

The demonstrations shown in the paper are pretty unconvincing. In ordinary go scoring, dead stones are removed from the board at the end of the game, so the territory which supposedly isn't KataGo's would in fact be counted as its territory.

They say they use Tromp-Taylor rules, which requires all stones to be captured, but I would assume KataGo was trained with more standard human go rules. (Or at least they added some regularizer to make it pass once the value was high enough, otherwise humans playing vs it would get really annoyed).