[computer-go] Greedy search vs UCT
Magnus Persson
magnus.persson at phmp.se
Thu Apr 24 08:27:42 PDT 2008
I have checked if there is a difference for Valkyria in using
confidence bounds or just greedily search the move with the highest
winrate. This is Valkyria 3.2.0 using 512 simulations per move against
GnuGo 3.7.10.
UCT_K Winrate SERR
0 58.8 2.2 (greedy)
0.01 56.8 2.2
0.1 60.9 2.2
0.5 54.2 2.2
1 50.6 2.2
As you can see up to uct_k = 0.1, the winrate aginst gnugo is more or
less constant (500 games was played for each value of uct_k) and then
it declines.
So although 0.1 was best I cannot claim that it is better than a plain
greedy search.
I will repeat this using 4 times as many simulations per move. The
search sensitivity to uct_k may depend on how deep the tree is
searched.
-Magnus
More information about the computer-go
mailing list