[computer-go] UCT
Peter Drake
drake at lclark.edu
Fri Nov 3 08:16:54 PST 2006
I've decided to bite the bullet and implement UCT in Orego, since (a)
everyone else appears to be using it and (b) not being a
statistician, improving on this is probably not where I'm likely to
make a contribution.
Here's my understanding of UCT. It is an algorithm for choosing the
best move from a given parent node. Let
p = # of runs through parent node
r = # of runs through move in question
w = # of wins through move in question
UCT proposes that we choose the node with the highest sum
average + uncertainty
where
average = w/r
and
uncertainty = sqrt(2 * logn(p) / r)
Is this correct? Are there any things to watch out for?
Also, are people pruning any move for which (average + uncertainty)
is less than the (average - uncertainty) of some other move?
Peter Drake
Assistant Professor of Computer Science
Lewis & Clark College
http://www.lclark.edu/~drake/
More information about the computer-go
mailing list