[computer-go] UCT finds the right answer, but...

Peter Drake drake at lclark.edu
Tue Nov 7 12:08:03 PST 2006


On Nov 7, 2006, at 11:10 AM, Magnus Persson wrote:

> Quoting Peter Drake <drake at lclark.edu>:
>
>
>> The probability of winning by starting at b2 is greater than the   
>> probability starting elsewhere, but shouldn't it approach 1.0,  
>> since  b2 is a winning move? Do others get this same behavior?  
>> Does anyone  have an explanation?
>
> The situation is still such that random play can mess things up.  
> Even if black
> is winning UCT search for white will pick those moves which allows  
> black to
> make bad moves deeper in the tree.

I've had similar experiences with other variants of Monte Carlo Go.  
Incidentally, I think this makes ladders particularly difficult.

Thanks, good to know it's not just my program that exhibits this  
behavior.

> What you should do is to print out the winning percentage for the  
> principle
> variation and not only the top move. In valkyria the winning score  
> do not
> change much at the root but following the tree down a winning  
> variation often
> increase the winning percentage 5-10 % at each ply until it reache  
> a point
> where random play always lead to a win.

It doesn't always work out that way, but sometimes it does:

B2:0.292577
H9:0.291222
C2:0.29693
G2:0.292865
G6:0.312761
B1:0.35
J2:0.666667
PASS

The important thing is that UCT (very!) quickly finds the correct move.

Peter Drake
Assistant Professor of Computer Science
Lewis & Clark College
http://www.lclark.edu/~drake/





More information about the computer-go mailing list