[computer-go] UCT finds the right answer, but...
Peter Drake
drake at lclark.edu
Tue Nov 7 08:42:44 PST 2006
Consider this position:
...wBw...
.w.wBwwww
...wBBBBB
wwwwB....
BBBBBBBBB
wwwwBwwww
w..wBw...
w..wBw.w.
w.wwBw...
This is black to play, no komi.
As I read it (correct me if I'm wrong!), the white groups at upper
left and lower right can't be killed (assuming white defends them).
Black can kill either of the other groups, and white can respond by
saving the remaining one. Since killing the upper right white group
is not big enough, black's only winning move is to kill the lower
left group by playing at b2.
Orego (now using UCT) quickly finds the correct answer, but the
estimates of the probability of winning are strange. Here's a graph:

The probability of winning by starting at b2 is greater than the
probability starting elsewhere, but shouldn't it approach 1.0, since
b2 is a winning move? Do others get this same behavior? Does anyone
have an explanation?
For what it's worth, here are the probabilities and <black wins> /
<total runs> through each move:
B1 (0.262654 = 1510/5749):
G1 (0.269305 = 2504/9298):
H1 (0.276537 = 5200/18804):
J1 (0.263229 = 1567/5953):
*B2 (0.290454 = 134822/464177):
C2 (0.288902 = 69762/241473):
G2 (0.274577 = 4156/15136):
J2 (0.276275 = 5034/18221):
B3 (0.261835 = 1427/5450):
C3 (0.269523 = 2554/9476):
G3 (0.269982 = 2655/9834):
H3 (0.27404 = 3927/14330):
J3 (0.273359 = 3660/13389):
F6 (0.25266 = 831/3289):
G6 (0.268461 = 2334/8694):
H6 (0.273485 = 3706/13551):
J6 (0.247165 = 632/2557):
A7 (0.269866 = 2632/9753):
B7 (0.274571 = 4157/15140):
C7 (0.268814 = 2404/8943):
A8 (0.275441 = 4577/16617):
C8 (0.277177 = 5617/20265):
A9 (0.259601 = 1237/4765):
B9 (0.27903 = 7180/25732):
C9 (0.268422 = 2324/8658):
G9 (0.257561 = 1090/4232):
H9 (0.277021 = 5513/19901):
J9 (0.262094 = 1452/5540):
PASS (0.221808 = 238/1073):
Peter Drake
Assistant Professor of Computer Science
Lewis & Clark College
http://www.lclark.edu/~drake/
-------------- next part --------------
Skipped content of type multipart/related
More information about the computer-go
mailing list