[computer-go] Some ideas how to make strong heavy playouts

Mark Boon tesujisoftware at gmail.com
Wed Apr 2 07:23:16 PDT 2008


On 1-apr-08, at 17:37, Don Dailey wrote:
> That's partly why I'm interested in exploring "on the fly" leaning.
> Learning outside the context of the position being played may not have
> much relevance.

That would be most interesting indeed. I'd like to try but keep  
running into obstacles.

For example: at the moment I have just a handful of patterns. These  
patterns are important or 'urgent' if you like and they are already  
enough to overcome the slow-down caused by pattern-matching. At the  
moment I play a pattern randomly without distinction between them.

If I want to make anything 'learning' then I have to harvest patterns  
and somehow compute their importance / urgency. There are multiple  
ways to do that and Remi wrote a paper about one of them. At the  
moment I use the average length a pattern is on the board. Urgent  
patterns remain on the board for only a short while. Whether this is  
better or worse than Remi's way I don't know.

So I have now run the program for a few hundred games, adjusting the  
urgency of the patterns on a continuing basis. And the program got  
weaker! Selecting one at random is apparently superior to selecting a  
pattern based on urgency. This is why I started out with just a few  
patterns because it's easer to see what's happening. When I look at  
the urgencies computed they actually look very reasonable. So that's  
not the problem. When I think about what could cause this, the only  
thing I can imagine is, again, that play becomes more deterministic.

This is a recurring theme in my tests. Apparently it's something  
important that still escapes me and which I have to understand to  
make real progress.

Mark



-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://computer-go.org/pipermail/computer-go/attachments/20080402/75d827d0/attachment.htm


More information about the computer-go mailing list