[computer-go] Standard references on CGOS

Jason House jason.james.house at gmail.com
Fri Nov 2 06:34:50 PDT 2007


On 10/29/07, Christoph Birk <birk at ociw.edu> wrote:
>
> On Oct 29, 2007, at 8:39 AM, Jason House wrote:
> > For all of us in the bot-making kiddie pool, it's exceptionally
> > helpful to have reference implementations of basic algorithms
> > running on the server.  When playing with AMAF, I found the
> > reference AMAF bots very helpful.  Now that I'm playing with UCT,
> > references for UCT would be helpful.
>
> 'myCtest-V-0003' is running 50k UCT. Pure random playouts guided
> by a UCT search with theses parameters:
>   # playouts before expanding = 50
>   node-score = win_ratio + 0.5 * sqrt(log(N)/n);
>
> I will start it under the nam 'myCtest-50k-UCT' later today running
> 24/7.


I think I've gotten my big UCT bugs worked out.  Thanks a lot for the
reference.
For any who are interested, hb-672-UCT has the following configuration:

# playouts per move = variable (should be in the ballpark of 10k)

# playouts before expanding = 10
node-score = win_ratio + tuned_standard_deviation * sqrt(0.8*log(N)/n);
tuned_standard_deviation = sqrt(min(0.25
,win_ratio*(1-win_ratio)+sqrt(2*ln(N)/n)))

The 0.8 factor is carry over from initially following
http://senseis.xmp.net/?UCT
The tuning was based on http://hal.inria.fr/inria-00117266 and is supposedly
superior to a flat 0.5


I'll likely try variants to better match Ctest:
* No 0.8 factor
* 50 playouts before expansion
* No tuning
* True 10k simulations per move
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://computer-go.org/pipermail/computer-go/attachments/20071102/7987c6f5/attachment.htm


More information about the computer-go mailing list