[computer-go] Optimal explore rates for plain UCT

Christoph Birk birk at ociw.edu
Mon Mar 10 15:40:53 PDT 2008


On Mon, 10 Mar 2008, Petr Baudis wrote:
>  With 110k playouts per move and no domain knowledge in the playouts,
> the ratings are now:
>
> 	c=0.2  (pachi1-p0.2-light)	ELO 1627 (285 games)
> 	c=1.0  (pachi1-p1.0-light)	ELO 1590 (120 games)
> 	c=0.05 (pachi1-p0.05-light)	ELO 1531 (286 games)
> 	c=2.0  (pachi1-p2.0-light)	ELO 1511 (118 games)

I have two "light" UCT bots on CGOS:
Name              #playouts         c (*)         CGOS-ELO
myCtest-V-0003    50000             0.25          1508
myCtest-10k-UCT   10000             0.25          1246

(*): I use c=0.5 outside the sqrt(), which corresponds to c=0.25
     inside it (0.5^2 = 0.25).
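
To make the footnote concrete, here is a minimal sketch of the two
conventions, assuming the usual UCB1-style term sqrt(ln(N) / n). The
function names and arguments are hypothetical and not taken from either
bot; the exact formula each program uses may differ.

```python
import math

def ucb_c_outside(mean, parent_visits, child_visits, c_out):
    """Exploration constant multiplies the square root (assumed form)."""
    return mean + c_out * math.sqrt(math.log(parent_visits) / child_visits)

def ucb_c_inside(mean, parent_visits, child_visits, c_in):
    """Exploration constant sits under the square root (assumed form)."""
    return mean + math.sqrt(c_in * math.log(parent_visits) / child_visits)

# c_out = 0.5 outside the sqrt gives the same value as c_in = 0.25 inside,
# because 0.5 * sqrt(x) == sqrt(0.25 * x).
a = ucb_c_outside(0.5, 1000, 50, 0.5)
b = ucb_c_inside(0.5, 1000, 50, 0.25)
print(math.isclose(a, b))
```

When comparing explore rates between programs it helps to state which
of the two conventions a given c refers to.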

What is your 'create-new-node' threshold? I use 50.
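
By 'create-new-node' threshold I mean the number of playouts that must
pass through a leaf before it is expanded into the tree. A rough sketch
of the idea follows; the class, the method name and the use of ">=" are
assumptions for illustration, not code from my bot.

```python
class Node:
    """Hypothetical tree node that is expanded only after enough visits."""

    def __init__(self):
        self.visits = 0
        self.children = None  # None means "not expanded yet"

    def record_visit(self, threshold=50):
        """Count one playout; expand once the visit count reaches threshold.

        Returns True only on the visit that triggers the expansion.
        """
        self.visits += 1
        if self.children is None and self.visits >= threshold:
            # A real bot would create one child per legal move here.
            self.children = []
            return True
        return False

node = Node()
expanded_at = None
for playout in range(1, 101):
    if node.record_visit(threshold=50):
        expanded_at = playout
print(expanded_at)
```

A higher threshold keeps the tree smaller; a lower one grows it faster
for the same number of playouts.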

Christoph


