[computer-go] Optimal explore rates for plain UCT
Christoph Birk
birk at ociw.edu
Mon Mar 10 15:40:53 PDT 2008
On Mon, 10 Mar 2008, Petr Baudis wrote:
> With 110k playouts per move and no domain knowledge in the playouts,
> the ratings are now:
>
> c=0.2 (pachi1-p0.2-light) ELO 1627 (285 games)
> c=1.0 (pachi1-p1.0-light) ELO 1590 (120 games)
> c=0.05 (pachi1-p0.05-light) ELO 1531 (286 games)
> c=2.0 (pachi1-p2.0-light) ELO 1511 (118 games)
I have two "light" UCT bots on CGOS:
Name              #playouts   c (*)   CGOS-ELO
myCtest-V-0003        50000    0.25       1508
myCtest-10k-UCT       10000    0.25       1246
(*): I use c=0.5 outside the sqrt()
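
One way to read the footnote is that the c=0.25 in the table is the value inside the square root, which gives the same exploration term as 0.5 outside it. A minimal Python sketch of that equivalence, assuming the usual UCB1 form (mean win rate plus an exploration term); the function names are hypothetical and not taken from either program:

```python
import math

def ucb_c_inside(wins, visits, parent_visits, c=0.25):
    # Exploration constant inside the square root
    # (assumed convention for the c column in the table).
    return wins / visits + math.sqrt(c * math.log(parent_visits) / visits)

def ucb_c_outside(wins, visits, parent_visits, c=0.5):
    # Exploration constant as a multiplier outside the square root.
    return wins / visits + c * math.sqrt(math.log(parent_visits) / visits)
```

With these defaults the two functions return the same value for any child with at least one visit, since sqrt(0.25) = 0.5. When comparing c values between programs, it matters which convention each one uses.
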
What is your 'create-new-node' threshold? I use 50.
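
To make the question concrete, here is a rough Python sketch of what such a threshold might look like. It assumes the threshold counts visits to a leaf before its children are created; the exact semantics in either program are not stated here, and the names Node, visit_leaf and EXPAND_THRESHOLD are hypothetical:

```python
EXPAND_THRESHOLD = 50  # value quoted above; its exact meaning is an assumption

class Node:
    def __init__(self):
        self.visits = 0
        self.children = {}  # move -> Node, empty until the node is expanded

def visit_leaf(node, legal_moves, threshold=EXPAND_THRESHOLD):
    """Record one playout through a node and expand it once it has
    been visited `threshold` times. Returns True if the node has
    children after this visit."""
    node.visits += 1
    if not node.children and node.visits >= threshold:
        node.children = {move: Node() for move in legal_moves}
    return bool(node.children)
```

A higher threshold keeps the tree smaller and saves memory, at the cost of running more playouts from a node before its children are distinguished.
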
Christoph