[computer-go] Rapid action value estimation
Christoph Birk
birk at ociw.edu
Wed Nov 7 14:34:31 PST 2007
On Mon, 5 Nov 2007, Jason House wrote:
> I implemented this yesterday. In doing so, I realized I didn't know the
> proper way to initialize new leaves in the UCT tree. MoGo papers seem to
> talk about a progression from always picking an unexplored leaf (AKA using
> infinity for the upper confidence bound), to "first play urgency" (using a
> fixed ucb for new leaves), to using patterns.
What did you decide on?
What is the difference between 'hb-678-UCTRAVE-10k' and 'hb-675-UCT-10k'.
Thanks,
Christoph
More information about the computer-go
mailing list