[computer-go] RAVE in MoGo paper
Sylvain Gelly
sylvain.gelly at m4x.org
Tue Oct 9 00:20:14 PDT 2007
Hi,
2007/10/8, Benjamin Teuber <Benjamin.Teuber at web.de>:
>
> Hi everybody - especially Sylvain =)
>
> I'm wondering whether the formula to determine the balance between RAVE
> and UCT,
> beta = sqrt(c / 3 * parentVisits + c),
> has any mathematical background - or is it just a best guess for something
> that starts at 1 and is 1/2 after a certain number of visits?
No it is just a tuning.... :)
Another question is about the prior integration. Apparently the prior, RAVE
> and UCT values are three different estimators for the winning probability.
> So why not use the above formula for prior vs. RAVE balancing, too, instead
> of initializing RAVE with it?
>
Our prior is actually classical and equivalent to a Dirichlet prior for the
RAVE value. Of course we could put the prior in other ways, put I strongly
believe that at this point the relevance of the prior is more important that
the way you use it.
Cheers,
Sylvain
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://computer-go.org/pipermail/computer-go/attachments/20071009/56170681/attachment.htm
More information about the computer-go
mailing list