[computer-go] RAVE in MoGo paper
Yamato
yamato_cg at yahoo.co.jp
Mon Oct 8 17:57:38 PDT 2007
>I'm wondering whether the formula to determine the balance between RAVE and
>UCT,
>beta = sqrt(c / 3 * parentVisits + c),
>has any mathematical background - or is it just a best guess for something
>that starts at 1 and is 1/2 after a certain number of visits?
I guess it is simply a kind of parameter tuning.
At least the constant number 3 is meaningless in the formula - we
can use the following formula with c2 = c/3.
beta = sqrt(c2 / (parentVisits + c2))
>Another question is about the prior integration. Apparently the prior, RAVE
>and UCT values are three different estimators for the winning probability. So
>why not use the above formula for prior vs. RAVE balancing, too, instead of
>initializing RAVE with it?
Because the prior values do not change during simulations like RAVE
and UCT values. Of course there might be a more effective integration
method, however we need very long time to find it.
--
Yamato
More information about the computer-go
mailing list