[computer-go] Explanation to MoGo paper wanted.
Yamato
yamato_cg at yahoo.co.jp
Tue Jul 3 03:31:38 PDT 2007
>I have the feeling that the paper is important, but it is completly
>obfuscated by the strange reinforcement learning notation and jargon. Can
>anyone explain it in Go-programming words?
The most important thing in the paper is how to combine RAVE(AMAF)
information with normal UCT. Like this:
uct_value = child->GetUctValue();
rave_value = child->GetRaveValue();
beta = sqrt(K / (3 * node->visits + K));
uct_rave = beta * rave_value + (1 - beta) * uct_value;
You do not always have to understand RLGO - they don't use it in the
online version of MoGo.
>It was pointed out by Donald Knuth in his paper on Alpha-Beta, that the -
>simple - algorithm was not understood for a long time, because of the
>inappropriate mathematical notation. For recursive functions, (pseudo-)code
>is much better suited than the mathematical notation. Actually its
>pseudo-mathematic notation.
>Why is this inappropriate notation still used?
I agree that the pseudo-code is easy to understand.
--
Yamato
More information about the computer-go
mailing list