[computer-go] Explanation to MoGo paper wanted.

Yamato yamato_cg at yahoo.co.jp
Tue Jul 3 03:31:38 PDT 2007


>I have the feeling that the paper is important, but it is completly 
>obfuscated by the strange reinforcement learning notation and jargon. Can 
>anyone explain it in Go-programming words?

The most important thing in the paper is how to combine RAVE(AMAF)
information with normal UCT. Like this:

  uct_value = child->GetUctValue();
  rave_value = child->GetRaveValue();
  beta = sqrt(K / (3 * node->visits + K));
  uct_rave = beta * rave_value + (1 - beta) * uct_value;

You do not always have to understand RLGO - they don't use it in the
online version of MoGo.

>It was pointed out by Donald Knuth in his paper on Alpha-Beta, that the - 
>simple - algorithm was not understood for a long time, because of the 
>inappropriate mathematical notation. For recursive functions, (pseudo-)code 
>is much better suited than the mathematical notation. Actually its 
>pseudo-mathematic notation.
>Why is this inappropriate notation still used?

I agree that the pseudo-code is easy to understand.

--
Yamato


More information about the computer-go mailing list