[computer-go] Rapid action value estimation

Christoph Birk birk at ociw.edu
Fri Nov 2 14:40:17 PDT 2007


On Fri, 2 Nov 2007, Benjamin Teuber wrote:
> I don't think there's something different at different depths in the tree..
> To update RAVE after a simulation, for each child of a node you visited
> during that simulation, you update if the move leading to the child was
> played later (until the end of the playout).
> Then, always when you calculate the UCT value, you combine that with the
> RAVE value with that weighted average formula to give the final score.
> Of course, you need to be careful with signs :-)

That means you have one global 'RAVE' table?
Or one at each node in the UCT tree?

Christoph


More information about the computer-go mailing list