[computer-go] Rapid action value estimation
Christoph Birk
birk at ociw.edu
Fri Nov 2 14:40:17 PDT 2007
On Fri, 2 Nov 2007, Benjamin Teuber wrote:
> I don't think anything is different at different depths in the tree.
> To update RAVE after a simulation, for each child of a node you visited
> during that simulation, you update the child's RAVE value if the move
> leading to it was played later (at any point up to the end of the playout).
> Then, whenever you calculate the UCT value, you combine it with the
> RAVE value using the weighted-average formula to get the final score.
> Of course, you need to be careful with signs :-)
That means you have one global 'RAVE' table?
Or one at each node in the UCT tree?
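
To make the question concrete, here is a minimal Python sketch of the
second reading (RAVE statistics stored at each node of the UCT tree), which
is how I understand "for each child of a node you visited". Everything in it
is an assumption for illustration, not anyone's actual program: the names
Node, update_rave and blended_value, the constant k, the weighting schedule
beta = sqrt(k / (3n + k)), counting only later moves by the same player, and
counting the move actually chosen at the node as well. The normal UCT
update and the exploration term are left out.

```python
import math

class Node:
    """Tree node. Stats are stored from the view of the player who moved into it."""
    def __init__(self):
        self.children = {}       # move -> Node
        self.visits = 0          # ordinary UCT statistics (updated elsewhere)
        self.wins = 0.0
        self.rave_visits = 0     # RAVE statistics, kept per node (assumption)
        self.rave_wins = 0.0

def update_rave(path, moves, winner):
    """Update RAVE stats after one simulation.

    path   -- visited tree nodes; path[d] is the node reached after d moves
    moves  -- every move of the simulation (tree part + playout), in order
    winner -- 0 if the player to move at the root won, otherwise 1
    """
    for depth, node in enumerate(path):
        player = depth % 2                   # player to move at this node
        own_later = set(moves[depth::2])     # that player's moves from here on
        for move, child in node.children.items():
            if move in own_later:
                child.rave_visits += 1
                # "Careful with signs": score from this player's point of view.
                if winner == player:
                    child.rave_wins += 1.0

def blended_value(child, k=1000.0):
    """Weighted average of the RAVE mean and the ordinary mean (assumed schedule)."""
    mc = child.wins / child.visits if child.visits else 0.5
    rave = child.rave_wins / child.rave_visits if child.rave_visits else 0.5
    beta = math.sqrt(k / (3.0 * child.visits + k))   # 1 at zero visits, decays to 0
    return beta * rave + (1.0 - beta) * mc
```

With a single global table instead, the two rave_* fields would move out of
Node into one dictionary keyed by move (and colour), and update_rave would
no longer need the path at all.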
Christoph