[computer-go] Lazy Evaluation in Monte Carlo/Alpha Beta Search
for Viking4
Rémi Coulom
Remi.Coulom at univ-lille3.fr
Sun Jul 23 09:05:48 PDT 2006
Magnus Persson wrote:
> Take this hypothetical scenario for Viking4 playing 19x19 searching 1
> ply with
> 20 candidate moves all of almost equal value followed by one critical
> move that
> is superior. 1/N might here reject several moves that might be evaluated
> slightly better then the current best but may search more moves and then
> find
> the critical move in time that the more picky 1/sqrt(N) would not find
> in time.
>
>
If I wanted to be more selective, I would still use the 1/sqrt(N)
formula, but with a lower level of confidence in the confidence
intervals (ie 90% instead of 95%, for instance). That would be more
coherent with theory.
Also, when you have a bunch of moves to evaluate at 1 ply, it might not
be efficient to evaluate them sequentially. Maybe you should evaluate
them all at the same time, by allocating a random simulation to the move
with the highest upper bound (which should also be the move with the
highest lower bound if intervals are symmetrical). When the alpha-beta
window is wide open, this approach is certainly better than considering
the moves in sequence.
Rémi
More information about the computer-go
mailing list