[computer-go] Lazy Evaluation in Monte Carlo/Alpha Beta Search
for Viking4
Rémi Coulom
Remi.Coulom at univ-lille3.fr
Sun Jul 23 07:05:20 PDT 2006
Don Dailey wrote:
> I wasn't very clear about one of these points (and Remi just posted a
> clearer explanation of the same technique I use.)
My idea was maybe not quite the same as yours. From what I understand of
your message, you decide about pruning based on the fudged value
directly. My idea was to use the fudged value as an estimate of the move
value, and still use alpha * sqrt(sigma²/N) as the size of the
confidence interval around the fudged value.
I believe your approach is dangerous, because your confidence intervals
shrink as 1/N. They should shrink as 1/sqrt(N). Your idea might work in
practice, but it does not look very consistant with theory. My idea of a
fudge can be justified in the Bayesian framework in terms of a "safe
prior". It applies to the estimation of the value, not the confidence
bounds.
Rémi
More information about the computer-go
mailing list