[computer-go] Heuristics for MC/UCT with all-or-nothing payouts
Peter Drake
drake at lclark.edu
Sun Jun 10 20:31:02 PDT 2007
On Jun 10, 2007, at 5:29 PM, Eric Boesch wrote:
> As for experiencing limited improvements by tweaking the formulas, it
> would hardly be surprising if there are limits to how much an MC/UCT
> (or BAST or whatever) type program can be improved by just swapping
> out one formula and replacing it with another one using the same
> variables, since the formulas are pretty good to begin with, and the
> more glaring weaknesses of vanilla MC/UCT, which the strongest MC
> programs already address in other ways (though I think there is still
> room for improvement in their search localization in particular, not
> that I have any clear notion of how the improvement could be achieved
> without jeopardizing the programs' existing strengths), can't be fixed
> by just tweaking formulas.
I agree -- the big gains will come from search localization, breaking
the game into semi-independent subproblems, or goal-oriented search.
(Maybe these are three names for the same thing.) Of course, I don't
know how to do it yet...
Peter Drake
http://www.lclark.edu/~drake/
More information about the computer-go
mailing list