[computer-go] Heuristics for MC/UCT with all-or-nothing payouts
Brian Slesinsky
brian at slesinsky.org
Sun Jun 10 10:52:56 PDT 2007
With repeat-winners, if there is a move is seems flawless at first but
some flaw is eventually found, there might be a rough transition once
the flaw is identified, since there is no backup plan. It might make
more sense to study two apparently flawless children equally until a
flaw is found in one of them, and then to look for a new backup?
Or perhaps exploring every child once is very cheap, so avoiding it
doesn't save much time?
- Brian
More information about the computer-go
mailing list