> If you had a choice between a 1% 65,000-wins move and a 70% 7-wins move,
> MCTS will keep exploring the 70% move, until it either reaches 65,001
> wins, and can be chosen, or the winning percentage comes down to 1% also.
> BTW, that implies it would be very difficult to ever reach the situation
> you describe, as 1% win rate moves wouldn't be given 650,000 trials
> (unless all other moves on the board are equally bad, i.e. the game is
> clearly lost).

What I don't understand is why the variation that's trying to catch up has
to overtake the leader outright.
Shouldn't there be a substantial bonus for a late high success rate?
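
To make the question concrete, here is a rough Python sketch of the two
selection rules I have in mind. It is only an illustration under my own
assumptions, not how any particular program does it: the visit counts
(6,500,000 for the 1% move, 10 for the 70% move) are numbers I picked to
match the quoted percentages, and the lower-confidence-bound rule and all
function names are hypothetical.

```python
import math

def pick_by_wins(moves):
    """Rule described in the quoted text: choose the move with the most wins."""
    return max(moves, key=lambda m: moves[m][0])

def lower_bound(wins, visits, z=1.96):
    """Normal-approximation lower confidence bound on the win rate.

    Hypothetical alternative: it rewards a high win rate but discounts
    moves that have only a few playouts behind them.
    """
    if visits == 0:
        return 0.0
    rate = wins / visits
    return rate - z * math.sqrt(rate * (1.0 - rate) / visits)

def pick_by_lower_bound(moves):
    """Choose the move whose win rate has the highest lower bound."""
    return max(moves, key=lambda m: lower_bound(*moves[m]))

def playouts_to_overtake(leader_wins, wins, rate):
    """Extra playouts the challenger needs, at a steady win rate,
    to reach leader_wins + 1 wins."""
    return math.ceil((leader_wins + 1 - wins) / rate)

# (wins, visits); the visit counts are assumed, see above.
moves = {
    "A": (65_000, 6_500_000),  # 1% win rate, heavily explored
    "B": (7, 10),              # 70% win rate, barely explored
}

print(pick_by_wins(moves))                    # A
print(pick_by_lower_bound(moves))             # B
print(playouts_to_overtake(65_000, 7, 0.7))   # 92849
```

Under the most-wins rule, B would need on the order of 93,000 more
playouts at a steady 70% before it could be chosen, while a rule that
looks at the win rate with some allowance for sample size would already
prefer it.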
