[Computer-go] AlphaGo MCTS & Reinforcement Learning?

Greg Schmidt gschmidt958 at yahoo.com
Sun Jan 31 07:20:16 PST 2016


The articles I've read so far about AlphaGo mention both MCTS and RL/Q-Learning.  Since MCTS (and certainly UCT) keeps statistics on wins and propagates that information up the tree, that in and of itself would seem to constitute RL, so how does it make sense to have both?  It seems redundant to me.  Any thoughts on that?
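
For concreteness, here is a minimal sketch of the bookkeeping the question refers to: each node keeps win and visit counts, and a playout result is propagated from the leaf back up to the root. The names (Node, backpropagate, ucb1) and the exploration constant are illustrative assumptions, not taken from AlphaGo or any particular program.

```python
import math


class Node:
    """Tree node holding the win/visit statistics that UCT maintains (hypothetical layout)."""

    def __init__(self, parent=None):
        self.parent = parent
        self.children = []
        self.visits = 0
        self.wins = 0.0


def backpropagate(leaf, result):
    """Propagate one playout result from a leaf up to the root.

    `result` is 1.0 if the player who moved into `leaf` won the playout,
    0.0 otherwise. It is flipped at each level because the players alternate.
    """
    node = leaf
    while node is not None:
        node.visits += 1
        node.wins += result
        result = 1.0 - result
        node = node.parent


def ucb1(child, c=math.sqrt(2)):
    """UCB1 score used by UCT to pick a child; `c` is an assumed exploration constant."""
    if child.visits == 0:
        return math.inf  # unvisited children are tried first
    mean = child.wins / child.visits  # running average of playout outcomes
    explore = c * math.sqrt(math.log(child.parent.visits) / child.visits)
    return mean + explore
```

The ratio wins / visits at each node is a sample average of playout outcomes, which is the sense in which the tree statistics resemble a learned value estimate.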


