[Computer-go] Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
richard.lorentz at csun.edu
Wed Dec 6 09:50:01 PST 2017
One chess result stood out for me, namely, just how much easier it was
for AlphaZero to win with white (25 wins, 25 draws, 0 losses) rather
than with black (3 wins, 47 draws, 0 losses).
Maybe we should not give up on the idea of White to play and win in chess!
On 12/06/2017 01:24 AM, Hiroshi Yamashita wrote:
> DeepMind makes strongest Chess and Shogi programs with AlphaGo Zero
> Mastering Chess and Shogi by Self-Play with a General Reinforcement
> Learning Algorithm
> AlphaZero(Chess) outperformed Stockfish after 4 hours,
> AlphaZero(Shogi) outperformed elmo after 2 hours.
> Search is MCTS.
> AlphaZero(Chess) searches 80,000 positions/sec.
> Stockfish searches 70,000,000 positions/sec.
> AlphaZero(Shogi) searches 40,000 positions/sec.
> elmo searches 35,000,000 positions/sec.
> Hiroshi Yamashita
> Computer-go mailing list
> Computer-go at computer-go.org
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Computer-go