[Computer-go] Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
xavier.combelle at gmail.com
Wed Dec 6 10:48:53 PST 2017
Another result is that chess is really drawish, unlike shogi.
On 06/12/2017 at 18:50, Richard Lorentz wrote:
> One chess result stood out for me, namely, just how much easier it was
> for AlphaZero to win with white (25 wins, 25 draws, 0 losses) rather
> than with black (3 wins, 47 draws, 0 losses).
> Maybe we should not give up on the idea of White to play and win in chess!
> On 12/06/2017 01:24 AM, Hiroshi Yamashita wrote:
>> DeepMind has made the strongest chess and shogi programs using the AlphaGo Zero approach:
>> Mastering Chess and Shogi by Self-Play with a General Reinforcement
>> Learning Algorithm
>> AlphaZero(Chess) outperformed Stockfish after 4 hours,
>> AlphaZero(Shogi) outperformed elmo after 2 hours.
>> Search is MCTS.
>> AlphaZero(Chess) searches 80,000 positions/sec.
>> Stockfish searches 70,000,000 positions/sec.
>> AlphaZero(Shogi) searches 40,000 positions/sec.
>> elmo searches 35,000,000 positions/sec.
>> Hiroshi Yamashita
>> Computer-go mailing list
>> Computer-go at computer-go.org
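For reference, the figures quoted in this thread can be turned into percentages and ratios with a few lines of Python. This is only a rough sketch: the `score` helper is a hypothetical name, and it assumes the usual chess convention that a draw counts as half a point. The win/draw/loss counts and the positions/sec numbers are the ones quoted above; nothing else is taken from the paper.

```python
from fractions import Fraction


def score(wins, draws, losses):
    """Match score as a fraction of the maximum, counting a draw as half a point (assumed convention)."""
    games = wins + draws + losses
    return Fraction(2 * wins + draws, 2 * games)


# AlphaZero chess results quoted in this thread: (wins, draws, losses).
white = score(25, 25, 0)
black = score(3, 47, 0)

# Share of drawn games over all 100 games.
draw_rate = Fraction(25 + 47, 100)

# Positions searched per second, as quoted in this thread.
chess_ratio = 70_000_000 / 80_000   # Stockfish vs AlphaZero (chess)
shogi_ratio = 35_000_000 / 40_000   # elmo vs AlphaZero (shogi)

print(f"score with white: {float(white):.1%}")
print(f"score with black: {float(black):.1%}")
print(f"draw rate:        {float(draw_rate):.1%}")
print(f"chess search-rate ratio: {chess_ratio:.0f}x")
print(f"shogi search-rate ratio: {shogi_ratio:.0f}x")
```

With these numbers, AlphaZero scores 75% with white and 53% with black, 72% of the chess games are draws, and in both chess and shogi the conventional engine searches 875 times as many positions per second.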