Videos

AlphaZero: DeepMind's New Chess AI | Two Minute Papers #216



Two Minute Papers

The paper “Mastering Chess and Shogi by Self-Play with a
General Reinforcement Learning Algorithm” is available here:
https://arxiv.org/pdf/1712.01815.pdf

Our Patreon page with the details:
https://www.patreon.com/TwoMinutePapers

One-time payments:
PayPal: https://www.paypal.me/TwoMinutePapers
Bitcoin: 13hhmJnLEzwXgmgJN7RB6bWVdT7WkrFAHh
Ethereum: 0x002BB163DfE89B7aD0712846F1a1E53ba6136b5A

Recommendations:
https://www.youtube.com/watch?v=akgalUq5vew
https://www.youtube.com/watch?v=0g9SlVdv1PY
https://www.youtube.com/watch?v=Ud8F-cNsa-k
https://www.chess.com/news/view/google-s-alphazero-destroys-stockfish-in-100-game-match
http://forum.computerschach.de/cgi-bin/mwf/topic_show.pl?tid=9653

We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
Andrew Melnychuk, Brian Gilman, Christian Ahlin, Christoph Jadanowski, Dave Rushton-Smith, Dennis Abts, Emmanuel, Eric Haddad, Esa Turkulainen, Evan Breznyik, Frank Goertzen, Kaben Gabriel Nanlohy, Malek Cellier, Marten Rauschenberg, Michael Albrecht, Michael Jensen, Michael Orenstein, Raul Araújo da Silva, Robin Graham, Steef, Steve Messina, Sunil Kim, Torsten Reil.
https://www.patreon.com/TwoMinutePapers

Credits:
Elo ratings: https://ratings.fide.com/top.phtml?list=men
Magnus image source: https://www.youtube.com/watch?v=eLaOeXCAPbU
400 point difference rule: https://www.fide.com/fide/handbook.html?id=172&view=article ctrl+f 400
One chess match source: https://chess24.com/en/watch/live-tournaments/alphazero-vs-stockfish/1/1/1
Stockfish: https://stockfishchess.org/
Thumbnail background image credit: https://pixabay.com/photo-1483735/

Music: Antarctica by Audionautix is licensed under a Creative Commons Attribution license (https://creativecommons.org/licenses/by/4.0/)
Artist: http://audionautix.com/

Splash screen/thumbnail design: Felícia Fehér – http://felicia.hu

Károly Zsolnai-Fehér’s links:
Facebook: https://www.facebook.com/TwoMinutePapers/
Twitter: https://twitter.com/karoly_zsolnai
Web: https://cg.tuwien.ac.at/~zsolnai/

Source

Similar Posts

28 thoughts on “AlphaZero: DeepMind's New Chess AI | Two Minute Papers #216
  1. You listed the most boring differences, what about the change from a pool of players to a single player? The fact that the algorithm got better when it was simplified is the biggest kicker for me

  2. Very interesting, thanks.  Nice to hear Jerry and Daniel getting a shoutout, I have enjoyed their analysis of the AlphaZero games, I'll check out the other guy now.  🙂

  3. The Alpha Zero beat the Stockfish 64% that is 101 points 3300+101=3401 75% score is 190 elo points 95% is 463 points and 99% is 656 elo points.3300+656=3956 Elo points.Over 4000 strong computer then the game of chess is solved.

  4. Well Carlsen's blindfold simul was impressive because it was also timed. But unfortunately the people moving his pieces for him and calling out his opponents' moves were utterly incompetent, often neglecting to call out a move for minutes after they were made, and a few times not even moving Magnus's pieces at all when Magnus gave his replies! And he still won most of the games! 

    But blindfold similes normally are not that hard for a strong player, Carlsen could play 40-50 people blindfolded if he wanted to.

  5. Taking nothing away from alphazero it wasn't playing the strongest version of stockfish and stockfish was handicapped quite fundamentally(hardware/database and time constraints). Id like to see a proper both sides happy rematch.

  6. So do they try to allow the AIphaZero to self learning for two months non-stop, and come back to try again with the highest setting + hardware of stockfish? I think they need to do this in order to prove that after two months continuously of self learning this AI will become much much more powerful than the 4 hours version, and can beat stockfish in every game they play.

  7. The Queen's Gambit graph is odd. The record is 1/47/2 after 1. d4 d5, significantly worse than after 1… Nf6. Still, AlphaZero chooses to play 1… d5 against itself ~10% of the time! Would love to see its record against itself from these positions.

  8. I don't know why this is stated wrongly everywhere, even by someone who is supposed to have read the paper. AlphaZero got to the level of Stockfish after 4 hours of training (so you might say he "mastered" it in just 4 hours), but the actual 100 Game match happened after AlphaZero had 8 hours of training. Granted this doesn't change the magnitude of this success, but it's annoying to hear the emphasis on "4 hours" all the time when it's simply wrong.

  9. I so want a version of alpha zero that starts out as a beginner on hardware than I can afford and then play it forever for fun. I've always wanted a computer that might play like a human and not a super processing juggernaut.

  10. You mentioned ChessNetwork. His video analyses of the Alpha Zero/Stockfish chess games are terrific! Check them out on YouTube, anyone who's interested in learning more.

  11. Arguing about time settings and databases misses the point in my opinion. The AI beat stockfish as black three times. That’s really all that needs to be said. From our perspective they could as well be two gods duking it out on the board.

  12. Alpha Zero progress in chess skyrocketted and then the learning curve flattened after a few hours. It hit some sort of ceiling, yet it still betters every once in a while. Like Bruce Lee said, there are no limits, only plateaus.

Comments are closed.

WP2Social Auto Publish Powered By : XYZScripts.com