AlphaZero 1 - Chess 0cognitive-science.info/wp-content/uploads/2018/03/... · 3/29/2018 ·...

AlphaZero 1 - Chess 0How Modern AI is Reshaping Thought

Kesav Viswanadha, UC BerkeleyMarch 29, 2018

A Brief History of Computer Chess

A Brief History of Computer Chess● 1997 - IBM’s Deep Blue defeats

World Champion Garry Kasparov after several attempts

● Early 2000s - Chess engines commercially available

● Late 2000s - Chess engines become consistently stronger than Grandmasters

Where are we now?

Where are we now?● December 2017 - AlphaZero

developed by DeepMind

Where are we now?● December 2017 - AlphaZero

developed by DeepMind

● Crushed leading chess engine Stockfish with 28 wins and 72 draws from 100 games

How Stockfish Works

How Stockfish Works● Previous chess engines relied

on alpha-beta pruning and heuristic evaluation

● Parameters of heuristic evaluation adjusted by hand - trial and error (Demo)

What’s so special about AlphaZero?

What’s so special about AlphaZero?● AlphaZero uses Monte Carlo Tree

Search (MCTS)

What’s so special about AlphaZero?● AlphaZero uses Monte Carlo Tree

Search (MCTS)

● Simulates games and determines probability of winning with a certain move - fundamentally different approach to chess AI

What’s so special about AlphaZero?● Neural network used to “learn” game

● Picks better and better moves by updating probability vector with each iteration of MCTS

What’s so special about AlphaZero?● Neural network used to “learn” game

● Picks better and better moves by updating probability vector with each iteration of MCTS

● Self-reinforcement learning

Advantages of AlphaZero Algorithm

Advantages of AlphaZero Algorithm● Doesn’t require hardcoded opening books or

endgame tablebases, unlike Stockfish

● Extremely efficient - only analyzes 80k positions/second compared to 70 million for Stockfish

● Scalable to other complete information two-player games

What’s Next for AlphaZero?

What’s Next for AlphaZero?● Neural network computations done on

Tensor Processing Unit (TPU) - not commercially available

What’s Next for AlphaZero?● Neural network computations done on

Tensor Processing Unit (TPU) - not commercially available

● AlphaZero not feasible on ordinary computers

Reactions from Top Chess GrandmastersFabiano Caruana: "I was amazed. I don't think any other engine has shown dominance like that. I think it was four hours of learning so who knows what it can do with even more."

Sergey Karjakin: “I will pay very much to get access to this program. Maybe $100,000, today!"

Wesley So: "Chess isn't yet dead; it's pretty inexhaustible. The main problem is that most of the games are the same for the first 12, 15 moves."

Implications for the Future of AI

Implications for the Future of AI● Moving away from blind search

● Neural networks - very close parallel to how humans learn chess

Implications for the Future of AI● Moving away from blind search

● Neural networks - very close parallel to how humans learn chess

● Ever-increasing computational power

Conclusion● AlphaZero revolutionary for both chess and AI

● A result of big changes we have already begun to see with machine learning and neural networks - more closely simulating human thought

● Could have a great impact in the future - next step could be applying this to incomplete information games like poker

Thank you!

Citationshttps://arxiv.org/pdf/1712.01815.pdfhttps://www.chess.com/news/view/alphazero-reactions-from-top-gms-stockfish-authorhttps://prezi.com/disewoli3nvf/the-opportunities-and-risks-of-artificial-intelligence/https://www.theinquirer.net/w-images/bc530547-346c-4a32-9d12-31fcefe002a4/2/TensorProcessingUnitv2Google5903602590360-580x358.jpghttps://cdn-images-1.medium.com/max/1500/1*m2gDBT_nc-iE7R4AM3sHBQ.jpeghttps://i.ytimg.com/vi/RwhEQmq6CaE/maxresdefault.jpghttp://www.computerchess.org.uk/ccrl/4040/http://www.mobygames.com/images/covers/l/159878-fritz-6-windows-front-cover.jpghttps://i0.wp.com/roboticsandautomationnews.com/wp-content/uploads/2017/10/garry-kasparov-deep-blue-ibm.jpg?fit=1024%2C658&ssl=1

AlphaZero 1 - Chess 0cognitive-science.info/wp-content/uploads/2018/03/... · 3/29/2018 ·...

Documents

Transcript of AlphaZero 1 - Chess 0cognitive-science.info/wp-content/uploads/2018/03/... · 3/29/2018 ·...

Reinforcement Learning and Optimal Control and Rollout ...web.mit.edu/dimitrib/www/RL_1_ROLLOUT_PRINT2.pdfto play very quickly. In fact, AlphaZero learned how to play chess better

Chess FIDE Tournament - All India Chess Federationaicf.in/wp-content/uploads/2017/03/1st-ICS-MCA_FIDE_Rated-Chess-FINAL.pdf · Dindigul (Indian) Chess School & Mount Chess Academy

AlphaZero The new Chess King - UiO

cdn.chess24.com...Assessing Game Balance with AlphaZero: Exploring Alternative Rule Sets in Chess Nenad Tomašev * DeepMind Ulrich Paquet DeepMind Demis Hassabis DeepMind Vladimir

Picasa - TippechessSolitaire Chess . Solitaire Chess Solitaire Chess Solitaire Chess Solitaire Chess . Title: Picasa Author: Shamrock Created Date: 11/19/2011 7:38:48 PM ...

AlphaZero to Alpha Hero - DiVA portal1350740/FULLTEXT01.pdfDEGREE PROJECT IN COMPUTER ENGINEERING, FIRST CYCLE, 15 CREDITS STOCKHOLM , SWEDEN 2019 AlphaZero to Alpha Hero A pre-study

AlphaGo Zero & AlphaZero Mastering Go, Chess and Shogi ... · AlphaGo Zero is the Google Deepmind’s successor to AlphaGo [11]. It falls into the category of deep learning augmented

Winning Chess Combinations - bayanbox.irbayanbox.ir/view/4195430655845355526/Winning-Chess-Combination.… · WINNING CHESS COMBINATIONS This preliminary work would feature chess

Chess-Glossary of Chess terms

chess horizons chess horizons chess horizons chess horizons - The

LEONTXO GARCÍA - londonchessconference.com · musical (chess) 7. bodily-kinesthetic (chess) 8. naturalistic? chess

Chess Detective - Kriegspiel Strategies, Endgames and …avniermeni.ch/libri/Chess Detective - Kriegspiel Strategies... · Chess Kriegspiel. 2. Chess Endgame 3. Chess Problems ...

San José, CA – September, 2004 Localizing with XLIFF and ICU Markus Scherer markus.scherer@us.ibm.com Raghuram (Ram) Viswanadha ramv@us.ibm.com IBM San.

Chess Training & Event Management Chess Promoters Group€¦ · · 2013-07-13Chess Training & Event Management Chess Promoters Group ... 2.2 Chess Club for Chess practice ... Learn

POSTAL CHESS - Chess Scotland

AlphaZero The new Chess King · Complexity of a Chess Game •20 possible start moves, 20 possible replies, etc. •400 possible positions after 2 ply (half moves) •197 281 positions

Jeroen Bosch - Interview: Is AlphaZero een Game Changer? · 2019. 5. 8. · Game Changer: AlphaZero's Groundbreaking Chess Strategies and the Promise of Al New In Chess 2019 Prijs

Gauteng Schools Chess 2018 Rules & Regulations Gauteng Schools Chess... · 4.1 Secondary School for ... Chess Federations: Tshwane Chess, JHB Metro Chess, Ekhuruleni Chess and Sedibeng

Excalibur Electronics - Chess & Electronic Chess

CHESS Replacement Project · 2020. 6. 29. · ASX CHESS Replacement CHESS ACCESS and CHESS PC > The ASX currently provide two solutions to support CHESS connectivity, CHESS Access