The Implementation of Machine Learning in the Game of Checkers


Billy Melicher

Computer Systems Lab 08-09

Abstract

• Machine learning uses past information to predict future states
• Can be used in any situation where the past will predict the future
• Will adapt to situations

Introduction

• Checkers is used to explore machine learning

• Checkers has many tactical aspects that make it a good domain for study

Background

• Minimax
• Heuristics
• Learning

Minimax

• A method of adversarial search
• Every pattern (board) can be given a fitness value (heuristic)
• Each player chooses the outcome that is best for them from the choices they have
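A minimal sketch of minimax in Python. The helper functions game_over, legal_moves, apply_move, and heuristic are assumed for illustration; they are not from the original presentation.

    def minimax(board, depth, maximizing):
        # Stop at the ply limit or at a finished game and score the board.
        if depth == 0 or game_over(board):
            return heuristic(board)
        # Score every reachable board one ply deeper.
        values = [minimax(apply_move(board, m), depth - 1, not maximizing)
                  for m in legal_moves(board)]
        # Each player picks the outcome that is best for them.
        return max(values) if maximizing else min(values)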

Minimax

[Figure: minimax game tree; chart from Wikipedia]

Minimax

• Has an exponential growth rate
• Can only evaluate a certain number of moves into the future; this depth is called the ply

Heuristic

• Heuristics predict the outcome of a board
• The fitness value of a board: the higher the value, the better the outcome
• Not perfect
• Requires expertise in the situation to create

Heuristics

• H(s) = c0F0(s) + c1F1(s) + … + cnFn(s)

• H(s) is the heuristic value of board s

• Built from many different terms: each Fi(s) is a feature of the board and each ci is its weight

• In checkers, the terms could be (sketched in code after this list):

Number of checkers

Number of kings

Number of checkers on an edge

How far checkers have advanced on the board
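A sketch of that weighted-sum heuristic in Python. The feature functions and the weight values are illustrative assumptions, not the presentation's actual features or learned weights.

    def heuristic(board):
        # F0..F3: illustrative checkers features from the list above.
        features = [num_checkers(board),       # F0: number of checkers
                    num_kings(board),          # F1: number of kings
                    num_edge_checkers(board),  # F2: checkers on an edge
                    total_advancement(board)]  # F3: how far checkers have advanced
        weights = [1.0, 2.5, 0.5, 0.1]         # c0..c3 (assumed values)
        # H(s) = c0*F0(s) + c1*F1(s) + ... + cn*Fn(s)
        return sum(c * f for c, f in zip(weights, features))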

Learning by Rote

• Stores every game played
• Connects the moves made for each board
• Relates the moves made from a particular board to the outcome of the game
• More likely to make moves that resulted in a win, less likely to make moves that resulted in a loss (see the sketch below)
• Good in the end game, not as good in the mid game
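As a sketch, learning by rote can be pictured as a table from each stored board to the moves played from it, with win/loss counts attached. All names here are illustrative assumptions, not the presentation's actual data structures.

    from collections import defaultdict

    # rote[board_key][move] = [wins, losses] observed after that move
    rote = defaultdict(lambda: defaultdict(lambda: [0, 0]))

    def record_game(history, won):
        # history: list of (board_key, move) pairs from one finished game.
        # Relate every move made from every board to the game's outcome.
        for board_key, move in history:
            rote[board_key][move][0 if won else 1] += 1

    def best_known_move(board_key, moves):
        # Favor moves with the best observed win rate from this board;
        # unseen moves stay neutral at 0.5.
        def win_rate(m):
            w, l = rote[board_key][m]
            return w / (w + l) if w + l else 0.5
        return max(moves, key=win_rate)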

How I store data

I convert each checker board into a 32-digit base-5 number, where each digit corresponds to a playable square and the digit's value corresponds to what occupies that square.
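A sketch of that encoding. There are 32 playable squares and five possible occupants per square; the particular digit assignment (0 = empty, 1/2 = black man/king, 3/4 = red man/king) is an assumption for illustration.

    def encode(squares):
        # squares: 32 ints in 0..4, one per playable square.
        key = 0
        for digit in squares:
            key = key * 5 + digit   # append one base-5 digit
        return key

    def decode(key):
        # Recover the 32 per-square digits from the stored number.
        squares = []
        for _ in range(32):
            key, digit = divmod(key, 5)
            squares.append(digit)
        return list(reversed(squares))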

Learning by Generalization

• Uses a heuristic function to guide moves
• Changes the heuristic function after games based on the outcome
• Good in the mid game but not as good in the early and end games
• Requires identifying the features that affect the game

Development

• Use of the minimax algorithm with alpha-beta pruning (sketched after this list)

• Use of both learning by rote and learning by generalization

• Temporal difference learning
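A minimal alpha-beta version of the earlier minimax sketch, using the same assumed helpers. Pruning skips branches that cannot change the final choice, which fights minimax's exponential growth.

    def alphabeta(board, depth, alpha, beta, maximizing):
        if depth == 0 or game_over(board):
            return heuristic(board)
        if maximizing:
            value = float('-inf')
            for m in legal_moves(board):
                value = max(value, alphabeta(apply_move(board, m),
                                             depth - 1, alpha, beta, False))
                alpha = max(alpha, value)
                if alpha >= beta:
                    break   # prune: the minimizer will never allow this line
            return value
        else:
            value = float('inf')
            for m in legal_moves(board):
                value = min(value, alphabeta(apply_move(board, m),
                                             depth - 1, alpha, beta, True))
                beta = min(beta, value)
                if alpha >= beta:
                    break   # prune: the maximizer already has something better
            return value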

Temporal Difference Learning

• In temporal difference learning, you adjust the heuristic based on the difference between the heuristic at one time and at another
• The equilibrium moves toward the ideal function
• U(s) ← U(s) + α(R(s) + γU(s') − U(s))
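The update rule as a one-step sketch in Python: U is a table of board values, alpha the learning rate, gamma the discount factor, and reward the value of R(s). The names are assumed for illustration.

    def td_update(U, s, s_next, reward, alpha, gamma):
        # U(s) <- U(s) + alpha * (R(s) + gamma * U(s') - U(s))
        U[s] += alpha * (reward + gamma * U[s_next] - U[s])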

Temporal Difference Learning

• There is no proof that a prediction made closer to the end of the game is better, but common sense says it is

• Changes heuristic so that it better predicts the value of all boards

• Adjusts the weights of the heuristic

Alpha Value

• The alpha value scales down changes to the heuristic based on how much data you have
• Diminishing returns
• Necessary for ensuring rare occurrences do not change the heuristic too much

Development

• Equation for learning applied to each weight:
  w = (previous − current)(previous + current/2)
• Equation for the alpha value:
  α = 50/(49 + n)
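A sketch of that alpha schedule; n is assumed to be the number of games (or observations of a weight) seen so far.

    def alpha(n):
        # a = 50 / (49 + n): equals 1.0 at n = 1 and decays toward 0,
        # so rare occurrences seen late change the heuristic very little.
        return 50.0 / (49.0 + n)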

Results

• The value of each weight reaches an equilibrium
• Changes to reflect the learning of the program
• Occasionally requires programmer intervention when it reaches a false equilibrium


Results

• Learning by rote requires a large data set
• Requires large amounts of memory
• Necessary for determining the alpha value in temporal difference learning

Conclusions

• Temporal difference learning is a good way to find equilibrium weights
• Sometimes requires programmer intervention
• Doesn't require much memory
• Substantial learning could be achieved with relatively few runs
• Learning did not require the program to know strategies, but it does require the program to play towards a win