SAL: A Game Learning Machine

10
SAL: A Game Learning Machine Joel Paulson & Brian Joel Paulson & Brian Lanners Lanners

description

SAL: A Game Learning Machine. Joel Paulson & Brian Lanners. Introduction. Why AI game playing? Why AI game learning? SAL (Michael Gherrity, 1993) Search and Learning. Consistency Search. Basic Concept Reasons for Use Allows for Errors in Evaluation Function Pathological Games. - PowerPoint PPT Presentation

Transcript of SAL: A Game Learning Machine

Page 1: SAL: A Game Learning Machine

SAL: A Game Learning Machine

Joel Paulson & Brian LannersJoel Paulson & Brian Lanners

Page 2: SAL: A Game Learning Machine

Introduction

Why AI game playing?Why AI game playing? Why AI game learning?Why AI game learning? SAL (Michael Gherrity, 1993)SAL (Michael Gherrity, 1993) Search and LearningSearch and Learning

Page 3: SAL: A Game Learning Machine

Consistency Search

Basic ConceptBasic Concept Reasons for UseReasons for Use

Allows for Errors in Evaluation FunctionAllows for Errors in Evaluation Function Pathological GamesPathological Games

Page 4: SAL: A Game Learning Machine

Procedure

Consistent PositionsConsistent Positions Evaluation of a position is equal to its Evaluation of a position is equal to its

minimax valueminimax value Inconsistent PositionsInconsistent Positions

Identifying and Correcting ErrorsIdentifying and Correcting Errors

Page 5: SAL: A Game Learning Machine

General Example

If B is inconsistent, then one of the If B is inconsistent, then one of the evaluations of B, D, or E is incorrectevaluations of B, D, or E is incorrect

Page 6: SAL: A Game Learning Machine

Consistency Search in Play

Page 7: SAL: A Game Learning Machine

Organization of SAL

Game Independent KernelGame Independent KernelConsistency Search AlgorithmConsistency Search AlgorithmEvaluation FunctionsEvaluation Functions

Game Specific Move Generator Game Specific Move Generator Incorporates rules of gameIncorporates rules of gameThree Subroutines: MoveGenerator, Three Subroutines: MoveGenerator,

MakeMove, EndOfGameMakeMove, EndOfGame

Page 8: SAL: A Game Learning Machine

Features

Used as input for Neural NetworkUsed as input for Neural Network Feature Discovery ProblemFeature Discovery Problem Features in SALFeatures in SAL

Page 9: SAL: A Game Learning Machine

Neural Networks

SAL uses features for InputSAL uses features for Input Weights altered following each game using Weights altered following each game using

Temporal Difference LearningTemporal Difference Learning

Page 10: SAL: A Game Learning Machine

Performance of SAL Tic-Tac-ToeTic-Tac-Toe Connect FourConnect Four ChessChess