SAL: A Game Learning Machine

Post on 07-Jan-2016




Joel Paulson & Brian Lanners

Introduction

Why AI game playing?
Why AI game learning?
SAL (Michael Gherrity, 1993)
Search and Learning

Consistency Search

Basic Concept
Reasons for Use
  Allows for Errors in Evaluation Function
  Pathological Games

Procedure

Consistent Positions
  Evaluation of a position is equal to its minimax value
Inconsistent Positions
Identifying and Correcting Errors

General Example

If B is inconsistent, then one of the evaluations of B, D, or E is incorrect
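
The consistency test can be sketched in a few lines. The slide's figure is not part of this transcript, so the sketch assumes that D and E are the children of B, that B is a minimizing node, and that the minimax value is backed up over a single ply. The numeric evaluations and the names `minimax_backup` and `is_consistent` are hypothetical, chosen only for illustration.

```python
from typing import Dict, List


def minimax_backup(child_values: List[float], maximizing: bool) -> float:
    """One-ply minimax value computed from the children's static evaluations."""
    return max(child_values) if maximizing else min(child_values)


def is_consistent(
    evals: Dict[str, float],
    children: Dict[str, List[str]],
    node: str,
    maximizing: bool,
    tol: float = 1e-9,
) -> bool:
    """A position is consistent when its own evaluation equals its minimax value."""
    kids = children.get(node, [])
    if not kids:
        return True  # a leaf has no backed-up value to disagree with
    backed_up = minimax_backup([evals[k] for k in kids], maximizing)
    return abs(evals[node] - backed_up) <= tol


# Assumed tree shape: B is the parent of D and E.
children = {"B": ["D", "E"]}
# Made-up static evaluations for the three positions.
evals = {"B": 0.2, "D": 0.5, "E": 0.7}

# B evaluates to 0.2, but min(0.5, 0.7) = 0.5, so B is inconsistent:
# at least one of the three evaluations must be wrong.
print(is_consistent(evals, children, "B", maximizing=False))  # False
```

The check only reports that a disagreement exists. It cannot say which of the three evaluations is the faulty one, which is the point the slide makes.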

Consistency Search in Play

Organization of SAL

Game Independent Kernel
  Consistency Search Algorithm
  Evaluation Functions
Game Specific Move Generator
  Incorporates rules of game
  Three Subroutines: MoveGenerator, MakeMove, EndOfGame
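
As a rough illustration of this split, the three subroutines can be written as an interface that a game-independent kernel would call without knowing the rules. The slides give only the subroutine names, so the signatures, the board representation, and the `GameRules` and `TicTacToe` classes below are assumptions, not SAL's actual code.

```python
from typing import List, Optional, Protocol, Tuple

Board = Tuple[str, ...]  # 9 cells, each "X", "O", or " " (assumed representation)


class GameRules(Protocol):
    """Hypothetical interface mirroring MoveGenerator, MakeMove, EndOfGame."""

    def move_generator(self, board: Board, player: str) -> List[int]: ...
    def make_move(self, board: Board, move: int, player: str) -> Board: ...
    def end_of_game(self, board: Board) -> Optional[str]: ...


# All eight winning lines on a 3x3 board, as cell indices.
LINES = [
    (0, 1, 2), (3, 4, 5), (6, 7, 8),
    (0, 3, 6), (1, 4, 7), (2, 5, 8),
    (0, 4, 8), (2, 4, 6),
]


class TicTacToe:
    """Game-specific rules; the kernel only ever sees the three methods."""

    def move_generator(self, board: Board, player: str) -> List[int]:
        # Every empty cell is a legal move for either player.
        return [i for i, cell in enumerate(board) if cell == " "]

    def make_move(self, board: Board, move: int, player: str) -> Board:
        # Return a new board rather than mutating, so search can branch freely.
        cells = list(board)
        cells[move] = player
        return tuple(cells)

    def end_of_game(self, board: Board) -> Optional[str]:
        # "X" or "O" for a win, "draw" for a full board, None if play continues.
        for a, b, c in LINES:
            if board[a] != " " and board[a] == board[b] == board[c]:
                return board[a]
        return "draw" if " " not in board else None


rules: GameRules = TicTacToe()
board: Board = tuple(" " * 9)
board = rules.make_move(board, 4, "X")
print(rules.move_generator(board, "O"))  # [0, 1, 2, 3, 5, 6, 7, 8]
print(rules.end_of_game(board))          # None
```

Under this arrangement, supporting another game such as Connect Four would mean supplying a different class with the same three methods while the kernel stays unchanged.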

Features

Used as input for Neural Network
Feature Discovery Problem
Features in SAL

Neural Networks

SAL uses features for input
Weights altered following each game using Temporal Difference Learning
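
To make the weight update concrete, here is a minimal sketch of a temporal difference update applied after a finished game. The slides do not state SAL's network architecture or which TD variant it uses, so this swaps in a single sigmoid unit over the feature vector and a plain TD(0) rule. The functions `predict` and `td0_update`, the learning rate, and the feature vectors are all hypothetical.

```python
import math
from typing import List, Sequence


def predict(weights: Sequence[float], features: Sequence[float]) -> float:
    """Evaluation in (0, 1): a single sigmoid unit standing in for the network."""
    z = sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))


def td0_update(
    weights: Sequence[float],
    episode: Sequence[Sequence[float]],
    outcome: float,
    alpha: float = 0.1,
) -> List[float]:
    """Replay one finished game and return the adjusted weights.

    episode: feature vectors of the positions seen, in order of play.
    outcome: final result of the game, scaled to [0, 1].
    """
    w = list(weights)
    last = len(episode) - 1
    for t, x in enumerate(episode):
        v = predict(w, x)
        # The target is the next position's evaluation, or the true outcome at the end.
        target = outcome if t == last else predict(w, episode[t + 1])
        delta = target - v
        slope = v * (1.0 - v)  # derivative of the sigmoid at the current output
        for i, xi in enumerate(x):
            w[i] += alpha * delta * slope * xi
    return w


# Two made-up positions with one active feature each; the game ends in a win (1.0).
start = [0.0, 0.0]
game = [[1.0, 0.0], [0.0, 1.0]]
updated = td0_update(start, game, outcome=1.0)
print(updated)  # [0.0, 0.0125]
```

With zero initial weights both positions evaluate to 0.5, so the first step produces no error and only the final position's weight moves toward the win. Repeating the update over many games is what lets earlier positions inherit value from later ones.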

Performance of SAL

Tic-Tac-Toe
Connect Four
Chess