Adversarial searchpeople.cs.pitt.edu/~milos/courses/cs2710-Fall2017/Lectures/Class9.pdf3 M....

M. HauskrechtCS 2710 Foundations of AI

CS 2710 Foundations of AI

Lecture 9

Milos Hauskrecht

milos@cs.pitt.edu

5329 Sennott Square

Adversarial search

M. Hauskrecht

Game search

• Game-playing programs developed by AI researchers since the beginning of the modern AI era

– Programs playing chess, checkers, etc (1950s)

• Specifics of the game search:

– Sequences of player’s decisions we control

– Decisions of other player(s) we do not control

• Contingency problem: many possible opponent’s moves must be “covered” by the solution

Opponent’s behavior introduces an uncertainty in to the game

– We do not know exactly what the response is going to be

• Rational opponent – maximizes it own utility (payoff) function

M. Hauskrecht

Types of game problems

• Types of game problems:

– Adversarial games:

• win of one player is a loss of the other

– Cooperative games:

• players have common interests and utility function

– A spectrum of game problems in between the two:

we focus on adversarial games only!!

Adversarial games Fully cooperative games

M. Hauskrecht

Example of an adversarial 2 person game:

Tic-tac-toe

WinLoss Draw

Player 1 (x) moves

Player 2 (o) moves

Player 1 (x) moves

M. Hauskrecht

Game search problem

• Game problem formulation:

– Initial state: initial board position + info whose move it is

– Operators: legal moves a player can make

– Goal (terminal test): determines when the game is over

– Utility (payoff) function: measures the outcome of the

game and its desirability

• Search objective:

– find the sequence of player’s decisions (moves)

maximizing its utility (payoff)

Caveat: Consider the opponent’s moves and their utility

M. Hauskrecht

Game problem formulation (Tic-tac-toe)

Objectives:

• Player 1:

maximize outcome

• Player 2:

minimize outcome

Utility: 1-1 0

Initial state

Terminal (goal) states

Operators

M. Hauskrecht

Minimax algorithm

How to deal with the contingency problem?

• Assuming that the opponent is rational and always optimizes

its behavior (opposite to us) we consider the best opponent’s

response

• Then the minimax algorithm determines the best move

3 12 8 2 4 6 14 5 2

M. Hauskrecht

Minimax algorithm. Example

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

Minimax algorithm

M. Hauskrecht

Complexity of the minimax algorithm

• We need to explore the complete game tree before making the

decision

Complexity:

M. Hauskrecht

Complexity of the minimax algorithm

• We need to explore the complete game tree before making the decision

• Impossible for large games

– Chess: 35 operators, game can have 50 or more moves

Complexity:

)( mbO

M. Hauskrecht

Solution to the complexity problem

Two solutions:

1. Dynamic pruning of redundant branches of the search tree

– identify a provably suboptimal branch of the search tree

before it is fully explored

– Eliminate the suboptimal branch

Procedure: Alpha-Beta pruning

2. Early cutoff of the search tree

– uses imperfect minimax value estimate of non-terminal

states (positions)

M. Hauskrecht

Alpha beta pruning

• Some branches will never be played by rational players since

they include sub-optimal decisions (for either player)

M. Hauskrecht

Alpha beta pruning. Example

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

M. Hauskrecht

4 3 6 2 2 1 9 35 1 5 4 7 5

nodes that were never explored !!!

M. Hauskrecht

Alpha-Beta pruning

M. Hauskrecht

Using minimax value estimates

• Idea:

– Cutoff the search tree before the terminal state is reached

– Use imperfect estimate of the minimax value at the leaves

• Heuristic evaluation function

Heuristic

evaluation

function

Cutoff level

M. Hauskrecht

Design of evaluation functions

• Heuristic estimate of the value for a sub-tree

• Examples of a heuristic functions:

– Material advantage in chess, checkers

• Gives a value to every piece on the board, its position

and combines them

– More general feature-based evaluation function

• Typically a linear evaluation function:

kk wsfwsfwsfsf )()()()( 2211

)(sfi - a feature of a state s

iw - feature weight

M. Hauskrecht

Further extensions to real games

• Restricted set of moves to be considered under the cutoff level

to reduce branching and improve the evaluation function

– E.g., consider only the capture moves in chess

Heuristic estimates

Adversarial searchpeople.cs.pitt.edu/~milos/courses/cs2710-Fall2017/Lectures/Class9.pdf3 M....

Documents

Transcript of Adversarial searchpeople.cs.pitt.edu/~milos/courses/cs2710-Fall2017/Lectures/Class9.pdf3 M....

Graphs - Universidad Autónoma Metropolitanagalois.azc.uam.mx/mate/DISCRETAS/graphs.pdf · 2 M. Hauskrecht Graphs: basics Basic types of graphs: • Directed graphs • Undirected

Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.

people.cs.pitt.edujlee/teaching/cs1675/cs1675_week14.pdf · (C) Hauskrecht 1 CS 1675 Introduction to Machine Learning Lecture 26 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott Square

Linear regression (cont) Logistic regressionpeople.cs.pitt.edu/~milos/courses/cs2710/Lectures/Class24.pdf4 Solving linear regression By rearranging the terms we get a system of linear

Feature selection Dimensionality reductionpeople.cs.pitt.edu/.../Lectures/Class20.pdf · Lecture 20 Milos Hauskrecht milos@cs.pitt.edu 5329 SennottSquare ... Classification problem:

First-Order Logicpeople.cs.pitt.edu/~litman/courses/cs2710/lectures/ch08RN.pdf · First-Order Logic Chapter 8 Outline • Why FOL? • Syntax and semantics of FOL • Using FOL •

Functions II - University of Pittsburghpeople.cs.pitt.edu/~milos/courses/cs441/lectures/Class9.pdf · milos@cs.pitt.edu 5329 Sennott Square Functions II M. Hauskrecht Functions ...

Graphs - Middle East Technical Universityuser.ceng.metu.edu.tr/~mutlu/teaching/kesikliMatematik/graph1.pdf · Lecture 25 Milos Hauskrecht milos@cs.pitt.edu 5329 Sennott Square Graphs

Informed Search and Beyond - University of Pittsburghpeople.cs.pitt.edu/~litman/courses/cs2710/lectures/ch03bRN.pdf · CS 2710 – Informed Search 8 Best-First Search (cont.) Some

CS 2710 Foundations of AI Lecture 1 · M. Hauskrecht CS 2710 Foundations of AI Lecture 1 Milos Hauskrecht milos@pitt.edu 5329 Sennott Square Course overview CS 2710 Foundations of

Artificial Intelligence: The Next Twenty-Five Yearspeople.cs.pitt.edu/~litman/courses/cs2710/papers/1750.pdf · 2010. 8. 11. · world’s chess champ, commercial sys-tems are exploiting

Incremental methods for computing servable …Incremental methods for computing Markov decision Milos Hauskrecht MIT Laboratory for Computer Science, NE43-421 545 Technology Square

Introduction (cont.) Problem solving by searchingmilos/courses/cs1571... · 9 CS 1571 Intro to AI M. Hauskrecht Other application areas • Text classification, document sorting:

Value-Function Approximations for Partially Observable ...milos/research/JAIR-2000.pdf · Value-Function Approximations for Partially Observable Markov Decision Processes Milos Hauskrecht

First-order logic.milos/courses/cs2710-Fall...– An interpretation gives a semantic to primitives. It associates primitives with objects, values in the real world. ... substitution

Uninformed search methods II. - University of Pittsburghpeople.cs.pitt.edu/~milos/courses/cs2710/Lectures/Class4.pdf · Uninformed search methods II. ... • Cuts the depth of the

Sample-efficient learning with auxiliary class-label ...quang/PAPERS/AMIA11_presentation.pdf · Quang Nguyen Hamed Valizadegan Amy Seybert Milos Hauskrecht ... •For each patient

Knowledge Representation. Propositional logicpeople.cs.pitt.edu/~milos/courses/cs2710/Lectures/Class10.pdf · 3 M. Hauskrecht Logic A formal language for expressing knowledge and

Propositional Logic - University of Pittsburghpeople.cs.pitt.edu/~litman/courses/cs2710/lectures/ch07...Propositional Logic Chapter 7 Outline • Review – Knowledge-based agents

CS 2710 Foundations of AI Lecture 1people.cs.pitt.edu/~milos/courses/cs2710-Fall2020/Lectures/Class1.pdfArtificial Intelligence •The field of Artificial intelligence: –The design