Artificial Intelligence
Lecture 6
Bicol University College of Science
1st Semester, 2021-2022
Adversarial Search
What are games, and why study them?
Games are a form of multi-agent environment
What do other agents do and how do they affect our success?
Cooperative vs. competitive multi-agent environments.
Competitive multi-agent environments give rise to adversarial
problems a.k.a. games
Why study games?
Fun; historically entertaining
Interesting subject of study because they are hard
Easy to represent, and agents are restricted to a small number of actions
Adversarial Search
Adversarial search problems == games
They occur in multiagent competitive environments
There is an opponent we can’t control planning
against us!
Game vs. search: the optimal solution is not a sequence
of actions but a strategy (policy): if the opponent does a,
the agent does b; else if the opponent does c, the agent does d;
etc.
Tedious and fragile if hard-coded (i.e., implemented
with rules)
Good news: Games are modeled as search
problems and use heuristic evaluation functions.
Games: hard topic
Games are a big deal in AI
Games are interesting to AI because they are too
hard to solve exactly
Chess has a branching factor of about 35, and a game
often runs to about 100 moves, so the game tree has
roughly 35^100 ≈ 10^154 nodes
We need to make some decision even when computing the
optimal decision is infeasible
Adversarial Search
Checkers:
Chinook ended 40-year-reign of human world
champion Marion Tinsley in 1994.
Used an endgame database defining perfect play
for all positions involving 8 or fewer pieces on the
board, a total of 443,748,401,247 positions.
Adversarial Search
Chess:
In 1949, Claude E. Shannon, in his paper “Programming a Computer for
Playing Chess”, suggested chess as an AI problem for the community.
Deep Blue defeated human world champion Garry Kasparov in a six-
game match in 1997.
In 2006, Vladimir Kramnik, the undisputed world champion, was
defeated 4-2 by Deep Fritz.
Adversarial Search
Go: b > 300! Google DeepMind's AlphaGo project. In 2016,
AlphaGo beat both Fan Hui, the European Go champion, and Lee
Sedol, one of the world's best players.
Othello: several computer Othello programs exist, and human
champions refuse to compete against computers, which are too good.
Relation of Games to Search
Search – no adversary
● Solution is a (heuristic) method for finding a goal
● Heuristic techniques can find the optimal solution
● Evaluation function: estimate of cost from start to goal through a given node
● Examples: path planning, scheduling activities
Games – adversary
● Solution is a strategy (a strategy specifies a move for every possible opponent reply)
● Time limits force an approximate solution
● Evaluation function: evaluates the “goodness” of a game position
● Examples: chess, checkers, Othello, backgammon
Types of Games
We are mostly interested in deterministic, fully observable,
zero-sum games where two agents act alternately.
Games with perfect information: no randomness is involved.
Games with imperfect information: random factors are part
of the game.
Zero-sum Games
Adversarial: Pure competition.
Agents place opposite values on the outcomes.
One agent maximizes one single value, while the
other minimizes it.
Each move by one of the players is called a “ply.”
One function: one agent maximizes it and one
minimizes it!
Embedded thinking...
One agent is trying to figure out what to do.
How to decide? He thinks about the consequences of the
possible actions.
He needs to think about his opponent as well...
The opponent is also thinking about what to do etc.
Each will imagine what would be the response from the
opponent to their actions.
This entails embedded thinking.
Game setup
Two players: MAX and MIN
MAX moves first and they take turns until the game
is over. The winner gets a reward, the loser a penalty.
Formulate game as search problem
MAX uses search tree to determine next move.
Searching in a two player game
Problem Formulation:
Initial state: board configurations and the player to
move.
Successor function: list of pairs (move, state)
specifying legal moves and their resulting states.
(moves + initial state = game tree)
A terminal test: decides whether the game has finished.
A utility function: produces a numerical value for
(only) the terminal states. Example: In chess, outcome
= win/loss/draw, with values +1, -1, 0 respectively.
Players need search tree to determine next move.
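A minimal sketch of this formulation in Python follows; the interface names (Game, actions, result, is_terminal, utility) are illustrative, not from any particular library.

    class Game:
        """Abstract two-player game: MAX moves first, then players alternate."""
        def initial_state(self):
            """Board configuration plus the player to move."""
            raise NotImplementedError
        def actions(self, state):
            """Legal moves available in this state."""
            raise NotImplementedError
        def result(self, state, action):
            """Successor function: the state reached by playing action."""
            raise NotImplementedError
        def is_terminal(self, state):
            """Terminal test: has the game finished?"""
            raise NotImplementedError
        def utility(self, state):
            """Numeric value of a terminal state, e.g. win/loss/draw = +1/-1/0."""
            raise NotImplementedError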
Partial Game Tree for Tic-Tac-Toe
Minimax
Find the optimal strategy for Max:
– Depth-first search of the game tree
– An optimal leaf node could appear at any depth of
the tree
– Minimax principle: compute the utility of being in a
state assuming both players play optimally from
there until the end of the game
– Propagate minimax values up the tree once
terminal nodes are discovered
Optimal strategies
Find the contingent strategy for MAX assuming an
infallible MIN opponent.
Assumption: Both players play optimally !!
Given a game tree, the optimal strategy can be
determined by using the minimax value of each node:
MINIMAX-VALUE(n) =
  UTILITY(n)                                  if n is a terminal node
  max_{s ∈ Successors(n)} MINIMAX-VALUE(s)    if n is a MAX node
  min_{s ∈ Successors(n)} MINIMAX-VALUE(s)    if n is a MIN node
Two-Ply Game Tree (figure sequence)
The minimax decision
Minimax maximizes the worst-case outcome for max.
Partial Game Tree for Tic-Tac-Toe (figure sequence)
What if MIN does not play optimally?
Definition of optimal play for MAX assumes
MIN plays optimally: maximizes worst-case
outcome for MAX.
But if MIN does not play optimally, MAX will
do even better.
Minimax Algorithm
function MINIMAX-DECISION(state) returns an action
  inputs: state, current state in game
  v ← MAX-VALUE(state)
  return the action in SUCCESSORS(state) with value v
function MAX-VALUE(state) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← −∞
  for a, s in SUCCESSORS(state) do
    v ← MAX(v, MIN-VALUE(s))
  return v
function MIN-VALUE(state) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← +∞
  for a, s in SUCCESSORS(state) do
    v ← MIN(v, MAX-VALUE(s))
  return v
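The same algorithm as runnable Python, written against the hypothetical Game interface sketched earlier; a minimal sketch under those assumptions, not a definitive implementation.

    import math

    def minimax_decision(game, state):
        # Pick the action whose successor has the highest minimax value.
        # MIN moves next, so successors are evaluated with min_value.
        return max(game.actions(state),
                   key=lambda a: min_value(game, game.result(state, a)))

    def max_value(game, state):
        if game.is_terminal(state):
            return game.utility(state)
        v = -math.inf                  # best value found so far for MAX
        for a in game.actions(state):
            v = max(v, min_value(game, game.result(state, a)))
        return v

    def min_value(game, state):
        if game.is_terminal(state):
            return game.utility(state)
        v = math.inf                   # best value found so far for MIN
        for a in game.actions(state):
            v = min(v, max_value(game, game.result(state, a)))
        return v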
Properties of Minimax
Criterion    Minimax
Complete?    Yes (for finite trees)
Time         O(b^m)
Space        O(bm)
Optimal?     Yes
(b: branching factor, m: maximum depth of the tree)
Minimax Algorithm
Perfect for deterministic, two-player games
One opponent tries to maximize score (Max)
One opponent tries to minimize score (Min)
Goal: move to position of highest minimax
value
Identify best achievable payoff against best
play
Multiplayer games
Some games allow more than two players
Scalar minimax values then become vectors, with one
utility per player (see the sketch below)
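A hedged sketch of the idea, assuming two extra methods on the earlier Game interface (utilities returning one payoff per player, and to_move returning the index of the player on move):

    def vector_value(game, state):
        # Returns a tuple of utilities, one entry per player.
        if game.is_terminal(state):
            return game.utilities(state)    # assumed, e.g. (u0, u1, u2)
        player = game.to_move(state)        # assumed: index of player on move
        # Each player maximizes their own component of the value vector.
        return max((vector_value(game, game.result(state, a))
                    for a in game.actions(state)),
                   key=lambda vec: vec[player])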
Problem of minimax search
The number of game states is exponential in the
number of moves.
Solution: do not examine every node
==> Alpha-beta pruning
● Alpha = value of the best choice found so far at any
choice point along the path for MAX
● Beta = value of the best choice found so far at any
choice point along the path for MIN
Alpha-beta Game Playing
Basic idea:
“If you have an idea that is surely bad, don't take the
time to see how truly awful it is.” -- Pat Winston
Some branches will never be played by rational players since
they include sub-optimal decisions (for either player).
(Figure) The root is a MAX node whose left MIN child evaluates
to min(2, 7) = 2, so the root's value is ≥ 2. The right MIN
child's first leaf is 1, so its value is ≤ 1. We don't need to
compute the value at this node: no matter what its remaining
leaf is, it can't affect the value of the root node.
Alpha-Beta Example
Do depth-first search until the first leaf, keeping at each
node the range of possible values, initially [−∞, +∞].
(Figure sequence) The first MIN child narrows from [−∞, 3] to
[3, 3] once all its leaves are seen, so the root's range becomes
[3, +∞]. The second MIN child's first leaf gives it the range
[−∞, 2]; this node is worse for MAX, so its remaining leaves are
pruned. The third MIN child narrows from [−∞, 14] (root range
[3, 14]) to [−∞, 5] (root range [3, 5]) to [2, 2]. The root's
final value is [3, 3].
Alpha-Beta Pruning Algorithm
function ALPHA-BETA-SEARCH(state) returns an action
  inputs: state, current state in game
  v ← MAX-VALUE(state, −∞, +∞)
  return the action in SUCCESSORS(state) with value v
function MAX-VALUE(n, alpha, beta) returns a utility value
  if n is a leaf node then return f(n)
  for each child n′ of n do
    alpha ← max(alpha, MIN-VALUE(n′, alpha, beta))
    if alpha ≥ beta then return beta /* pruning */
  end
  return alpha
function MIN-VALUE(n, alpha, beta) returns a utility value
  if n is a leaf node then return f(n)
  for each child n′ of n do
    beta ← min(beta, MAX-VALUE(n′, alpha, beta))
    if beta ≤ alpha then return alpha /* pruning */
  end
  return beta
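The same procedure as runnable Python against the hypothetical Game interface from earlier; one sketch among several reasonable ways to organize it.

    import math

    def alpha_beta_search(game, state):
        # The root is a MAX node; the best value seen so far serves as alpha.
        best_action, best_v = None, -math.inf
        for a in game.actions(state):
            v = ab_min(game, game.result(state, a), best_v, math.inf)
            if v > best_v:
                best_action, best_v = a, v
        return best_action

    def ab_max(game, state, alpha, beta):
        if game.is_terminal(state):
            return game.utility(state)
        for a in game.actions(state):
            alpha = max(alpha, ab_min(game, game.result(state, a), alpha, beta))
            if alpha >= beta:
                return beta    # prune: MIN above will never let play get here
        return alpha

    def ab_min(game, state, alpha, beta):
        if game.is_terminal(state):
            return game.utility(state)
        for a in game.actions(state):
            beta = min(beta, ab_max(game, game.result(state, a), alpha, beta))
            if beta <= alpha:
                return alpha   # prune: MAX above already has a better option
        return beta

On the tree from the earlier example (MIN children over leaves 3/12/8, 2/…, 14/5/2), this abandons the second child right after its first leaf, matching the interval trace.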
General alpha-beta pruning
Consider a node n somewhere in the tree.
If the player has a better choice at:
● the parent node of n, or
● any choice point further up,
then n will never be reached in actual play.
Hence when enough is known about n, it can be pruned.
Final Comments about Alpha-Beta Pruning
Pruning does not affect the final result.
Entire subtrees can be pruned.
Good move ordering improves the effectiveness of pruning.
With “perfect ordering,” time complexity is O(b^(m/2))
● Effective branching factor of sqrt(b)!
● Alpha-beta pruning can look ahead twice as far as minimax in the
same amount of time
Repeated states are again possible.
● Store them in memory = transposition table (see the sketch below)
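A minimal sketch of that idea: memoize minimax values in a dictionary keyed by a hashable encoding of the state. The game.key method is an assumption; any canonical, hashable encoding of the board works.

    def minimax_tt(game, state, is_max, table):
        key = (game.key(state), is_max)    # assumed hashable state encoding
        if key in table:
            return table[key]              # repeated state: reuse stored value
        if game.is_terminal(state):
            v = game.utility(state)
        elif is_max:
            v = max(minimax_tt(game, game.result(state, a), False, table)
                    for a in game.actions(state))
        else:
            v = min(minimax_tt(game, game.result(state, a), True, table)
                    for a in game.actions(state))
        table[key] = v                     # record for future transpositions
        return v

    # Usage: value = minimax_tt(game, game.initial_state(), True, {})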
Imperfect real-time decisions
Minimax and alpha-beta pruning require too many
leaf-node evaluations.
This may be impractical within a reasonable
amount of time.
SHANNON (1950):
● Cut off the search earlier (replace TERMINAL-TEST by CUTOFF-TEST)
● Apply a heuristic evaluation function EVAL (replacing the utility
function of alpha-beta)
Cutting off search
Change:
if TERMINAL-TEST(state) then return UTILITY(state)
into
if CUTOFF-TEST(state,depth) then return EVAL(state)
This introduces a fixed depth limit, depth.
It is selected so that the amount of time used will not exceed what
the rules of the game allow.
When the cutoff occurs, the evaluation is performed.
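In the Python sketch from earlier, the change amounts to one extra test per call. The depth limit and the game.eval method (the heuristic EVAL discussed next) are assumptions.

    import math

    def h_max_value(game, state, depth, limit):
        if game.is_terminal(state):
            return game.utility(state)   # true utility still used at terminals
        if depth >= limit:               # CUTOFF-TEST(state, depth)
            return game.eval(state)      # heuristic estimate replaces UTILITY
        v = -math.inf
        for a in game.actions(state):
            v = max(v, h_min_value(game, game.result(state, a), depth + 1, limit))
        return v

    def h_min_value(game, state, depth, limit):
        if game.is_terminal(state):
            return game.utility(state)
        if depth >= limit:
            return game.eval(state)
        v = math.inf
        for a in game.actions(state):
            v = min(v, h_max_value(game, game.result(state, a), depth + 1, limit))
        return v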
Evaluation Function
The evaluation function is a heuristic function, and
it is where the domain experts' knowledge resides.
It is applied at the search cutoff point.
It must score terminal/goal states the same way as the
utility function.
Tradeoff between accuracy and time → reasonable complexity
It must be accurate:
● Performance of a game-playing system depends on the
accuracy/goodness of its evaluation
● Evaluation of nonterminal states should be strongly correlated
with the actual chances of winning
Heuristic EVAL(uation) Function
Idea: produce an estimate of the expected utility of
the game from a given position.
Performance depends on quality of EVAL.
Requirements:
EVAL should order terminal nodes in the same way as UTILITY.
Computation must not take too long.
For non-terminal states, EVAL should be strongly correlated with the
actual chance of winning.
Only useful for quiescent states (no wild swings in value in the
near future).
Heuristic EVAL example
For Tic-Tac-Toe:
f(s) = [# of 3-lengths open for me] - [# of 3-lengths open for you]
where a 3-length is a complete row, column, or diagonal.
For chess, typically linear weighted sum of features
Eval(s) = w1f1(s)+w2f2(s)+ … +wnfn(s)
e.g., Alan Turing's function for chess:
f(s) = w(s)/b(s), where w(s) is the sum of the point values of
White's pieces and b(s) is the sum for Black
● Example features for chess are piece count, piece placement,
squares controlled, etc.
● Deep Blue had about 6,000 features in its evaluation function
Heuristic EVAL example
Eval(s) = w1 f1(s) + w2 f2(s) + … + wn fn(s)
The weighted addition assumes the features are independent.
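As a concrete sketch of the weighted-sum form: the piece values, feature names, and state.pieces method below are illustrative assumptions, not a fixed API.

    PIECE_VALUES = {'P': 1, 'N': 3, 'B': 3, 'R': 5, 'Q': 9}  # conventional points

    def material(state, color):
        # Sum of point values of color's pieces; state.pieces(color) is assumed
        # to yield piece letters like 'P', 'N', 'B', ...
        return sum(PIECE_VALUES[p] for p in state.pieces(color))

    def eval_weighted(state, weights, features):
        # Eval(s) = w1*f1(s) + w2*f2(s) + ... + wn*fn(s)
        # The sum implicitly treats the features as independent.
        return sum(w * f(state) for w, f in zip(weights, features))

    def eval_turing(state):
        # Turing-style f(s) = w(s)/b(s): White's material over Black's.
        return material(state, 'white') / material(state, 'black')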
End