Games and Search: Lecture 3 - 58066 7 Artificial Intelligence (4ov / 8op)

This document discusses games and search problems in artificial intelligence. It covers topics like game trees, minimax strategy, alpha-beta pruning, and solving games. Game trees are used to represent games and show possible states and moves. The minimax strategy determines the optimal strategy for a player trying to maximize their payoff. Alpha-beta pruning removes parts of the game tree that don't need to be examined to improve search efficiency. Solving games involves finding the theoretical value or outcome of a game.

Uploaded by

Saipavanesh Guggilapu

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views

Games and Search: Lecture 3 - 58066 7 Artificial Intelligence (4ov / 8op)

Uploaded by

Saipavanesh Guggilapu

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 20

4.

Games and search

Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.1 Search problems
●
State space search — find a (shortest) path
from the initial state to the goal state.
●
Constraint satisfaction — find a value
assignment to a set of variables so that given
constraints are met.
●
Combinatorial optimization — find a value
assignment to a set of variables so that an
objective function is minimized (or maximized).
●
Games — find an optimal strategy to beat the
opponent. Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.2 Games of interest
●
Two-person zero-sum games; if one player wins, the
other must lose.
●
Players are adversarial — both not only try to win but
cause the opponent to lose.
●
Players are called:
– MAX  maximize one's own payoff
– MIN  minimize other's (MAX's) payoff
●
No chance involved
●
Complete information
– Available actions

– Payoffs Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.3 Game tree
●
Root of the tree is the starting position, with the indicator
who moves first.
●
Nodes represent possible states of the game.
●
Operators determine the legal moves.
●
Terminal test tells when the game is over.
●
Utility function gives a numeric value to the outcome of the
game.
●
Evaluation function enables the player to estimate if a
given state is good or bad.
●
Moves by two players are represented as alternate levels
in the tree. Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.4 Example tree
MAX

MIN

MAX

MIN

FINAL

Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.5 Expected utility
●
Game theoretic concept (Bernoulli, 1738; von Neumann &
Morgenstern, 1944)
●
Subjective value (utility) of an uncertain outcome is weighed by
its probability.
●
Decision makers try to maximize their expected utility.
●
Lottery example:
(a) Win $1000 with probability 1.0
(b) Win
✔
$101,000 with probability .001, or
✔
$900.9 with probability .999.
●
Which one would you choose?
Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.6 Minimax strategy
●
Proceeds depth-first
●
Determines the optimal strategy for MAX:
– Generate the whole game tree
– Propagate utility values from leaves toward the root.

3 A MAX

2 B1 3 B2 MIN

2 12 8 3 4 6

Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.7 Searching game trees
●
One cannot use exhaustive search
– The tree is potentially huge.
– The opponent complicates the search.
●
One can use depth-first or breadth-first search
to generate the tree, but other methods need to
be used to choose good moves:
– Bounded lookahead
– Alpha-beta pruning

Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.8 Bounded look-ahead
●
At each move the search tree is examined to
particular depth.
●
Difficulty of choosing a fixed cut-point:
– Non-quiescent positions in near future  cut search
only at points that are safe.
– Horizon problem; consequences of a bad move is
postponed beyond the search depth  no general
solution exists.

Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.9 Alpha-beta pruning
●
Remove sections of the game tree that are not worth examining.
●
In other words, if better outcome is already guaranteed after
examining one move or its parents, the others need not be examined.
●
Does not change the outcome of the game, if both players play
optimally.
●
Effectiveness depends on the order the nodes are evaluated.
●
For MAX node
–  = maximum value found in its descendants
–  = minimum beta value found in its MIN ancestors
●
For MIN node
–  = minimum value found in its descendants
–  = maximum alpha value found in its MAX ancestors

Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.10 Alpha-beta algorithm
●
Function alpha_beta(current_node, alpha, beta)
If ROOT(current_node)
alpha = -inf
beta = inf
If LEAF(current_node)
return payoff
If MAX_node
alpha = max(alpha, alpha_beta(children, alpha, beta))
If alpha  beta
cut_off(current_node)
If MIN_node
beta = min(beta, alpha_beta(children, alpha, beta))
If beta  alpha
cut_off(current_node)

Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.11 Alpha-beta example
 = -  = 3
a = MAX

=3 b c =3
MIN

=3 d e f =3 g MAX

1 2 3 4 5 7 1 0 2 6 1 5

Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.12 Other pruning methods
●
Null-move pruning
– Speeds up alpha-beta
– If the position after a skipped moved is strong enough to produce a cut-
off, likely the current position is strong enough even if the player actually
moved.
●
Forward pruning
– Node is discarded without searching beyond it, if it unlikely leads to
better moves.
– Non-zero probability of errors
– Lim & Lee (2006)
●
Errors more severe if opponent's moves pruned than own  prune in MAX
nodes especially if winning
●
The probability the error propagates to the root decreases as the depth of
error location increases.
●
However, the number of leaves increases at each depth faster than errors
can be avoided by minimax  prune less in deeper levels.
Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.13 Solving games
●
Finding the game-theoretic value of the game (van
den Herik et al., 2002): value indicates if the first
mover wins, loses or the game ends in draw.
●
Ultra-weakly solved, weakly solved and strongly
solved
●
Why? To explore if the knowledge from solving games
can be translated to rules and strategies that
– can be applied by humans.
– are general and not ad hoc.
– are transferable between games.
Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
State-space 4.14 Game space
complexity

Category 3 Category 4
if solvable, then by not solvable by
knowledge-based methods any method
(e.g., Go end games) (e.g., full Chess)

Category 1 Category 2
Solvable by any method if solvable, then by brute-first
(e.g., Go, Othello,
endgame Chess and Checkers)

Game-tree complexity
-State space complexity = number of legal positions reachable from
initial position
- Game-tree complexity = number of leaf nodes in the solution
search tree of the initial position.
- Convergent (Chess, Checkers) vs. divergent games (Go, Othello)
Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.15 Other kind of games
●
Two-person perfect-information games
(e.g., Chess, Othello)
●
Multiple-player, stochastic, incomplete or
imperfect information games (e.g., Poker,
Backgammon)
●
Interactive games, such as action games, role-
playing games, adventure games, and sports
games are a topic of whole another course!

Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.16 Availability of information
●
Complete information: every player knows the payoffs
and the strategies available to other players (type of
players, structure of the game)
●
Perfect information: player knows the actions of the
other players (what happens within the game)
●
Certain information: players know which game they
are playing, i.e., what the payoff from a certain
strategy will be given the strategies played by others.
●
Games of incomplete and imperfect information pose
problems to search based methods.
Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.17 How to study these games?
●
Simplified versions
– Subset of the game
– Address each sub-problem separately
– Abstractions; collect similar sub-problems into same class
– Two-players only

●
Tackle whole problem at once (Billings et al., 2002).
●
For instance, for poker this includes
– Betting strategy
– Opponent modeling
– Learning
– Performance evaluation
Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.18 Poki: architecture
●
Plays Texas Hold'em (Billings et al., 1999, 2002)
●
Architecture (also consists of a dealer)

Opponent Model Public game state:

- opponent action table Opponent Modeler
round #, bets to call,
- weight table betting history,
#players, position, ...
Hand Evaluator
Hand

Triple
p(fold), p(call), p(raise)

Betting rulebase
Simulator
Action selector

Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)
4.19 Poki: Learning and decision
making
●
Basic betting strategy:
1. Compute effective hand strength=current hand strength+potential
to improve
2. Calculate probabilities of actions: fold, call, and raise
3. Choose action stochastically
●
Simulation based betting strategy; play out many likely
scenarios to get the expected value of each betting action.
●
Opponent modeling
– Deduce the strength of the hand from actions
– Predict future actions
●
General Opponent Model (GOM): Fixed strategy based on rational
choice
●
Specific Opponent Model (SOM); Personal history

Lecture 3 – 580667 Artificial Intelligence (4ov / 8op)

Fanuc Focas Ethernet Manual
No ratings yet
Fanuc Focas Ethernet Manual
76 pages
Games
No ratings yet
Games
41 pages
Adversarial Search
No ratings yet
Adversarial Search
37 pages
Lecture05 AdversarialSearch
No ratings yet
Lecture05 AdversarialSearch
51 pages
Artificial Inteligence
No ratings yet
Artificial Inteligence
4 pages
Chapter3 - Search4
No ratings yet
Chapter3 - Search4
37 pages
Adversarial Search
No ratings yet
Adversarial Search
43 pages
AI Lec07 Adversarial Search
No ratings yet
AI Lec07 Adversarial Search
29 pages
WK 7
No ratings yet
WK 7
44 pages
Lecture Game Playing
No ratings yet
Lecture Game Playing
58 pages
Game Playing
No ratings yet
Game Playing
32 pages
ITSC6121 Lecture 4 -- Game Trees I
No ratings yet
ITSC6121 Lecture 4 -- Game Trees I
34 pages
Oradea: Bucharest Arad Craiova
No ratings yet
Oradea: Bucharest Arad Craiova
53 pages
Game Playing MINMAX Search, Alpha-Beta Pruning,.pdf
No ratings yet
Game Playing MINMAX Search, Alpha-Beta Pruning,.pdf
4 pages
Adversarial Search
No ratings yet
Adversarial Search
20 pages
AI Unit-3
No ratings yet
AI Unit-3
109 pages
AAI Lecture 7 Sp 25
No ratings yet
AAI Lecture 7 Sp 25
51 pages
Artificial Intelligence: Adversarial Search
No ratings yet
Artificial Intelligence: Adversarial Search
58 pages
Why Do AI Researchers Study Game Playing?
No ratings yet
Why Do AI Researchers Study Game Playing?
42 pages
Local Adversarial Search
No ratings yet
Local Adversarial Search
44 pages
02 Lecture
No ratings yet
02 Lecture
3 pages
6 Game
No ratings yet
6 Game
42 pages
IA-c06-NoAnim
No ratings yet
IA-c06-NoAnim
31 pages
1.1.4GamePlaying
No ratings yet
1.1.4GamePlaying
23 pages
Week 13 (1)
No ratings yet
Week 13 (1)
45 pages
SET394 - AI - Lecture 06 - Adversarial Search
No ratings yet
SET394 - AI - Lecture 06 - Adversarial Search
27 pages
Adversarial Search PPT
No ratings yet
Adversarial Search PPT
49 pages
CS335 Introduction To AI: Francisco Iacobelli June 25, 2015
No ratings yet
CS335 Introduction To AI: Francisco Iacobelli June 25, 2015
49 pages
M4 AI Ktustundets - in CS464 Artificial Intelligence
No ratings yet
M4 AI Ktustundets - in CS464 Artificial Intelligence
11 pages
Adversarial Search and Game Playing: Games
No ratings yet
Adversarial Search and Game Playing: Games
8 pages
Adversarial Search
No ratings yet
Adversarial Search
42 pages
Week-11 - Adversarial Search
No ratings yet
Week-11 - Adversarial Search
50 pages
AI Decode
No ratings yet
AI Decode
139 pages
Adversarial Search
No ratings yet
Adversarial Search
91 pages
Game Playing
No ratings yet
Game Playing
53 pages
Adveserial Search
No ratings yet
Adveserial Search
29 pages
Optimal Decision in Games
No ratings yet
Optimal Decision in Games
68 pages
Lecture 6 - minmax alpha beta
No ratings yet
Lecture 6 - minmax alpha beta
41 pages
Adversarial Search: Course: Artificial Intelligence Effective Period: September 2018
No ratings yet
Adversarial Search: Course: Artificial Intelligence Effective Period: September 2018
35 pages
Unit 4 JWFILES PDF
No ratings yet
Unit 4 JWFILES PDF
23 pages
Unit 4 Jwfiles
No ratings yet
Unit 4 Jwfiles
23 pages
06. Chapter. 06 - Adversarial Search and Games - No Embedded Videos
No ratings yet
06. Chapter. 06 - Adversarial Search and Games - No Embedded Videos
51 pages
Chapter 3 - Searching-Part 3
No ratings yet
Chapter 3 - Searching-Part 3
64 pages
6 games
No ratings yet
6 games
45 pages
AI Chapter05
No ratings yet
AI Chapter05
38 pages
Adversial Search
No ratings yet
Adversial Search
101 pages
Adversarial Search and Game Playing
No ratings yet
Adversarial Search and Game Playing
77 pages
AI-Lecture 6 (Adversarial Search)
No ratings yet
AI-Lecture 6 (Adversarial Search)
68 pages
Part4.Game playing
No ratings yet
Part4.Game playing
35 pages
Unit 3
No ratings yet
Unit 3
13 pages
CSC-411-AI-lec6-Adversarial Search
No ratings yet
CSC-411-AI-lec6-Adversarial Search
38 pages
MateriMinggu03 1920 AI ProblemSolving
No ratings yet
MateriMinggu03 1920 AI ProblemSolving
41 pages
18cs753 Ai Module 4
No ratings yet
18cs753 Ai Module 4
44 pages
Adversial Search
No ratings yet
Adversial Search
39 pages
2025-Lecture03-AdversarialSearch
No ratings yet
2025-Lecture03-AdversarialSearch
51 pages
18cs753 Ai Module 4
No ratings yet
18cs753 Ai Module 4
44 pages
AI Notes Unit II
No ratings yet
AI Notes Unit II
31 pages
L06 Adversarial Search
No ratings yet
L06 Adversarial Search
66 pages
3 GamePlaying - Minimax
No ratings yet
3 GamePlaying - Minimax
75 pages
Original Games and Novel Game Variations
From Everand
Original Games and Novel Game Variations
Stanley Korn
No ratings yet
Deep Learning Fundamentals in Python
From Everand
Deep Learning Fundamentals in Python
LazyProgrammer
4/5 (9)
Project Mini
No ratings yet
Project Mini
7 pages
1.1 Discrete Probability Spaces
No ratings yet
1.1 Discrete Probability Spaces
22 pages
C Ompiler Theory: (Intermediate C Ode Generation - Abstract S Yntax + 3 Address C Ode)
No ratings yet
C Ompiler Theory: (Intermediate C Ode Generation - Abstract S Yntax + 3 Address C Ode)
32 pages
Mining Infrequent Patter NS: Johan Bjarnle (Johbj551) Peter Zhu (Petzh912)
No ratings yet
Mining Infrequent Patter NS: Johan Bjarnle (Johbj551) Peter Zhu (Petzh912)
8 pages
Percentage Inspector Chalisa Compressed 20210720103043
No ratings yet
Percentage Inspector Chalisa Compressed 20210720103043
56 pages
Optical Fiber Loss and Attenuation - Fiber Optic Training & Tutorials - FAQ, Tips & News
No ratings yet
Optical Fiber Loss and Attenuation - Fiber Optic Training & Tutorials - FAQ, Tips & News
12 pages
Nuclear Fission and Fusion Lesson 16
No ratings yet
Nuclear Fission and Fusion Lesson 16
6 pages
2300 Discover Software Manual: 0022790 - REV - A 04/21/2020
No ratings yet
2300 Discover Software Manual: 0022790 - REV - A 04/21/2020
203 pages
CFA With Multiple Regression Pp. 15 E-JSBRB 2 Dogbe Zakari Pesse-Kumar 101 2019
No ratings yet
CFA With Multiple Regression Pp. 15 E-JSBRB 2 Dogbe Zakari Pesse-Kumar 101 2019
15 pages
Roles Assignment For Users
No ratings yet
Roles Assignment For Users
2 pages
Dokumen Tips - Lxv-Polish-Physics-Olympiad PL en
No ratings yet
Dokumen Tips - Lxv-Polish-Physics-Olympiad PL en
7 pages
Boost Converter Design DCM
No ratings yet
Boost Converter Design DCM
16 pages
M4 Technical
No ratings yet
M4 Technical
14 pages
Calculation Sheet: Perunding Nusajasa SDN BHD
No ratings yet
Calculation Sheet: Perunding Nusajasa SDN BHD
7 pages
Gear Tooth Profile (1)
100% (1)
Gear Tooth Profile (1)
7 pages
Combination Logic Circuits
100% (1)
Combination Logic Circuits
21 pages
Cryogenic Vacuum Insulation For Vessels and Piping: Blank Line !jlonk Line IJ/ank Line
No ratings yet
Cryogenic Vacuum Insulation For Vessels and Piping: Blank Line !jlonk Line IJ/ank Line
7 pages
Ardx 20 Flatmanual
No ratings yet
Ardx 20 Flatmanual
1 page
Oracle Question
No ratings yet
Oracle Question
5 pages
Forces and Matter
No ratings yet
Forces and Matter
14 pages
SYMMETRICAL FACE GUIDE
No ratings yet
SYMMETRICAL FACE GUIDE
7 pages
Evaluating Large Language Models Trained On Code
No ratings yet
Evaluating Large Language Models Trained On Code
35 pages
Physics 9 12
No ratings yet
Physics 9 12
179 pages
0 - Key Science Skills
No ratings yet
0 - Key Science Skills
81 pages
Norwegian Grammar Book
No ratings yet
Norwegian Grammar Book
11 pages
Memory Management in Unix Operating System Computer Science Essay
No ratings yet
Memory Management in Unix Operating System Computer Science Essay
4 pages
0607_s12_qp_6
No ratings yet
0607_s12_qp_6
12 pages
Bar Envelope Origami
No ratings yet
Bar Envelope Origami
3 pages
الاسم
No ratings yet
الاسم
18 pages
Start Smart: Preparing For Pre-Algebra For Pre-Algebra
No ratings yet
Start Smart: Preparing For Pre-Algebra For Pre-Algebra
22 pages
Financial Modelling Corporate PDF
100% (3)
Financial Modelling Corporate PDF
64 pages
Flame Sensor
No ratings yet
Flame Sensor
3 pages