SlideShare una empresa de Scribd logo
1 de 15
 Games are a traditional hallmark of
intelligence
 Games are easy to formalize
 Games can be a good model of real-world
competitive activities
◦ Military confrontations, negotiation, auctions, etc.
Deterministic Stochastic
Perfect information
(fully observable)
Imperfect
information
(partially
observable)
Deterministic Stochastic
Perfect information
(fully observable)
Chess, checkers, go Backgammon,
monopoly
Imperfect
information
(partially
observable)
Battleships Scrabble, poker,
bridge
 Players take turns
 Each game outcome or terminal state has a
utility for each player (e.g., 1 for win, 0 for
loss)
 The sum of both players’ utilities is a
constant
 We don’t know how the opponent will act
◦ The solution is not a fixed sequence of actions from
start state to goal state, but a strategy or policy (a
mapping from state to best move in that state)
 Efficiency is critical to playing well
◦ The time to make a move is limited
◦ The branching factor, search depth, and number of
terminal configurations are huge
 In chess, branching factor ≈ 35 and depth ≈ 100, giving a
search tree of 10154 nodes
◦ This rules out searching all the way to the end of the
game
 A game of tic-tac-toe between two players, “max”
and “min”
Terminal utilities (for MAX)
3 2 2
3
A two-ply game
 Minimax value of a node: the utility (for MAX) of being in the
corresponding state, assuming perfect play on both sides
 Minimax strategy: Choose the move that gives the best worst-
case payoff
3 2 2
3
 In Minimax Procedure, it seems as if the static
evaluator must be used on each leaf node.
Fortunately there is a procedure that reduces
both the tree branches that must be generated
and the number of evaluations. This procedure
is called Alpha Beta pruning which “prunes” the
tree branches thus reducing the number of
static evaluations
 It is possible to compute the exact minimax
decision without expanding every node in the
game tree
50 40 70 10 60 30 80 90 20 90 70 60
50
70 30 20
50 30 20
Maximizing Level
Minimizing Level
Maximizing Level
50
 Pruning does not affect final result
 Amount of pruning depends on move
ordering
◦ Should start with the “best” moves (highest-value
for MAX or lowest-value for MIN)
◦ For chess, can try captures first, then threats, then
forward moves, then backward moves
◦ Can also try to remember “killer moves” from other
branches of the tree
 Card games like bridge and poker
 Monte Carlo simulation: deal all the cards
randomly in the beginning and pretend the
game is fully observable
◦ “Averaging over clairvoyance”
◦ Problem: this strategy does not account for
bluffing, information gathering, etc.
 Computers are better than humans
◦ Checkers: solved in 2007
◦ Chess: IBM Deep Blue defeated Kasparov in 1997
 Computers are competitive with top human players
◦ Backgammon: TD-Gammon system used
reinforcement learning to learn a good evaluation
function
◦ Bridge: top systems use Monte Carlo simulation and
alpha-beta search
 Computers are not competitive
◦ Go: branching factor 361. Existing systems use
Monte Carlo simulation and pattern databases
 Ernst Zermelo (1912): Minimax algorithm
 Claude Shannon (1949): chess playing with
evaluation function, quiescence search,
selective search (paper)
 John McCarthy (1956): Alpha-beta search
 Arthur Samuel (1956): checkers program
that learns its own evaluation function by
playing against itself

Más contenido relacionado

La actualidad más candente

Mercurial studios odoricodante_assignment1_0315.pptx
Mercurial studios odoricodante_assignment1_0315.pptxMercurial studios odoricodante_assignment1_0315.pptx
Mercurial studios odoricodante_assignment1_0315.pptxDante Odorico
 
Introduction to Game Theory
Introduction to Game TheoryIntroduction to Game Theory
Introduction to Game TheoryCesar Sobrino
 
Maxim gummer-guildwars
Maxim gummer-guildwarsMaxim gummer-guildwars
Maxim gummer-guildwarsmaximgummer
 
پیش بینی زیان مالی
پیش بینی زیان مالیپیش بینی زیان مالی
پیش بینی زیان مالیNima afzalpour
 
Game and Search
Game and SearchGame and Search
Game and SearchAdri Jovin
 
Fearsome Thorny Pigeons Laz Matech_week1assignment_march2015
Fearsome Thorny Pigeons Laz Matech_week1assignment_march2015 Fearsome Thorny Pigeons Laz Matech_week1assignment_march2015
Fearsome Thorny Pigeons Laz Matech_week1assignment_march2015 Laz Matech
 
Slow play and stalling
Slow play and stallingSlow play and stalling
Slow play and stallingmlo825
 
BrainWars Playing Strategies for All BrainWars Players
BrainWars Playing Strategies for All BrainWars PlayersBrainWars Playing Strategies for All BrainWars Players
BrainWars Playing Strategies for All BrainWars Playersctyl
 

La actualidad más candente (18)

Minesweeper
MinesweeperMinesweeper
Minesweeper
 
Mine Sweeper Mastered
Mine Sweeper MasteredMine Sweeper Mastered
Mine Sweeper Mastered
 
Game theory
Game theoryGame theory
Game theory
 
Bomber
BomberBomber
Bomber
 
Dinesh kumar
Dinesh kumarDinesh kumar
Dinesh kumar
 
Game theory
Game theoryGame theory
Game theory
 
Game theory
Game theory Game theory
Game theory
 
Mercurial studios odoricodante_assignment1_0315.pptx
Mercurial studios odoricodante_assignment1_0315.pptxMercurial studios odoricodante_assignment1_0315.pptx
Mercurial studios odoricodante_assignment1_0315.pptx
 
Introduction to Game Theory
Introduction to Game TheoryIntroduction to Game Theory
Introduction to Game Theory
 
Standard Deviation1
Standard Deviation1Standard Deviation1
Standard Deviation1
 
Maxim gummer-guildwars
Maxim gummer-guildwarsMaxim gummer-guildwars
Maxim gummer-guildwars
 
Game Theory by Chadha
Game Theory by ChadhaGame Theory by Chadha
Game Theory by Chadha
 
پیش بینی زیان مالی
پیش بینی زیان مالیپیش بینی زیان مالی
پیش بینی زیان مالی
 
Game and Search
Game and SearchGame and Search
Game and Search
 
Fearsome Thorny Pigeons Laz Matech_week1assignment_march2015
Fearsome Thorny Pigeons Laz Matech_week1assignment_march2015 Fearsome Thorny Pigeons Laz Matech_week1assignment_march2015
Fearsome Thorny Pigeons Laz Matech_week1assignment_march2015
 
Slow play and stalling
Slow play and stallingSlow play and stalling
Slow play and stalling
 
BrainWars Playing Strategies for All BrainWars Players
BrainWars Playing Strategies for All BrainWars PlayersBrainWars Playing Strategies for All BrainWars Players
BrainWars Playing Strategies for All BrainWars Players
 
Game model
Game modelGame model
Game model
 

Similar a Lec08 adversarial search

AI3391 Artificial Intelligence UNIT III Notes_merged.pdf
AI3391 Artificial Intelligence UNIT III Notes_merged.pdfAI3391 Artificial Intelligence UNIT III Notes_merged.pdf
AI3391 Artificial Intelligence UNIT III Notes_merged.pdfAsst.prof M.Gokilavani
 
AI Strategies for Solving Poker Texas Hold'em
AI Strategies for Solving Poker Texas Hold'emAI Strategies for Solving Poker Texas Hold'em
AI Strategies for Solving Poker Texas Hold'emGiovanni Murru
 
Fighting online game cheating with cryptography
Fighting online game cheating with cryptographyFighting online game cheating with cryptography
Fighting online game cheating with cryptographySteven Davis
 
21CSC206T_UNIT3.pptx.pdf ARITIFICIAL INTELLIGENCE
21CSC206T_UNIT3.pptx.pdf ARITIFICIAL INTELLIGENCE21CSC206T_UNIT3.pptx.pdf ARITIFICIAL INTELLIGENCE
21CSC206T_UNIT3.pptx.pdf ARITIFICIAL INTELLIGENCEudayvanand
 
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptx
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptxAI3391 Artificial Intelligence Session 14 Adversarial Search .pptx
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptxAsst.prof M.Gokilavani
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial IntelligenceAltafur Rahman
 
Learning to Play Complex Games
Learning to Play Complex GamesLearning to Play Complex Games
Learning to Play Complex Gamesbutest
 
Unit_I_Introduction(Part_III).ppt
Unit_I_Introduction(Part_III).pptUnit_I_Introduction(Part_III).ppt
Unit_I_Introduction(Part_III).pptganesh15478
 
games, infosec, privacy, adversaries .ppt
games, infosec, privacy, adversaries .pptgames, infosec, privacy, adversaries .ppt
games, infosec, privacy, adversaries .pptMuhammadAbdullah311866
 
AlphaZero: A General Reinforcement Learning Algorithm that Masters Chess, Sho...
AlphaZero: A General Reinforcement Learning Algorithm that Masters Chess, Sho...AlphaZero: A General Reinforcement Learning Algorithm that Masters Chess, Sho...
AlphaZero: A General Reinforcement Learning Algorithm that Masters Chess, Sho...Joonhyung Lee
 
Game Tree ( Oyun Ağaçları )
Game Tree ( Oyun Ağaçları )Game Tree ( Oyun Ağaçları )
Game Tree ( Oyun Ağaçları )Alp Çoker
 

Similar a Lec08 adversarial search (20)

AI3391 Artificial Intelligence UNIT III Notes_merged.pdf
AI3391 Artificial Intelligence UNIT III Notes_merged.pdfAI3391 Artificial Intelligence UNIT III Notes_merged.pdf
AI3391 Artificial Intelligence UNIT III Notes_merged.pdf
 
Adversarial search
Adversarial searchAdversarial search
Adversarial search
 
AlphaGo
AlphaGoAlphaGo
AlphaGo
 
AI Strategies for Solving Poker Texas Hold'em
AI Strategies for Solving Poker Texas Hold'emAI Strategies for Solving Poker Texas Hold'em
AI Strategies for Solving Poker Texas Hold'em
 
Games.4
Games.4Games.4
Games.4
 
Fighting online game cheating with cryptography
Fighting online game cheating with cryptographyFighting online game cheating with cryptography
Fighting online game cheating with cryptography
 
Min-Max algorithm
Min-Max algorithmMin-Max algorithm
Min-Max algorithm
 
adversial search.pptx
adversial search.pptxadversial search.pptx
adversial search.pptx
 
21CSC206T_UNIT3.pptx.pdf ARITIFICIAL INTELLIGENCE
21CSC206T_UNIT3.pptx.pdf ARITIFICIAL INTELLIGENCE21CSC206T_UNIT3.pptx.pdf ARITIFICIAL INTELLIGENCE
21CSC206T_UNIT3.pptx.pdf ARITIFICIAL INTELLIGENCE
 
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptx
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptxAI3391 Artificial Intelligence Session 14 Adversarial Search .pptx
AI3391 Artificial Intelligence Session 14 Adversarial Search .pptx
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Learning to Play Complex Games
Learning to Play Complex GamesLearning to Play Complex Games
Learning to Play Complex Games
 
Unit_I_Introduction(Part_III).ppt
Unit_I_Introduction(Part_III).pptUnit_I_Introduction(Part_III).ppt
Unit_I_Introduction(Part_III).ppt
 
games, infosec, privacy, adversaries .ppt
games, infosec, privacy, adversaries .pptgames, infosec, privacy, adversaries .ppt
games, infosec, privacy, adversaries .ppt
 
AI Lesson 07
AI Lesson 07AI Lesson 07
AI Lesson 07
 
AlphaZero: A General Reinforcement Learning Algorithm that Masters Chess, Sho...
AlphaZero: A General Reinforcement Learning Algorithm that Masters Chess, Sho...AlphaZero: A General Reinforcement Learning Algorithm that Masters Chess, Sho...
AlphaZero: A General Reinforcement Learning Algorithm that Masters Chess, Sho...
 
Minimax
MinimaxMinimax
Minimax
 
Game Tree ( Oyun Ağaçları )
Game Tree ( Oyun Ağaçları )Game Tree ( Oyun Ağaçları )
Game Tree ( Oyun Ağaçları )
 
Minimax
MinimaxMinimax
Minimax
 
AI_unit3.pptx
AI_unit3.pptxAI_unit3.pptx
AI_unit3.pptx
 

Último

Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxolyaivanovalion
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 

Último (20)

Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service OnlineCALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
CALL ON ➥8923113531 🔝Call Girls Chinhat Lucknow best sexual service Online
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
VidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptxVidaXL dropshipping via API with DroFx.pptx
VidaXL dropshipping via API with DroFx.pptx
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 

Lec08 adversarial search

  • 1.
  • 2.  Games are a traditional hallmark of intelligence  Games are easy to formalize  Games can be a good model of real-world competitive activities ◦ Military confrontations, negotiation, auctions, etc.
  • 3. Deterministic Stochastic Perfect information (fully observable) Imperfect information (partially observable)
  • 4. Deterministic Stochastic Perfect information (fully observable) Chess, checkers, go Backgammon, monopoly Imperfect information (partially observable) Battleships Scrabble, poker, bridge
  • 5.  Players take turns  Each game outcome or terminal state has a utility for each player (e.g., 1 for win, 0 for loss)  The sum of both players’ utilities is a constant
  • 6.  We don’t know how the opponent will act ◦ The solution is not a fixed sequence of actions from start state to goal state, but a strategy or policy (a mapping from state to best move in that state)  Efficiency is critical to playing well ◦ The time to make a move is limited ◦ The branching factor, search depth, and number of terminal configurations are huge  In chess, branching factor ≈ 35 and depth ≈ 100, giving a search tree of 10154 nodes ◦ This rules out searching all the way to the end of the game
  • 7.  A game of tic-tac-toe between two players, “max” and “min”
  • 8. Terminal utilities (for MAX) 3 2 2 3 A two-ply game
  • 9.  Minimax value of a node: the utility (for MAX) of being in the corresponding state, assuming perfect play on both sides  Minimax strategy: Choose the move that gives the best worst- case payoff 3 2 2 3
  • 10.  In Minimax Procedure, it seems as if the static evaluator must be used on each leaf node. Fortunately there is a procedure that reduces both the tree branches that must be generated and the number of evaluations. This procedure is called Alpha Beta pruning which “prunes” the tree branches thus reducing the number of static evaluations  It is possible to compute the exact minimax decision without expanding every node in the game tree
  • 11. 50 40 70 10 60 30 80 90 20 90 70 60 50 70 30 20 50 30 20 Maximizing Level Minimizing Level Maximizing Level 50
  • 12.  Pruning does not affect final result  Amount of pruning depends on move ordering ◦ Should start with the “best” moves (highest-value for MAX or lowest-value for MIN) ◦ For chess, can try captures first, then threats, then forward moves, then backward moves ◦ Can also try to remember “killer moves” from other branches of the tree
  • 13.  Card games like bridge and poker  Monte Carlo simulation: deal all the cards randomly in the beginning and pretend the game is fully observable ◦ “Averaging over clairvoyance” ◦ Problem: this strategy does not account for bluffing, information gathering, etc.
  • 14.  Computers are better than humans ◦ Checkers: solved in 2007 ◦ Chess: IBM Deep Blue defeated Kasparov in 1997  Computers are competitive with top human players ◦ Backgammon: TD-Gammon system used reinforcement learning to learn a good evaluation function ◦ Bridge: top systems use Monte Carlo simulation and alpha-beta search  Computers are not competitive ◦ Go: branching factor 361. Existing systems use Monte Carlo simulation and pattern databases
  • 15.  Ernst Zermelo (1912): Minimax algorithm  Claude Shannon (1949): chess playing with evaluation function, quiescence search, selective search (paper)  John McCarthy (1956): Alpha-beta search  Arthur Samuel (1956): checkers program that learns its own evaluation function by playing against itself