Subgame perfect equilibrium

Subgame Perfect Equilibrium
Subgame Perfect Equilibrium
A Extensive form games
Example	Ultimatum game

In

dynamic games. A strategy profile is a subgame perfect equilibrium if it represents a Nash equilibrium of every subgame of the original game. Informally, this means that at any point in the game, the players' behavior from that point onward should represent a Nash equilibrium of the continuation game (i.e. of the subgame), no matter what happened before. Every finite extensive game with perfect recall has a subgame perfect equilibrium.^[1] Perfect recall is a term introduced by Harold W. Kuhn in 1953 and "equivalent to the assertion that each player is allowed by the rules of the game to remember everything he knew at previous moves and all of his choices at those moves".^[2]

A common method for determining subgame perfect equilibria in the case of a finite game is

incomplete information because this entails cutting through non-singleton information sets

.

A subgame perfect equilibrium necessarily satisfies the one-shot deviation principle.

The set of subgame perfect equilibria for a given game is always a subset of the set of Nash equilibria for that game. In some cases the sets can be identical.

The ultimatum game provides an intuitive example of a game with fewer subgame perfect equilibria than Nash equilibria.

Example

Determining the subgame perfect equilibrium by using backward induction is shown below in Figure 1. Strategies for Player 1 are given by {Up, Uq, Dp, Dq}, whereas Player 2 has the strategies among {TL, TR, BL, BR}. There are 4 subgames in this example, with 3 proper subgames.

Using the backward induction, the players will take the following actions for each subgame:

Subgame for actions p and q: Player 1 will take action p with payoff (3, 3) to maximize Player 1's payoff, so the payoff for action L becomes (3,3).
Subgame for actions L and R: Player 2 will take action L for 3 > 2, so the payoff for action D becomes (3, 3).
Subgame for actions T and B: Player 2 will take action T to maximize Player 2's payoff, so the payoff for action U becomes (1, 4).
Subgame for actions U and D: Player 1 will take action D to maximize Player 1's payoff.

Thus, the subgame perfect equilibrium is {Dp, TL} with the payoff (3, 3).

An extensive-form game with incomplete information is presented below in Figure 2. Note that the node for Player 1 with actions A and B, and all succeeding actions is a subgame. Player 2's nodes are not a subgame as they are part of the same information set.

The first normal-form game is the normal form representation of the whole extensive-form game. Based on the provided information, (UA, X), (DA, Y), and (DB, Y) are all Nash equilibria for the entire game.

The second normal-form game is the normal form representation of the subgame starting from Player 1's second node with actions A and B. For the second normal-form game, the Nash equilibrium of the subgame is (A, X).

For the entire game Nash equilibria (DA, Y) and (DB, Y) are not subgame perfect equilibria because the move of Player 2 does not constitute a Nash equilibrium. The Nash equilibrium (UA, X) is subgame perfect because it incorporates the subgame Nash equilibrium (A, X) as part of its strategy.^[3]

To solve this game, first find the Nash Equilibria by mutual best response of Subgame 1. Then use backwards induction and plug in (A,X) → (3,4) so that (3,4) become the payoffs for Subgame 2.^[3]

The dashed line indicates that player 2 does not know whether player 1 will play A or B in a simultaneous game.

Player 1 chooses U rather than D because 3 > 2 for Player 1's payoff. The resulting equilibrium is (A, X) → (3,4).

Thus, the subgame perfect equilibrium through backwards induction is (UA, X) with the payoff (3, 4).

Repeated games

For finitely repeated games, if a stage game has only one unique Nash equilibrium, the subgame perfect equilibrium is to play without considering past actions, treating the current subgame as a one-shot game. An example of this is a finitely repeated Prisoner's dilemma game. The Prisoner's dilemma gets its name from a situation that contains two guilty culprits. When they are interrogated, they have the option to stay quiet or defect. If both culprits stay quiet, they both serve a short sentence. If both defect, they both serve a moderate sentence. If they choose opposite options, then the culprit that defects is free and the culprit who stays quiet serves a long sentence. Ultimately, using backward induction, the last subgame in a finitely repeated Prisoner's dilemma requires players to play the unique Nash equilibrium (both players defecting). Because of this, all games prior to the last subgame will also play the Nash equilibrium to maximize their single-period payoffs.^[4] If a stage-game in a finitely repeated game has multiple Nash equilibria, subgame perfect equilibria can be constructed to play non-stage-game Nash equilibrium actions, through a "carrot and stick" structure. One player can use the one stage-game Nash equilibrium to incentivize playing the non-Nash equilibrium action, while using a stage-game Nash equilibrium with lower payoff to the other player if they choose to defect.^[5]

Finding subgame-perfect equilibria

imperfect information

.

The subgame-perfect Nash equilibrium is normally deduced by "

Go

has such an optimum strategy for all players. The problem of the relationship between subgame perfection and backward induction was settled by Kaminski (2019), who proved that a generalized procedure of backward induction produces all subgame perfect equilibria in games that may have infinite length, infinite actions as each information set, and imperfect information if a condition of final support is satisfied.

The interesting aspect of the word "credible" in the preceding paragraph is that taken as a whole (disregarding the irreversibility of reaching sub-games) strategies exist which are superior to subgame perfect strategies, but which are not credible in the sense that a threat to carry them out will harm the player making the threat and prevent that combination of strategies. For instance in the game of "chicken" if one player has the option of ripping the steering wheel from their car they should always take it because it leads to a "sub game" in which their rational opponent is precluded from doing the same thing (and killing them both). The wheel-ripper will always win the game (making his opponent swerve away), and the opponent's threat to suicidally follow suit is not credible.

References

^ ^a ^b Osborne, M. J. (2004). An Introduction to Game Theory. Oxford University Press.
ISBN 978-1-4008-8197-0
.

^
OCLC 842323069.{{cite book}}: CS1 maint: location missing publisher (link
)

^ Yildiz, Muhamet (2012). "12 Repeated Games". 14.12 Economic Applications of Game Theory. Massachusetts Institute of Technology: MIT OpenCourseWare. Retrieved April 27, 2021.

OCLC 911616270.{{cite book}}: CS1 maint: location missing publisher (link
)

External links

Selten, R. (1965). Spieltheoretische behandlung eines oligopolmodells mit nachfrageträgheit. Zeitschrift für die gesamte Staatswissenschaft/Journal of Institutional and Theoretical Economics, (H. 2), 301-324, 667-689. [in German - part 1, part 2]

Example of Extensive Form Games with imperfect information

Java applet to find a subgame perfect Nash Equilibrium solution for an extensive form game from gametheory.net.

Java applet to find a subgame perfect Nash Equilibrium solution for an extensive form game from gametheory.net.

Kaminski, M.M. Generalized Backward Induction: Justification for a Folk Algorithm. Games 2019, 10, 34.

v
t
e
Topics of game theory
Definitions

Congestion game

Cooperative game

Determinacy

Escalation of commitment

Extensive-form game

First-player and second-player win

Game complexity

Graphical game

Hierarchy of beliefs

Information set

Normal-form game

Preference

Sequential game

Simultaneous game

Simultaneous action selection

Solved game

Succinct game

Mechanism design

Equilibrium
concepts

Bayes correlated equilibrium

Bayesian Nash equilibrium

Berge equilibrium

Core

Correlated equilibrium

Coalition-proof Nash equilibrium

Epsilon-equilibrium

Evolutionarily stable strategy

Gibbs equilibrium

Mertens-stable equilibrium

Markov perfect equilibrium

Nash equilibrium

Pareto efficiency

Perfect Bayesian equilibrium

Proper equilibrium

Quantal response equilibrium

Quasi-perfect equilibrium

Risk dominance

Satisfaction equilibrium

Self-confirming equilibrium

Sequential equilibrium

Shapley value

Strong Nash equilibrium

Subgame perfection

Trembling hand equilibrium

Strategies

Appeasement

Backward induction

Bid shading

Collusion

Cheap talk

De-escalation

Deterrence

Escalation

Forward induction

Grim trigger

Markov strategy

Dominant strategies

Pure strategy

Mixed strategy

Strategy-stealing argument

Tit for tat

Classes
of games

Auction

Bargaining problem

Global game

Intransitive game

Mean-field game

n-player game

Perfect information

Large Poisson game

Potential game

Repeated game

Screening game

Signaling game

Strictly determined game

Stochastic game

Symmetric game

Zero-sum game

Games

Go

Chess

Infinite chess

Checkers

All-pay auction

Prisoner's dilemma

Gift-exchange game

Optional prisoner's dilemma

Traveler's dilemma

Coordination game

Chicken

Centipede game

Lewis signaling game

Volunteer's dilemma

Dollar auction

Battle of the sexes

Stag hunt

Matching pennies

Ultimatum game

Rock paper scissors

Pirate game

Dictator game

Public goods game

Blotto game

War of attrition

El Farol Bar problem

Fair division

Fair cake-cutting

Bertrand competition

Cournot competition

Stackelberg competition

Deadlock

Diner's dilemma

Guess 2/3 of the average

Kuhn poker

Nash bargaining game

Induction puzzles

Trust game

Princess and monster game

Rendezvous problem

Theorems

Aumann's agreement theorem

Folk theorem

Minimax theorem

Nash's theorem

Negamax theorem

Purification theorem

Revelation principle

Sprague–Grundy theorem

Zermelo's theorem

Key
figures

Albert W. Tucker

Amos Tversky

Antoine Augustin Cournot

Ariel Rubinstein

Claude Shannon

Daniel Kahneman

David K. Levine

David M. Kreps

Donald B. Gillies

Drew Fudenberg

Eric Maskin

Harold W. Kuhn

Herbert Simon

Hervé Moulin

John Conway

Jean Tirole

Jean-François Mertens

Jennifer Tour Chayes

John Harsanyi

John Maynard Smith

John Nash

John von Neumann

Kenneth Arrow

Kenneth Binmore

Leonid Hurwicz

Lloyd Shapley

Melvin Dresher

Merrill M. Flood

Olga Bondareva

Oskar Morgenstern

Paul Milgrom

Peyton Young

Reinhard Selten

Robert Axelrod

Robert Aumann

Robert B. Wilson

Roger Myerson

Samuel Bowles

Suzanne Scotchmer

Thomas Schelling

William Vickrey

Miscellaneous

Alpha–beta pruning

Bounded rationality

Combinatorial game theory

Confrontation analysis

Coopetition

Evolutionary game theory

Glossary of game theory

List of game theorists

List of games in game theory

No-win situation

Topological game

Tragedy of the commons

Retrieved from "https://en.wikipedia.org/w/index.php?title=Subgame_perfect_equilibrium&oldid=1174407539"

[Osborne2004-1] Osborne, M. J. (2004). An Introduction to Game Theory. Oxford University Press.

[2] ISBN 978-1-4008-8197-0
.

[:0-3] 
OCLC 842323069.{{cite book}}: CS1 maint: location missing publisher (link
)

[4] Yildiz, Muhamet (2012). "12 Repeated Games". 14.12 Economic Applications of Game Theory. Massachusetts Institute of Technology: MIT OpenCourseWare. Retrieved April 27, 2021.

[5] OCLC 911616270.{{cite book}}: CS1 maint: location missing publisher (link
)

[1]

[2]

[3]

[4]

[5]

Example

Repeated games

Finding subgame-perfect equilibria

See also

References

External links