Backward Induction: Solving Games by Working Backwards

You’re playing chess. How do you decide your next move? Master players don’t just think one move ahead — they think many moves ahead, anticipating their opponent’s responses, then their own responses to those responses, and so on.

This is backward induction — one of game theory’s most powerful techniques. Instead of reasoning forward from the start, you reason backward from the end.

And it reveals some surprising truths about strategy.

The Core Idea

Backward induction is a method for solving sequential games (games where players move in turns) by:

Starting at the final decision point
Determining the optimal choice at that point
Moving backward to the previous decision point
Repeating until you reach the beginning

The result: a complete strategy for every player that specifies optimal play at every possible point in the game.

%%{init: {'theme':'dark', 'themeVariables': {'primaryTextColor':'#fff','secondaryTextColor':'#fff','tertiaryTextColor':'#fff','textColor':'#fff','nodeTextColor':'#fff'}}}%% graph TD A[Start: Work backward] --> B[Final move: What's optimal?] B --> C[Second-to-last move: Given final play, what's optimal?] C --> D[Third-to-last move: Given subsequent play, what's optimal?] D --> E[...] E --> F[First move: Optimal choice given all future responses] style B fill:#2d3748,stroke:#4299e1,stroke-width:3px style F fill:#2d3748,stroke:#48bb78,stroke-width:3px

A Simple Example: The Take-Away Game

Two players alternate taking 1, 2, or 3 coins from a pile of 10 coins. The player who takes the last coin wins.

Who wins with optimal play, and what’s the strategy?

Let’s use backward induction.

Step 1: Analyze the End

If 1-3 coins remain, the current player takes them all and wins.

If 4 coins remain, the current player loses:

Take 1 → 3 remain → opponent wins
Take 2 → 2 remain → opponent wins
Take 3 → 1 remains → opponent wins

So 4 coins = losing position.

Step 2: Work Backwards

If 5-7 coins remain, the current player can force the opponent into the 4-coin losing position:

From 5: take 1 → leave 4 → opponent loses
From 6: take 2 → leave 4 → opponent loses
From 7: take 3 → leave 4 → opponent loses

So 5, 6, or 7 coins = winning positions.

If 8 coins remain, the current player loses (opponent gets to 5-7 and wins).

Step 3: Pattern Emerges

Positions: W = Win, L = Lose

Coins	1	2	3	4	5	6	7	8	9	10
Result	W	W	W	L	W	W	W	L	W	W

Pattern: Positions that are multiples of 4 are losing positions!

Optimal strategy from 10 coins: Player 1 takes 2, leaving 8 (a losing position for Player 2). Player 1 wins with optimal play.

%%{init: {'theme':'dark', 'themeVariables': {'primaryTextColor':'#fff','secondaryTextColor':'#fff','tertiaryTextColor':'#fff','textColor':'#fff','nodeTextColor':'#fff'}}}%% graph TD A[10 coins: P1] -->|Take 2| B[8 coins: P2 LOSES] B -->|Take 1| C[7: P1 takes 3] B -->|Take 2| D[6: P1 takes 2] B -->|Take 3| E[5: P1 takes 1] C --> F[4: P2 loses] D --> F E --> F F -->|Any move| G[P1 takes all remaining] G --> H[P1 WINS] style A fill:#2d3748,stroke:#48bb78,stroke-width:3px style B fill:#742a2a,stroke:#f56565,stroke-width:3px style H fill:#2d3748,stroke:#4299e1,stroke-width:3px

Formal Game Trees

Sequential games are represented as game trees (also called extensive form).

Each node represents a decision point. Each edge represents an action. Terminal nodes show final payoffs.

Example: The Ultimatum Game (Simplified)

Player 1 proposes how to split $10. Player 2 accepts or rejects. If rejected, both get $0.

Let’s say Player 1 can only propose (5,5) or (8,2).

%%{init: {'theme':'dark', 'themeVariables': {'primaryTextColor':'#fff','secondaryTextColor':'#fff','tertiaryTextColor':'#fff','textColor':'#fff','nodeTextColor':'#fff'}}}%% graph TD A[Player 1 decides] -->|Offer 5,5| B[Player 2 decides] A -->|Offer 8,2| C[Player 2 decides] B -->|Accept| D[Payoffs: 5, 5] B -->|Reject| E[Payoffs: 0, 0] C -->|Accept| F[Payoffs: 8, 2] C -->|Reject| G[Payoffs: 0, 0] style A fill:#2d3748,stroke:#4299e1,stroke-width:3px style D fill:#2d3748,stroke:#48bb78,stroke-width:2px style F fill:#2d3748,stroke:#48bb78,stroke-width:2px

Backward induction:

At each of Player 2’s decision nodes, compare payoffs:
- After (5,5) offer: Accept gives 5, Reject gives 0 → Accept
- After (8,2) offer: Accept gives 2, Reject gives 0 → Accept
At Player 1’s decision node, knowing Player 2 accepts both:
- Offer (5,5) → get 5
- Offer (8,2) → get 8
- Choose (8,2)

Backward induction prediction: Player 1 offers (8,2), Player 2 accepts.

(But as we’ll see in the next post, humans often reject unfair offers — a challenge to rational choice theory!)

Subgame Perfect Equilibrium

Backward induction finds the subgame perfect Nash equilibrium (SPNE).

Nash Equilibrium: Strategies are best responses to each other.

Subgame Perfect: Strategies are optimal at every possible point in the game, even off the equilibrium path.

Why It Matters: Eliminating Non-Credible Threats

Consider this game:

Player 1 chooses: Share ($5 each) or Grab ($8 for P1, then P2 decides)
If P1 grabs, P2 chooses: Accept ($8, $2) or Burn (both get $0)

Is “I’ll burn it if you grab” a credible threat?

%%{init: {'theme':'dark', 'themeVariables': {'primaryTextColor':'#fff','secondaryTextColor':'#fff','tertiaryTextColor':'#fff','textColor':'#fff','nodeTextColor':'#fff'}}}%% graph TD A[Player 1] -->|Share| B[5, 5] A -->|Grab| C[Player 2] C -->|Accept| D[8, 2] C -->|Burn| E[0, 0] F[P2's threat: I'll burn it!] F -.->|Is this credible?| C style A fill:#2d3748,stroke:#4299e1,stroke-width:3px style D fill:#2d3748,stroke:#48bb78,stroke-width:2px style E fill:#742a2a,stroke:#f56565,stroke-width:3px

Backward induction:

At P2’s decision: Accept gives 2, Burn gives 0 → Accept

At P1’s decision: Share gives 5, Grab gives 8 → Grab

SPNE: (Grab, Accept)

The threat to burn is not credible because when the moment comes, P2 prefers to accept. Player 1 should ignore the threat.

Lesson: Only credible threats affect optimal play. Backward induction reveals which threats are credible.

Classic Application: Stackelberg Competition

Two firms choose production quantities. Firm 1 moves first, Firm 2 observes and responds.

Market price: P = 100 - Q (where Q = q₁ + q₂)

Costs: Zero (for simplicity)

Firm profits:

Firm 1: π₁ = q₁(100 - q₁ - q₂)
Firm 2: π₂ = q₂(100 - q₁ - q₂)

Backward Induction Solution:

Step 1: Firm 2’s optimal response to q₁

Maximize π₂ = q₂(100 - q₁ - q₂)

Take derivative: 100 - q₁ - 2q₂ = 0

Firm 2’s best response: q₂ = (100 - q₁)/2

Step 2: Firm 1’s optimal choice, anticipating Firm 2’s response

Substitute into π₁: π₁ = q₁(100 - q₁ - (100 - q₁)/2) = q₁(50 - q₁/2)

Maximize: 50 - q₁ = 0

Firm 1’s optimal quantity: q₁ = 50

Step 3: Firm 2’s response

q₂ = (100 - 50)/2 = 25

Outcome:

Firm 1 produces 50, earns 50 × 50 = $2500
Firm 2 produces 25, earns 25 × 25 = $625
First-mover advantage!

%%{init: {'theme':'dark', 'themeVariables': {'primaryTextColor':'#fff','secondaryTextColor':'#fff','tertiaryTextColor':'#fff','textColor':'#fff','nodeTextColor':'#fff'}}}%% graph LR A[Firm 1 chooses q₁] --> B[Firm 2 observes] B --> C[Firm 2 chooses q₂ = 100-q₁/2] C --> D[Market clears] E[Backward Induction] --> F[Firm 1 anticipates Firm 2's response] F --> G[Firm 1 chooses q₁ = 50] G --> H[Firm 2 chooses q₂ = 25] style A fill:#2d3748,stroke:#48bb78,stroke-width:3px style H fill:#2d3748,stroke:#4299e1,stroke-width:2px

The Centipede Game: A Paradox

Two players alternate choosing to Continue or Stop. The pot grows each round, but stopping gives you a slightly larger share.

Payoffs increase as the game continues, but the player who stops gets slightly more than if the other player stops next.

%%{init: {'theme':'dark', 'themeVariables': {'primaryTextColor':'#fff','secondaryTextColor':'#fff','tertiaryTextColor':'#fff','textColor':'#fff','nodeTextColor':'#fff'}}}%% graph LR A[P1: 1,0] -->|Continue| B[P2: 0,2] B -->|Continue| C[P1: 3,1] C -->|Continue| D[P2: 2,4] D -->|Continue| E[P1: 5,3] E -->|Continue| F[6,6] A -->|Stop| A1[1,0] B -->|Stop| B1[0,2] C -->|Stop| C1[3,1] D -->|Stop| D1[2,4] E -->|Stop| E1[5,3] style F fill:#2d3748,stroke:#48bb78,stroke-width:3px

Backward induction prediction:

At the last decision (P1 at round 5): Stop and get 5 vs Continue and get 6. But wait — by stopping, P1 gets 5, P2 gets 3. By continuing, both get 6. If P1 is selfish about getting more than P2, they stop.

Let me reconsider the payoffs more carefully. In the centipede game, the typical structure is:

Stopping gives you more NOW than continuing and letting the opponent stop next
But continuing and having the opponent also continue gives both more

Simplified version:

Round 1: P1 can stop (1, 0) or continue Round 2: P2 can stop (0, 3) or continue Round 3: P1 can stop (4, 1) or continue Round 4: Both get (2, 5) if reached

Backward induction:

At round 3: P1 stops (gets 4 instead of 2) → (4, 1)

At round 2: P2 stops (gets 3 instead of 1) → (0, 3)

At round 1: P1 stops (gets 1 instead of 0) → (1, 0)

Backward induction predicts: P1 stops immediately at round 1!

In experiments: People usually continue several rounds. Why?

Trust that opponent will reciprocate
Social preferences (fairness, cooperation)
Bounded rationality
Reputation concerns

This is a paradox of backward induction: the logic is ironclad, but it predicts behavior that seems absurd and empirically doesn’t happen.

Limitations of Backward Induction

1. Perfect Rationality Assumption

Backward induction assumes all players are perfectly rational and know all others are perfectly rational, ad infinitum. But humans aren’t perfect reasoners.

2. Complete Information Assumption

You need to know the entire game tree and all payoffs. In reality, there’s often uncertainty.

3. Commitment Problems

Sometimes you want to credibly commit to an irrational action. (More on this in game theory applications to negotiation.)

4. Trembling Hand

What if players sometimes make mistakes? This can break backward induction logic (see the centipede game).

When Backward Induction Works Best

Backward induction is most reliable when:

The game tree is small and tractable
Payoffs are well-defined and known
Rationality is reasonable: contexts like chess, business strategy, optimal stopping problems
Stakes are high enough that players think carefully
Players have experience with the game structure

Real-World Applications

1. Chess and Go

Computer chess engines use backward induction (via algorithms like minimax) to search game trees and find optimal moves.

2. Business Strategy

Should you enter a market, anticipating incumbent response?
Use backward induction to predict whether they’ll fight or accommodate

3. Negotiations

Work backward from the final offer to determine opening positions
Anticipate counteroffers and concessions

4. Political Strategy

Legislative voting: predict how amendments will affect final votes
Campaign strategy: anticipate opponent responses to your moves

5. Legal Proceedings

Litigation vs settlement decisions
Anticipate jury verdicts and appellate outcomes

Practice Problem: The Pirate Game

Five pirates must divide 100 gold coins. They vote on proposals in order (Pirate 1, then 2, etc.). If a proposal gets a majority (including the proposer), it passes. Otherwise, the proposer is thrown overboard, and the next pirate proposes.

Pirates are:

Rational and greedy (want maximum coins)
Bloodthirsty (prefer throwing pirates overboard if indifferent)
Self-preserving (prefer survival above all)

Question: What should Pirate 1 propose?

Solution (using backward induction)

If only Pirate 5 remains: Gets all 100 (majority of 1)

If Pirates 4 & 5 remain:

Pirate 4 proposes (100, 0): gives nothing to 5
Pirate 4’s vote (1 of 2) is enough for a majority
Pirate 4 gets 100, Pirate 5 gets 0

If Pirates 3, 4, 5 remain:

Pirate 4 knows they get 100 if they can reject
Pirate 5 knows they get 0 if they reject
Pirate 3 proposes (99, 0, 1)
Pirate 5 accepts (1 > 0 they’d get if 3 is thrown overboard)
2 votes (3 & 5) → passes

If Pirates 2, 3, 4, 5 remain:

Pirate 3 knows they get 99 if they reject
Pirate 4 knows they get 0 if they reject
Pirate 2 proposes (99, 0, 1, 0)
Pirate 4 accepts (1 > 0)
2 votes → passes

If all 5 pirates:

Pirate 2 gets 99 if they reject
Pirates 4 gets 0 if they reject
Pirate 5 gets 1 if they reject
Pirate 1 proposes (98, 0, 1, 0, 1)
Pirates 3 & 5 accept
3 votes → passes

Answer: Pirate 1 should propose (98, 0, 1, 0, 1) — keeping 98 for themselves, giving 1 each to Pirates 3 and 5.

Despite being the most vulnerable, Pirate 1 gets the most gold through backward induction!

Key Takeaways

Backward induction solves sequential games by reasoning from the end to the beginning
Start at final decision nodes and work backward to determine optimal strategies
Finds subgame perfect equilibrium — strategies that are optimal at every point
Eliminates non-credible threats — only credible commitments matter
First-mover advantage often exists (Stackelberg competition)
Paradoxes arise when perfect rationality is assumed (centipede game)
Most powerful for: chess, business strategy, negotiations, sequential decision-making

What’s Next?

Backward induction assumes players are perfectly rational. But what if they’re not? What if emotions, fairness, and social preferences matter?

Next, we’ll explore The Ultimatum Game — a simple game that reveals the limits of rational choice theory and shows that humans aren’t always the selfish calculators economists assume.

This post is part of the Game Theory Series, where we explore the mathematics of strategic decision-making.

Backward Induction: Solving Games by Working Backwards

Backward Induction: Solving Games by Working Backwards

The Core Idea

A Simple Example: The Take-Away Game

Step 1: Analyze the End

Step 2: Work Backwards

Step 3: Pattern Emerges

Formal Game Trees

Example: The Ultimatum Game (Simplified)

Subgame Perfect Equilibrium

Why It Matters: Eliminating Non-Credible Threats

Classic Application: Stackelberg Competition

Backward Induction Solution:

The Centipede Game: A Paradox

Limitations of Backward Induction

1. Perfect Rationality Assumption

2. Complete Information Assumption

3. Commitment Problems

4. Trembling Hand

When Backward Induction Works Best

Real-World Applications

1. Chess and Go

2. Business Strategy

3. Negotiations

4. Political Strategy

5. Legal Proceedings

Practice Problem: The Pirate Game

Key Takeaways

What’s Next?

AI Assistant

Hi! I'm your AI assistant

Backward Induction: Solving Games by Working Backwards#

The Core Idea#

A Simple Example: The Take-Away Game#

Step 1: Analyze the End#

Step 2: Work Backwards#

Step 3: Pattern Emerges#

Formal Game Trees#

Example: The Ultimatum Game (Simplified)#

Subgame Perfect Equilibrium#

Why It Matters: Eliminating Non-Credible Threats#

Classic Application: Stackelberg Competition#

Backward Induction Solution:#

The Centipede Game: A Paradox#

Limitations of Backward Induction#

1. Perfect Rationality Assumption#

2. Complete Information Assumption#

3. Commitment Problems#

4. Trembling Hand#

When Backward Induction Works Best#

Real-World Applications#

1. Chess and Go#

2. Business Strategy#

3. Negotiations#

4. Political Strategy#

5. Legal Proceedings#

Practice Problem: The Pirate Game#

Key Takeaways#

What’s Next?#

Backward Induction: Solving Games by Working Backwards

The Core Idea

A Simple Example: The Take-Away Game

Step 1: Analyze the End

Step 2: Work Backwards

Step 3: Pattern Emerges

Formal Game Trees

Example: The Ultimatum Game (Simplified)

Subgame Perfect Equilibrium

Why It Matters: Eliminating Non-Credible Threats

Classic Application: Stackelberg Competition

Backward Induction Solution:

The Centipede Game: A Paradox

Limitations of Backward Induction

1. Perfect Rationality Assumption

2. Complete Information Assumption

3. Commitment Problems

4. Trembling Hand

When Backward Induction Works Best

Real-World Applications

1. Chess and Go

2. Business Strategy

3. Negotiations

4. Political Strategy

5. Legal Proceedings

Practice Problem: The Pirate Game

Key Takeaways

What’s Next?