The prisoner's dilemma illustrates how individual rationality can lead to collectively worse outcomes

Image: David McSpadden from Daly City, United States, CC BY 2.0, via Wikimedia Commons

Prisoner's dilemma

The prisoner's dilemma is a paradox in which two rational agents each choose to defect, because defecting is individually better whatever the other does, and both end up worse off than if they had cooperated. It highlights the conflict between individual rationality and collective well-being.

Example

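In the classic scenario, two prisoners are questioned separately. If both betray each other, each receives a moderate sentence; if both stay silent (cooperate), each receives a lighter one. The temptation comes from the mixed case: a prisoner who betrays a silent partner gets the lightest sentence of all while the partner gets the heaviest, so betraying is the better choice whatever the other prisoner does.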
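The scenario can be checked with a small payoff table. The sketch below is illustrative only: the sentence lengths (0, 1, 2, and 3 years) and the names `SENTENCES`, `best_response_for_a`, and `dominant_choice_for_a` are assumptions chosen for this example, not values from the text.

```python
# Sentences in years (lower is better). Illustrative numbers, assumed for this sketch.
# Key: (choice of A, choice of B) -> (years for A, years for B)
SENTENCES = {
    ("silent", "silent"): (1, 1),
    ("silent", "betray"): (3, 0),
    ("betray", "silent"): (0, 3),
    ("betray", "betray"): (2, 2),
}
CHOICES = ("silent", "betray")


def best_response_for_a(choice_b):
    """Return the choice that minimizes A's own sentence, given B's choice."""
    return min(CHOICES, key=lambda a: SENTENCES[(a, choice_b)][0])


def dominant_choice_for_a():
    """Return A's choice if it is the best response to every choice of B, else None."""
    responses = {best_response_for_a(b) for b in CHOICES}
    return responses.pop() if len(responses) == 1 else None


dominant = dominant_choice_for_a()
total_if_both_betray = sum(SENTENCES[("betray", "betray")])
total_if_both_silent = sum(SENTENCES[("silent", "silent")])
print(dominant, total_if_both_betray, total_if_both_silent)
```

With these assumed numbers, betraying is the best response to either choice, yet mutual betrayal costs the pair more total years than mutual silence. The table is symmetric, so the same reasoning applies to prisoner B.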

Understanding this paradox is crucial for designing systems and policies that encourage cooperation and improve collective outcomes.

Related concepts

Dominant strategy: a strategy that maximizes a player's payoff regardless of what the other players do

Mechanism design: designs the rules of a game so that rational, self-interested agents produce the intended outcome

Minimax theorem: in every finite two-player zero-sum game there is a saddle point in mixed strategies, where the maximin and minimax values coincide
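A rough sketch of what a saddle point means in practice, assuming a payoff matrix where entry [i][j] is what the row player wins and the column player loses. The function names `pure_saddle_point` and `expected_payoffs` and both example matrices are hypothetical, introduced only for illustration. The first function checks only pure strategies; the theorem itself concerns mixed strategies, which the second function hints at.

```python
def pure_saddle_point(payoffs):
    """Return (row, col, value) if maximin equals minimax in pure strategies, else None."""
    row_mins = [min(row) for row in payoffs]          # worst case for each row choice
    col_maxes = [max(col) for col in zip(*payoffs)]   # worst case for each column choice
    maximin = max(row_mins)
    minimax = min(col_maxes)
    if maximin != minimax:
        return None
    return row_mins.index(maximin), col_maxes.index(minimax), maximin


def expected_payoffs(payoffs, row_mix):
    """Expected payoff to the row player against each pure column, given a row mix."""
    return [sum(p * row[j] for p, row in zip(row_mix, payoffs))
            for j in range(len(payoffs[0]))]


game_with_saddle = [[3, 1, 4], [2, 0, 1], [5, 2, 3]]
matching_pennies = [[1, -1], [-1, 1]]

print(pure_saddle_point(game_with_saddle))
print(pure_saddle_point(matching_pennies))
print(expected_payoffs(matching_pennies, [0.5, 0.5]))
```

Matching pennies has no pure saddle point, but mixing 50/50 gives the row player the same expected payoff against either column, which is the kind of mixed-strategy guarantee the theorem describes.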

Reinforcement learning from human feedback

RLHF optimizes a policy against a reward model trained on human preference pairs
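To make "trained on human preference pairs" concrete, here is a minimal sketch of a pairwise loss commonly used for reward models: the negative log-sigmoid of the reward margin (a Bradley-Terry style objective). This is an assumption about one typical setup, not a description of any particular system, and it covers only the reward-model step, not the policy optimization that follows. The name `preference_loss` is hypothetical.

```python
import math


def preference_loss(reward_chosen, reward_rejected):
    """Pairwise loss -log(sigmoid(r_chosen - r_rejected)) for one preference pair."""
    margin = reward_chosen - reward_rejected
    # -log(sigmoid(m)) equals log(1 + exp(-m)); computed in a numerically stable form.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


# The loss is small when the human-preferred response scores higher, large otherwise.
print(preference_loss(2.0, 0.0))
print(preference_loss(0.0, 2.0))
```

Minimizing this loss over many pairs pushes the reward model to score preferred responses above rejected ones.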

Knowledge distillation

Knowledge distillation transfers knowledge from a large teacher model to a smaller student model, aiming to retain as much of the teacher's performance as possible
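One common way to do the transfer is to train the smaller model to match the larger model's temperature-softened output distribution. The sketch below assumes that setup; `softmax`, `distillation_loss`, the default temperature of 2.0, and the example logits are illustrative choices, and the temperature-squared scaling that some formulations apply is omitted.

```python
import math


def softmax(logits, temperature=1.0):
    """Convert logits to probabilities; higher temperature gives a softer distribution."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL divergence from the softened teacher distribution to the student's."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Zero when the student matches the teacher, positive when it disagrees.
print(distillation_loss([2.0, 1.0, 0.0], [2.0, 1.0, 0.0]))
print(distillation_loss([2.0, 1.0, 0.0], [0.0, 1.0, 2.0]))
```

In practice this term is often combined with the ordinary loss on ground-truth labels; that combination is left out here to keep the sketch short.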

Greedy vs dynamic programming: greedy commits to the locally optimal choice at each step; DP solves all relevant subproblems and combines their solutions
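Coin change is a standard example where the two approaches differ. The sketch below assumes the coin set {1, 3, 4} and a target of 6, chosen because greedy is suboptimal there; the function names `greedy_coin_count` and `dp_coin_count` are hypothetical.

```python
def greedy_coin_count(coins, amount):
    """Take as many of the largest coin as fit, then move on; None if it gets stuck."""
    count = 0
    for coin in sorted(coins, reverse=True):
        used, amount = divmod(amount, coin)
        count += used
    return count if amount == 0 else None


def dp_coin_count(coins, amount):
    """Fewest coins for every sub-amount from 0 up to amount; None if unreachable."""
    unreachable = float("inf")
    best = [0] + [unreachable] * amount
    for sub in range(1, amount + 1):
        for coin in coins:
            if coin <= sub and best[sub - coin] + 1 < best[sub]:
                best[sub] = best[sub - coin] + 1
    return best[amount] if best[amount] != unreachable else None


print(greedy_coin_count([1, 3, 4], 6))  # 4 + 1 + 1
print(dp_coin_count([1, 3, 4], 6))      # 3 + 3
```

Greedy grabs the 4 first and then needs two 1s, while DP compares every sub-amount and finds that two 3s suffice.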
