Prisoner's dilemma illustrates how individual rationality can lead to collectively worse outcomes
Image: David McSpadden from Daly City, United States, CC BY 2.0, via Wikimedia Commons
Prisoner's dilemma illustrates how individual rationality can lead to collectively worse outcomes
The prisoner's dilemma showcases the paradox where two rational agents choosing to defect results in a worse collective outcome compared to mutual cooperation. This paradox highlights the conflict between individual rationality and collective well-being.
Example
In a classic scenario, if both prisoners betray each other, they each receive a moderate sentence. However, if they had both cooperated, they would have received lighter sentences.
Understanding this paradox is crucial for designing systems and policies that encourage cooperation and improve collective outcomes.
a dominant strategy is: optimal regardless of what other players do
A dominant strategy maximizes payoff irrespective of opponents' actions
mechanism design does: designs rules so rational agents produce desired outcomes
Mechanism design: Creates rules ensuring rational agents achieve intended outcomes
the minimax theorem says: in zero-sum games, there's a saddle point strategy
In zero-sum games, minimax theorem guarantees a saddle point strategy
Reinforcement learning from human feedback
RLHF optimizes a reward model trained on human preference pairs
Knowledge distillation
Knowledge distillation transfers knowledge from a large model to a smaller one without loss of validity
Greedy vs dynamic programming: greedy makes locally optimal choices, DP considers all subproblems
Greedy: locally optimal choices; DP: considers all subproblems
One email a day: 5 concepts + the 5 stories that matter →
Swipe through 100 ML concepts daily
Open TickerNews