The Prisoner's Dilemma: Why Smart People Make Bad Choices Together
“Two criminals are caught. Each can stay silent or betray the other. If both stay silent, both go free. Logic says you should betray. But if everyone follows that logic, everyone loses. What's going on?”
You've been arrested. So has your friend. You're in separate rooms and cannot communicate.
The police offer you both the same deal:
- If both of you stay silent: you both walk free. No charges.
- If you betray and your friend stays silent: you walk free, your friend gets 5 years.
- If both of you betray: you both get 3 years.
Think carefully. What do you do?
Most people start by reasoning like this: "If my friend cooperates, I'm better off betraying — I go free instead of them. If my friend betrays, I'm still better off betraying — I get 3 years instead of 5. Either way, betraying is the better move for me."
This is perfectly correct logic. The problem is, your friend is thinking exactly the same thing.
So both of you betray. Both of you get 3 years. Meanwhile, the outcome where you both went free was sitting right there — you just couldn't reach it together.
This is the Prisoner's Dilemma, and it is one of the most important ideas in social science. It describes situations where individually rational choices lead to collectively irrational outcomes.
It shows up constantly in the real world. Two countries spend billions arming against each other — neither wants to, both feel they must. Fishermen overfish a shared lake — each knows it's unsustainable, each does it anyway. Athletes dope to stay competitive — no one wants to, but no one wants to be the only one who doesn't.
The structure is identical every time: individual logic says one thing, collective wellbeing says another.
What changes the game is repetition and relationship. If you and your friend play this game again and again — and both know you will — cooperation becomes rational. Your reputation matters. Retaliation matters. Trust becomes a strategy.
The most celebrated solution is simple: start by cooperating. Then do whatever your partner did last time. Reward cooperation with cooperation. Punish betrayal with betrayal. This strategy — called Tit for Tat — consistently outperforms more complex approaches in computer tournaments, because it is clear, predictable, and forgiving.
The real lesson isn't about prisoners. It's about why humans built courts, laws, trade agreements, and constitutions: to solve Prisoner's Dilemmas at scale.
Ready to explore?
4 interactive activities waiting in the next tab.
The best known strategy in repeated Prisoner's Dilemma tournaments is called 'Tit for Tat' — cooperate first, then copy whatever the other person did last round. Why does this simple strategy beat more complex ones? What does it reward and punish?
Reflect
The Prisoner's Dilemma shows that individual rationality and collective rationality can be completely different things. Can you think of a rule or institution in society that exists specifically to solve this kind of problem?