Prisoners Dilemma

Wajoma

Die Cheeseburger

Debates

03 Dec 08 20:34

KazetNagorra

Germany

Joined: 27 Oct 08
Moves: 3118

05 Dec 08 14:50

Originally posted by telerion
I don't work directly in game theory, and it's been a few years since I took the course for my grad studies; but isn't it the case that in order for "both cooperate" to occur along the equilibrium path in one of these simple PD's, the horizon must be infinite? The problem with finitely repeated games is that they unravel through backward iteration. Genera ...[text shortened]... hough. Again, if I've messed up something above, please jump in and correct me.

It's true that for a finite amount of iterations, the NE will also be that everyone always defects. However, there is a loophole that does allow the cooperate option to be at equilibrium, and that is if the amount of iterations is unknown beforehand. If you don't know that the last iteration is happening, you do not have to defect per se.

About your comment on belief: yes, this is an interesting variation and it leads to interesting situations. People tend to adapt their strategies in prisoner's dilemmas based on their previous experiences. For example, in countries with a lower perception of crime, people tend to trust random strangers more.

http://www.nationmaster.com/graph/lif_tru_peo-lifestyle-trust-people

telerion

True X X Xian

The Lord's Army

Joined: 18 Jul 04
Moves: 8353

05 Dec 08 17:05

1 edit

Originally posted by KazetNagorra
It's true that for a finite amount of iterations, the NE will also be that everyone always defects. However, there is a loophole that does allow the cooperate option to be at equilibrium, and that is if the amount of iterations is unknown beforehand. If you don't know that the last iteration is happening, you do not have to defect per se.

About your andom strangers more.

http://www.nationmaster.com/graph/lif_tru_peo-lifestyle-trust-people

Good point. I was thinking of just the standard PD when I made the statement about horizon length. There are likely a lot of different augmentations to the basic repeated problem that can return cooperation equilibria.

That said, even if the number of iterations is unknown ex ante, one would still need some other conditions to be true, particularly the product of the discount rate and the continuation probability would have to be sufficiently large.

As a simple example, imagine that each (or any) player discounts the future at a rate of zero. Then the game, in all but words, returns to the one-shot case.

One doesn't need the product to actually be zero to destroy the cooperating equilibrium sequence. It just has to be small enough so that the expected continuation value is less than the gain from defecting in a given period.

KazetNagorra

Germany

Joined: 27 Oct 08
Moves: 3118

05 Dec 08 17:14

Originally posted by telerion
Good point. I was thinking of just the standard PD when I made the statement about horizon length. There are likely a lot of different augmentations to the basic repeated problem that can return cooperation equilibria.

That said, even if the number of iterations is unknown ex ante, one would still need some other conditions to be true, particularly the ...[text shortened]... so that the expected continuation value is less than the gain from defecting in a given period.

Yes, there are a lot of subtleties. The most rational strategy in iterated prisoner's dilemmas is not easy to establish as it also depends on the strategies of the other actors within the system.

telerion

True X X Xian

The Lord's Army

Joined: 18 Jul 04
Moves: 8353

05 Dec 08 17:19

Originally posted by KazetNagorra
Yes, there are a lot of subtleties. The most rational strategy in iterated prisoner's dilemmas is not easy to establish as it also depends on the strategies of the other actors within the system.

Absolutely. When looking for any sort of NE, we are by definition considering the strategies of all players.

KazetNagorra

Germany

Joined: 27 Oct 08
Moves: 3118

05 Dec 08 22:33

Originally posted by telerion
Absolutely. When looking for any sort of NE, we are by definition considering the strategies of all players.

Yes, but it becomes extra difficult if you don't know the strategies of the other players; you then have to base your strategy on what you estimate the other players' strategy to be.

Barts

Joined: 06 Aug 06
Moves: 1945

05 Dec 08 22:59

No source, but I think there once was a sort of competition with different algorithms for an iterated prisoner's dilemma. If I'm not mistaken, the winner was an algorithm that always did the same as the other player in the previous period, with random cooperate thrown in. It obviously lost against always defect, but was quite successful when paired with most other algorithms.

KazetNagorra

Germany

Joined: 27 Oct 08
Moves: 3118

05 Dec 08 23:04

Originally posted by Barts
No source, but I think there once was a sort of competition with different algorithms for an iterated prisoner's dilemma. If I'm not mistaken, the winner was an algorithm that always did the same as the other player in the previous period, with random cooperate thrown in. It obviously lost against always defect, but was quite successful when paired with most other algorithms.

Correct. Winning strategies are often similar to "Tit for Tat" (do the same as the previous player). However it still depends on the other strategies; when other players have a relatively high degree of cooperation, a more cooperative strategy will be better.

telerion

True X X Xian

The Lord's Army

Joined: 18 Jul 04
Moves: 8353

06 Dec 08 16:34

Originally posted by KazetNagorra
Correct. Winning strategies are often similar to "Tit for Tat" (do the same as the previous player). However it still depends on the other strategies; when other players have a relatively high degree of cooperation, a more cooperative strategy will be better.

We should also point out that in these cases where one strategy wins (for instance in a computational run-off), the outcome is almost certainly never a Nash Equilibrium.

Wajoma

Die Cheeseburger

Provocation

Joined: 01 Sep 04
Moves: 78933

06 Dec 08 20:41

Originally posted by KazetNagorra
Correct. Winning strategies are often similar to "Tit for Tat" (do the same as the previous player). However it still depends on the other strategies; when other players have a relatively high degree of cooperation, a more cooperative strategy will be better.

Still nothing on how this relates to, well, anything actually. Rational people make irrational decisions or, tweak a few dials on the game then rational people make rational decisions, if that is possible then if you tweak them the other way you can make irrational people make rational decisions.

No wonder Nash went potty trying to reconcile his ideas.

telerion

True X X Xian

The Lord's Army

Joined: 18 Jul 04
Moves: 8353

06 Dec 08 22:36

Originally posted by Wajoma
Still nothing on how this relates to, well, anything actually. Rational people make irrational decisions or, tweak a few dials on the game then rational people make rational decisions, if that is possible then if you tweak them the other way you can make irrational people make rational decisions.

No wonder Nash went potty trying to reconcile his ideas.

Usually it's just that the model is not capturing some critical part of the actual process. The basic PD works well for police officers working to criminals against each other. They do it all the time and get convictions.

In other cases, the basic PD doesn't do well. That's why other elements like learning and reputation have been added and do a better job in some cases.

KazetNagorra

Germany

Joined: 27 Oct 08
Moves: 3118

07 Dec 08 14:29

Originally posted by Wajoma
Still nothing on how this relates to, well, anything actually. Rational people make irrational decisions or, tweak a few dials on the game then rational people make rational decisions, if that is possible then if you tweak them the other way you can make irrational people make rational decisions.

No wonder Nash went potty trying to reconcile his ideas.

Why are you so focused on irrationality when it has no place in the prisoner's dilemma?

Wajoma

Die Cheeseburger

Provocation

Joined: 01 Sep 04
Moves: 78933

07 Dec 08 19:58

Originally posted by KazetNagorra
Why are you so focused on irrationality when it has no place in the prisoner's dilemma?

I want to know your reasons for bossing other people around or rather, engaging the services of guvamint thugs to do it for you, I want you to show the relevance of the PD to socialised health care.

Presumably if a person makes a bad choice for themselves that must be considered irrational.

Barts

Joined: 06 Aug 06
Moves: 1945

07 Dec 08 21:07

Isn't the relevance of the PD obvious to you ? It shows that by forcing people to take a certain option (cooperate) that they themselves wouldn't take, they are better off.

KazetNagorra

Germany

Joined: 27 Oct 08
Moves: 3118

07 Dec 08 21:30

Originally posted by Wajoma
I want to know your reasons for bossing other people around or rather, engaging the services of guvamint thugs to do it for you, I want you to show the relevance of the PD to socialised health care.

Presumably if a person makes a bad choice for themselves that must be considered irrational.

What alternative to government do you suggest for avoiding the PD?

kmax87

Irregardless

Joined: 09 Oct 04
Moves: 114473

07 Dec 08 21:52

1 edit

Originally posted by telerion
Usually it's just that the model is not capturing some critical part of the actual process. The basic PD works well for police officers working to criminals against each other. They do it all the time and get convictions.

In other cases, the basic PD doesn't do well. That's why other elements like learning and reputation have been added and do a better job in some cases.

Do all these models reject the notion of honor amongst thieves? Someone did mention that many people caught up in this dilemma are likely to have to continue living amongst the people they would have potentially "shopped". You would think the fear of reprisal would help people stay silent.

The other thing must be that would be felons obviously don't watch cop shows. Anyone with even the most casual acquaintance with Police Drama 101 should know that you tell them nothing, and that the cops almost never have enough evidence to convict without a confession.