Go back
Prisoners Dilemma

Prisoners Dilemma

Debates

K

Germany

Joined
27 Oct 08
Moves
3118
Clock
05 Dec 08
Vote Up
Vote Down

Originally posted by telerion
I don't work directly in game theory, and it's been a few years since I took the course for my grad studies; but isn't it the case that in order for "both cooperate" to occur along the equilibrium path in one of these simple PD's, the horizon must be infinite? The problem with finitely repeated games is that they unravel through backward iteration. Genera ...[text shortened]... hough. Again, if I've messed up something above, please jump in and correct me.
It's true that for a finite amount of iterations, the NE will also be that everyone always defects. However, there is a loophole that does allow the cooperate option to be at equilibrium, and that is if the amount of iterations is unknown beforehand. If you don't know that the last iteration is happening, you do not have to defect per se.

About your comment on belief: yes, this is an interesting variation and it leads to interesting situations. People tend to adapt their strategies in prisoner's dilemmas based on their previous experiences. For example, in countries with a lower perception of crime, people tend to trust random strangers more.

http://www.nationmaster.com/graph/lif_tru_peo-lifestyle-trust-people

t
True X X Xian

The Lord's Army

Joined
18 Jul 04
Moves
8353
Clock
05 Dec 08
1 edit
Vote Up
Vote Down

Originally posted by KazetNagorra
It's true that for a finite amount of iterations, the NE will also be that everyone always defects. However, there is a loophole that does allow the cooperate option to be at equilibrium, and that is if the amount of iterations is unknown beforehand. If you don't know that the last iteration is happening, you do not have to defect per se.

About your andom strangers more.

http://www.nationmaster.com/graph/lif_tru_peo-lifestyle-trust-people
Good point. I was thinking of just the standard PD when I made the statement about horizon length. There are likely a lot of different augmentations to the basic repeated problem that can return cooperation equilibria.

That said, even if the number of iterations is unknown ex ante, one would still need some other conditions to be true, particularly the product of the discount rate and the continuation probability would have to be sufficiently large.

As a simple example, imagine that each (or any) player discounts the future at a rate of zero. Then the game, in all but words, returns to the one-shot case.

One doesn't need the product to actually be zero to destroy the cooperating equilibrium sequence. It just has to be small enough so that the expected continuation value is less than the gain from defecting in a given period.

K

Germany

Joined
27 Oct 08
Moves
3118
Clock
05 Dec 08
Vote Up
Vote Down

Originally posted by telerion
Good point. I was thinking of just the standard PD when I made the statement about horizon length. There are likely a lot of different augmentations to the basic repeated problem that can return cooperation equilibria.

That said, even if the number of iterations is unknown ex ante, one would still need some other conditions to be true, particularly the ...[text shortened]... so that the expected continuation value is less than the gain from defecting in a given period.
Yes, there are a lot of subtleties. The most rational strategy in iterated prisoner's dilemmas is not easy to establish as it also depends on the strategies of the other actors within the system.

t
True X X Xian

The Lord's Army

Joined
18 Jul 04
Moves
8353
Clock
05 Dec 08
Vote Up
Vote Down

Originally posted by KazetNagorra
Yes, there are a lot of subtleties. The most rational strategy in iterated prisoner's dilemmas is not easy to establish as it also depends on the strategies of the other actors within the system.
Absolutely. When looking for any sort of NE, we are by definition considering the strategies of all players.

K

Germany

Joined
27 Oct 08
Moves
3118
Clock
05 Dec 08
Vote Up
Vote Down

Originally posted by telerion
Absolutely. When looking for any sort of NE, we are by definition considering the strategies of all players.
Yes, but it becomes extra difficult if you don't know the strategies of the other players; you then have to base your strategy on what you estimate the other players' strategy to be.

B

Joined
06 Aug 06
Moves
1945
Clock
05 Dec 08
Vote Up
Vote Down

No source, but I think there once was a sort of competition with different algorithms for an iterated prisoner's dilemma. If I'm not mistaken, the winner was an algorithm that always did the same as the other player in the previous period, with random cooperate thrown in. It obviously lost against always defect, but was quite successful when paired with most other algorithms.

K

Germany

Joined
27 Oct 08
Moves
3118
Clock
05 Dec 08
Vote Up
Vote Down

Originally posted by Barts
No source, but I think there once was a sort of competition with different algorithms for an iterated prisoner's dilemma. If I'm not mistaken, the winner was an algorithm that always did the same as the other player in the previous period, with random cooperate thrown in. It obviously lost against always defect, but was quite successful when paired with most other algorithms.
Correct. Winning strategies are often similar to "Tit for Tat" (do the same as the previous player). However it still depends on the other strategies; when other players have a relatively high degree of cooperation, a more cooperative strategy will be better.

t
True X X Xian

The Lord's Army

Joined
18 Jul 04
Moves
8353
Clock
06 Dec 08
Vote Up
Vote Down

Originally posted by KazetNagorra
Correct. Winning strategies are often similar to "Tit for Tat" (do the same as the previous player). However it still depends on the other strategies; when other players have a relatively high degree of cooperation, a more cooperative strategy will be better.
We should also point out that in these cases where one strategy wins (for instance in a computational run-off), the outcome is almost certainly never a Nash Equilibrium.

Wajoma
Die Cheeseburger

Provocation

Joined
01 Sep 04
Moves
78933
Clock
06 Dec 08
Vote Up
Vote Down

Originally posted by KazetNagorra
Correct. Winning strategies are often similar to "Tit for Tat" (do the same as the previous player). However it still depends on the other strategies; when other players have a relatively high degree of cooperation, a more cooperative strategy will be better.
Still nothing on how this relates to, well, anything actually. Rational people make irrational decisions or, tweak a few dials on the game then rational people make rational decisions, if that is possible then if you tweak them the other way you can make irrational people make rational decisions.

No wonder Nash went potty trying to reconcile his ideas.

t
True X X Xian

The Lord's Army

Joined
18 Jul 04
Moves
8353
Clock
06 Dec 08
Vote Up
Vote Down

Originally posted by Wajoma
Still nothing on how this relates to, well, anything actually. Rational people make irrational decisions or, tweak a few dials on the game then rational people make rational decisions, if that is possible then if you tweak them the other way you can make irrational people make rational decisions.

No wonder Nash went potty trying to reconcile his ideas.
Usually it's just that the model is not capturing some critical part of the actual process. The basic PD works well for police officers working to criminals against each other. They do it all the time and get convictions.

In other cases, the basic PD doesn't do well. That's why other elements like learning and reputation have been added and do a better job in some cases.

K

Germany

Joined
27 Oct 08
Moves
3118
Clock
07 Dec 08
Vote Up
Vote Down

Originally posted by Wajoma
Still nothing on how this relates to, well, anything actually. Rational people make irrational decisions or, tweak a few dials on the game then rational people make rational decisions, if that is possible then if you tweak them the other way you can make irrational people make rational decisions.

No wonder Nash went potty trying to reconcile his ideas.
Why are you so focused on irrationality when it has no place in the prisoner's dilemma?

Wajoma
Die Cheeseburger

Provocation

Joined
01 Sep 04
Moves
78933
Clock
07 Dec 08
Vote Up
Vote Down

Originally posted by KazetNagorra
Why are you so focused on irrationality when it has no place in the prisoner's dilemma?
I want to know your reasons for bossing other people around or rather, engaging the services of guvamint thugs to do it for you, I want you to show the relevance of the PD to socialised health care.

Presumably if a person makes a bad choice for themselves that must be considered irrational.

B

Joined
06 Aug 06
Moves
1945
Clock
07 Dec 08
Vote Up
Vote Down

Isn't the relevance of the PD obvious to you ? It shows that by forcing people to take a certain option (cooperate) that they themselves wouldn't take, they are better off.

K

Germany

Joined
27 Oct 08
Moves
3118
Clock
07 Dec 08
Vote Up
Vote Down

Originally posted by Wajoma
I want to know your reasons for bossing other people around or rather, engaging the services of guvamint thugs to do it for you, I want you to show the relevance of the PD to socialised health care.

Presumably if a person makes a bad choice for themselves that must be considered irrational.
What alternative to government do you suggest for avoiding the PD?

kmax87
Republicant Retiree

Blade Runner

Joined
09 Oct 04
Moves
107144
Clock
07 Dec 08
1 edit
Vote Up
Vote Down

Originally posted by telerion
Usually it's just that the model is not capturing some critical part of the actual process. The basic PD works well for police officers working to criminals against each other. They do it all the time and get convictions.

In other cases, the basic PD doesn't do well. That's why other elements like learning and reputation have been added and do a better job in some cases.
Do all these models reject the notion of honor amongst thieves? Someone did mention that many people caught up in this dilemma are likely to have to continue living amongst the people they would have potentially "shopped". You would think the fear of reprisal would help people stay silent.

The other thing must be that would be felons obviously don't watch cop shows. Anyone with even the most casual acquaintance with Police Drama 101 should know that you tell them nothing, and that the cops almost never have enough evidence to convict without a confession.

Cookies help us deliver our Services. By using our Services or clicking I agree, you agree to our use of cookies. Learn More.