{"id":1133,"date":"2023-06-01T05:29:25","date_gmt":"2023-05-31T20:29:25","guid":{"rendered":"https:\/\/saraheee.com\/?p=1133"},"modified":"2023-06-27T21:28:02","modified_gmt":"2023-06-27T12:28:02","slug":"game-theory-7-chap10-infinitely-repeated-games","status":"publish","type":"post","link":"https:\/\/saraheee.com\/ko\/2023\/06\/game-theory-7-chap10-infinitely-repeated-games\/","title":{"rendered":"Game Theory 7 \u2013 chap10. Infinitely repeated games"},"content":{"rendered":"<h3 class=\"wp-block-heading\">Infinitely Repeated Prisoners&#8217; Dilemma<\/h3>\n\n\n\n<p>Imagine that the prisoners&#8217; dilemma is played infinitely many times<\/p>\n\n\n\n<p>In order to introduce <em>discounting<\/em> of future payoffs, we denote by \\(\\delta \\in\\) [0, 1] the players&#8217; common discount factor<\/p>\n\n\n\n<p>Suppose that the player obtains a payoff of \\(\\nu\\) every period<\/p>\n\n\n\n<p>Then the sum of the discounted payoff stream, or simply the <strong><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-8-color\">discounted payoff<\/mark><\/strong>, is<\/p>\n\n\n\n<p>\\(\\nu + \\nu \\delta + \\nu \\delta^{2} + \\nu \\delta^{3} + \\cdots = \\frac{\\nu}{1-\\delta}\\)<\/p>\n\n\n\n<p>A more convenient way to express payoffs in repeated games is<\/p>\n\n\n\n<p>\\(\\frac{\\nu}{1-\\delta} \\cdot (1 &#8211; \\delta) = \\nu\\)<\/p>\n\n\n\n<p>This is referred to as the <strong><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-8-color\">average discounted payoff<\/mark><\/strong> and denoted \\(\\pi\\)<\/p>\n\n\n\n<p><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-15-color\">e.g., \\(10 + 5 \\delta + 10 \\delta^{2} + 5 \\delta^{3} + \\cdots = \\frac{10}{1-\\delta^{2}} + \\frac{5\\delta}{1-\\delta^{2}}\\)<\/mark><\/p>\n\n\n\n<p><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-15-color\">average discounted payoff: \\((\\frac{10}{1-\\delta^{2}} + \\frac{5\\delta}{1-\\delta^{2}}) \\cdot (1-\\delta)\\) = \\(\\frac{10+5\\delta}{1+\\delta}\\)<\/mark><\/p>\n\n\n\n<p>To formalize the idea of reputation in infinitely repeated games, we consider the following simple strategy:<\/p>\n\n\n\n<p><em>Cooperate so long as no one has ever defected; otherwise defect<\/em><\/p>\n\n\n\n<p>Hence (D, D) is used in the <strong><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-8-color\">punishment<\/mark><\/strong> phase and (C, C) in the <strong><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-8-color\">cooperation<\/mark><\/strong> phase.<br>This kind of strategies is dubbed a <em>grim-trigger<\/em> strategy<\/p>\n\n\n\n<p>To see whether this strategy constitutes a SPE, we utilize the symmetric payoff structure and focus on player 1&#8217;s incentives to deviate<\/p>\n\n\n\n<p><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-15-color\">e.g., symmetric: \\(s_{1} = s_{2}^{\\prime}, s_{2} = s_{1}^{\\prime}\\) in \\(u_{1}(s_{1}, s_{2}) = u_{2}(s_{1}^{\\prime}, s_{2}^{\\prime})\\)<\/mark><\/p>\n\n\n\n<figure class=\"wp-block-table\"><table><tbody><tr><td>1 \\ 2<\/td><td>C<\/td><td>D<\/td><\/tr><tr><td>C<\/td><td>2, 2<\/td><td>0, 3<\/td><\/tr><tr><td>D<\/td><td>3, 0<\/td><td>1, 1<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-15-color\">\\(u_{1}(C, D) = u_{2}(D, C) = 0\\)<\/mark><\/p>\n\n\n\n<h5 class=\"wp-block-heading\">(1) Cooperation Phase<\/h5>\n\n\n\n<p>Eqbm: (C, C), (C, C), (C, C), &#8230; \u2192 \\(\\pi_{1}\\) = 2<\/p>\n\n\n\n<p>Deviation: (D, C), (D, D), (D, D), &#8230; \u2192 \\(\\pi_{1}\\) = <mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-15-color\">\\((1-\\delta)(3+\\sum_{t=1}^{\\infty}\\delta t)\\)<\/mark><\/p>\n\n\n\n<p><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-15-color\">= \\((3 + \\delta + \\delta^{2} + \\cdots) \\cdot (1-\\delta) = (3 + \\frac{\\delta}{1-\\delta}) \\cdot (1-\\delta) = 3(1-\\delta) + \\delta = 3 &#8211; 2\\delta\\)<\/mark><\/p>\n\n\n\n<p><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-15-color\">2 \u2265 3 &#8211; 2 \\(\\delta\\) \u21d4 \\(\\delta\\) \u2265 1\/2<br>So, if delta is greater than 1\/2, it becomes unprofitable to deviate<\/mark><\/p>\n\n\n\n<h5 class=\"wp-block-heading\">(2) Punishment Phase<\/h5>\n\n\n\n<p>Eqbm: (D, D), (D, D), (D, D), &#8230; \u2192 \\(\\pi_{1}\\) = 2<\/p>\n\n\n\n<p>Deviation: (C, D), (D, D), (D, D), &#8230; \u2192 \\(\\pi_{1}\\) = \\((1-\\delta)(0+\\sum_{t=1}^{\\infty}\\delta^{t} \\cdot 1) = \\delta\\)<\/p>\n\n\n\n<p>Hence defecting forever is the best response for player 1<\/p>\n\n\n\n<p>Therefore, the grim-trigger strategy can be supported as a SPE if \\(\\delta\\) \u2265 \\(\\frac{1}{2}\\)<\/p>\n\n\n\n<p>Remark: The <strong><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-8-color\">one-shot deviation principle<\/mark><\/strong> states that a player has no profitable deviation in any subgames if and only if she has no profitable one-shot deviation. Therefore, to determine whether a player&#8217;s behavior is optimal, it is enough to check whether the player cannot benefit from deviating only in the current period.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Another Prisoners&#8217; Dilemma<\/h3>\n\n\n\n<figure class=\"wp-block-table\"><table><tbody><tr><td>1 \\ 2<\/td><td>C<\/td><td>D<\/td><\/tr><tr><td>C<\/td><td>4, 4<\/td><td>-2, 6<\/td><\/tr><tr><td>D<\/td><td>6, -2<\/td><td>0, 0<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p>(1) <strong><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-8-color\">Cooperation<\/mark><\/strong> phase<\/p>\n\n\n\n<p>Eqbm: \\(\\pi_{1}\\) = 4.<br>Deviation: <mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-15-color\">\\(\\pi_{1} = 6(1-\\delta)\\), thus, 4 \u2265 6(1-\\(\\delta\\)) \u21d4 6\\(\\delta\\) \u2265 2 \u21d4 \\(\\delta\\) \u2265 1\/3<\/mark><\/p>\n\n\n\n<p>(2) <strong><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-8-color\">Punishment<\/mark><\/strong> phase<\/p>\n\n\n\n<p>Eqbm: \\(\\pi_{1}\\) = 0.<br>Deviation: <mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-15-color\">\\(\\pi_{1} = -2(1-\\delta)\\), thus, -2(1-\\(\\delta\\)) &lt; 0 \u21d4 \\(\\delta\\) <mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-15-color\">&lt;<\/mark> 1 (Always true)<\/mark><\/p>\n\n\n\n<p><mark style=\"background-color:rgba(0, 0, 0, 0)\" class=\"has-inline-color has-global-color-15-color\">So, \\(\\delta\\) \u2265 1\/3 (C, C) can be supported as an equilibrium<\/mark><\/p>\n\n\n\n<p><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Reference: Chang-Koo Chi, (26\/50) Game Theory and Applications 7 \u2013 \bFinitely repeated game, Jul 8, 2020,&nbsp;<a href=\"https:\/\/youtu.be\/fN6L9RL5IPk\" rel=\"noopener\">https:\/\/youtu.be\/fN6L9RL5IPk<\/a><\/li>\n<\/ul>\n\n\n\n<p><\/p>","protected":false},"excerpt":{"rendered":"<p>Find the deviation of the cooperation phase and the punishment phase so that equilibrium can be reached. Also, learn the status of the one-shot deviation principle.<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5],"tags":[65,4,64,50,67,66,51,63],"class_list":["post-1133","post","type-post","status-publish","format-standard","hentry","category-game-theory-and-applications","tag-cooperation-phase","tag-game-theory","tag-infinitely","tag-jun-1-2023","tag-one-shot-deviation-principle","tag-punishment-phase","tag-repeated-game","tag-repeated-prisoners-dilemma"],"_links":{"self":[{"href":"https:\/\/saraheee.com\/ko\/wp-json\/wp\/v2\/posts\/1133"}],"collection":[{"href":"https:\/\/saraheee.com\/ko\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/saraheee.com\/ko\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/saraheee.com\/ko\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/saraheee.com\/ko\/wp-json\/wp\/v2\/comments?post=1133"}],"version-history":[{"count":22,"href":"https:\/\/saraheee.com\/ko\/wp-json\/wp\/v2\/posts\/1133\/revisions"}],"predecessor-version":[{"id":1598,"href":"https:\/\/saraheee.com\/ko\/wp-json\/wp\/v2\/posts\/1133\/revisions\/1598"}],"wp:attachment":[{"href":"https:\/\/saraheee.com\/ko\/wp-json\/wp\/v2\/media?parent=1133"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/saraheee.com\/ko\/wp-json\/wp\/v2\/categories?post=1133"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/saraheee.com\/ko\/wp-json\/wp\/v2\/tags?post=1133"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}