Game Theory
Transcription
Game Theory
Game Theory Bernhard Nebel, Ingo Thon, Robert Mattmüller, Malte Helmert, Tim Schulte, Dominik Winterer Lecture Notes This work is licensed under the Creative Commons Attribution-NonCommercialShareAlike 4.0 International License. To view a copy of this license, visit http: //creativecommons.org/licenses/by-nc-sa/4.0/. ii Contents 1 Introduction 1.1 What is Game Theory? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2 Areas of Game Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 1 2 Strategic Games 2.1 Dominated Strategies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.1.1 Iterative Elimination of Strictly Dominated Strategies . . . . . . . . . 4 5 6 iii 1 Introduction 1.1 What is Game Theory? Game Theory is the analysis of strategic decision situations where multiple players interact with each other. The result of a game depends on the decisions of the involved players and all players are aware of this fact. This raises the question of what the outcome of the game will be assuming each player acts rationally, i.e. tries to maximize its own utility, and each player itself assumes that all other players act rationally as well. Originally, game theory was a research field of theoretical economics but has come to play an increasingly important role in artificial intelligence and computer science, especially when modelling distributed, heterogeneous systems of selfish agents. While solution concepts for games are already known, many algorithmic questions remain to be answered. Since the assumption of players’ rationality is more reasonable for artificial agents than for natural agents, game theory might even be more applicable to computer science than to economics. Example 1 (Beauty contest). Every player chooses a natural number between 1 and 100. The players that come closest to 2/3 of the average win. Example 2 (Congestion games). Consider a network of streets with travel costs dependent on the number of agents choosing a particular road. The goal of an agent is to travel from s to t minimizing travel cost. Which route will the agents take when there are n = 2, 3, . . . agents? What is the average travel cost per agent? n1/n 1 s t 0 1 n2/n 1.2 Areas of Game Theory One area of game theory deals with the analysis of strategic games (normal form games) where each player selects a strategy from a predefined set of strategies and the outcome of the game is determined by the resulting strategy combination. 1 1 Introduction Example 3 (Prisoner’s dilemma). Two members of a criminal gang are arrested and interrogated separately. The prosecutors lack sufficient evidence to convict the pair on the accused crime. Each prisoner is given the opportunity to defect and betray the other prisoner or to cooperate with the other by remaining silent. Possible outcomes are: 1. both cooperate and each of them is sentenced to three months of prison, 2. only one prisoner defects and gets out of prison while the cooperating prisoner is sentenced to ten years of prison, 3. both defect and each of them is sentenced to three years of prison. Another area of game theory regards extensive-form games where players perform multiple moves (e.g. chess or repeated strategic games). We differentiate between games with perfect information, like chess, and games with imperfect information, like poker. Coalition or negotiation games model situations such as negotiations, distribution of payoffs, or elections. Mechanism design considers games from the perspective of a game designer who specifies the rules of the game with the intention to optimize the social welfare of all players. The most important solution concepts in game theory are the elimination of dominated strategies, Nash equilibria, and other equilibria notions. 2 1 Introduction 3 2 Strategic Games Definition 4 (Strategic game). A strategic game G = hN, (Ai ), (ui )i consists of • a finite nonempty set of players N , • for each player i ∈ N , a nonempty set Ai of actions/strategies and • for each player i ∈ N , payoff function ui : A → R, where Y A= Ai . i∈N The strategic game G is finite if A is finite. Instead of payoff functions, we will sometimes use the notion of preference relations. The preference relation i for player i ∈ N is defined as follows: a i b iff ui (a) ≥ ui (b). We can model a finite strategic game using a payoff matrix. As an example, consider a two-player game where each player has two actions. player 1 has actions T and B, player 2 has actions L and R. player 2 L R T w1 , w2 x1 , x2 B y1 , y2 z1 , z2 player 1 If player 1 chooses action T and player 2 chooses action L, then player 1 gets payoff w1 and player 2 gets payoff w2 , i. e. in every cell of the payoff matrix the first value is the payoff of player 1, and the second is the payoff of player 2. Example 5 (Prisoner’s dilemma). Consider the prisoner’s dilemma as described above. Let D be the strategy to defect and C to cooperate with the fellow prisoner. The payoffs correspond to the months in prison: C D C −3, −3 −120, 0 D 0, −120 −36, −36 4 2 Strategic Games Example 6 (Hawk and dove). In a fight for resources, two players can behave either like a hawk (H) or like a dove (D). If both players behave like doves, both players share the benefit, if a hawk meets a dove, the hawk wins and gets the bigger part. However if two hawks meet each other, then the benefit gets lost completely because they will fight each other. D H D 3, 3 1, 4 H 4, 1 0, 0 Example 7 (Matching pennies). Two players can choose either heads (H) or tails (T ) of a coin. If both players make the same choice, then player 1 receives one Euro from player 1, and if they make different choices, then player 2 gets one Euro from player 1. H T H 1, −1 −1, 1 T −1, 1 1, −1 Example 8 (Bach or Stravinsky). Two persons, one of whom prefers Bach whereas the other one prefers Strawinsky want to go to a concert together. For both it is more important to go to the same concert than to go to their favorite concert. Let B be the action of going to the Bach concert, S the action of going to the Strawinsky concert. Strawinsky enthusiast B S B 2, 1 0, 0 S 0, 0 1, 2 Bach enthusiast 2.1 Dominated Strategies Q Notation 9. Let a = (ai )i∈N be strategy profile (a ∈ A = i∈N Ai ). Furthermore we define a−i := (aj )j∈N \{i} and (a−i , ai ) = (aj )j∈(N \{i})∪{i} = (aj )j∈N = a. Definition 10 (Strictly dominated strategy). An action ai ∈ Ai in strategic game hN, (Ai ), (ui )i is called strictly dominated if there is an action a+ i ∈ Ai such that for all a ∈ A: ui (a−i , ai ) < ui (a−i , a+ i ). 5 2 Strategic Games Remark 11. It is never rational to play strictly dominated strategies. 2.1.1 Iterative Elimination of Strictly Dominated Strategies • cancel out strategies that are strictly dominated as long as possible. • if there is only one strategy profile left then that is the solution. Example 12. Firstly, strike out the first row which is strictly dominated by the second row. Then eliminate the second column which is strictly dominated by the first column. C D C 3,3 0,4 D 4,0 1,1 D C D 4,0 1,1 Example 13. Iterative elimination of strictly dominated strategies in three steps: L R T 2,1 0,0 M 1,2 2,1 B 0,0 1,1 L R L T 2,1 0,0 T 2,1 M 1,2 2,1 M 1,2 Remark 14. Only rarely there is strict dominances between actions. Remark 15. The result of iterative elimination of strict dominance is unique, i. e., independent of the elimination order. Definition 16 (Weakly dominated strategies). An action ai ∈ Ai in game hN, (Ai ), (ui )i is called weakly dominated, if there is an action a+ i ∈ Ai such that for all a ∈ A ui (a−i , ai ) ≤ ui (a−i , a+ i ) and for at least one a ∈ A, we have ui (a−i , ai ) < ui (a−i , a+ i ). Example 17. The result of iterative elimination of weakly dominated strategies is in general not unique and depends on the elimination order. To see this, consider the following game: 6 2 Strategic Games L R T 2, 1 0, 0 M 2, 1 1, 1 B 0, 0 1, 1 First of all, we eliminate action T (T ≤ M ), then we eliminate action L (L ≤ R). As a result every possible strategy profile, i.e. M R or BR, in the reduced game has payoff profile (1, 1). An alternative to this elimination order is to first eliminate B (B ≤ M ), and then R (R ≤ L) leading to a reduced game with strategy profiles T L and M L. Both strategy profiles have payoff profiles (2, 1) and give player 1 –unlike the other ordering–a higher utility compared to the reduced game where only strategy profiles M R and BR remained. 7 2 Strategic Games 8 Bibliography [Bin92] Binmore, Ken: Fun and Games. D. C. Heath and Co., Lexington, MA, 1992. [CS03] Conitzer, Vincent and Tuomas Sandholm: Complexity Results about Nash Equilibria. In Gottlob, Georg and Toby Walsh (editors): Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (IJCAI), Acapulco, Mexico, pages 765–771. Morgan Kaufmann, 2003. [FT91] Fudenberg, Drew and Jean Tirole: Game Theory. MIT Press, Cambridge, MA, 1991. [HI02] Holler, Manfred J. and Gerhard Illing: Einführung in die Spieltheorie. Springer-Verlag, Berlin, fünfte edition, 2002. [NRTV07] Nisan, Noam, Tim Roughgarden, Éva Tardos and Vijay V. Vazirani (editors): Algorithmic Game Theory. Cambridge University Press, 2007. [OR94] Osborne, Martin J. and Ariel Rubinstein: A Course in Game Theory. MIT Press, Cambridge, MA, 1994. [Osb03] Osborne, Martin J.: An Introduction to Game Theory. Oxford University Press, 2003. [RZ94] Rosenschein, Jeffrey S. and Gilad Zlotkin: Rules of Encounter. MIT Press, Cambridge, MA, 1994. [vS02] von Stengel, Bernhard: Computing Equilibria for Two-Person Games. In Aumann, Robert J. and Sergiu Hart (editors): Handbook of Game Theory, volume 3, pages 1723–1759. North-Holland, Amsterdam, 2002. 9 Index Aktion, 4, 31 Allokation, 62 approximative, 64 Allokationsproblem für single-minded bidders, 63 Anreizkompatibilität einer sozialen Entscheidungsfunktion, 53 eines Mechanismus, 58 eines Mechanismus für single-minded bidders, 64 Approximation einer Allokation, 64 Arrows Unmöglichkeitssatz, 50, 51 Auktion Vickrey-Auktion, 57 Auktionen kombinatorische, 62 Auszahlungsfunktion, 4, 31 Einstimmigkeit partielle, 50 totale, 50 Ergebnis, 33 Erweiterung einer sozialen Entscheidungsfunktion, 54 Erweiterungslemma, 54 Evolutionäres Gleichgewicht, 21 Gefangenendilemma, 4 Gemischte Erweiterung, 16 Gleichgewicht teilspielperfektes, 35 Greedy-Mechanismus für single-minded bidders, 65 Hawk und dove, 5 Historie, 31 Horizont endlicher, 31 Bach or Stravinsky, 5 Berechnungseffizienz eines Mechanismus für single-minded bidders, 64 Bewertung Kombinatorische Auktionen, 62 single-minded, 63 Borda-Wahlverfahren, 46 Individuelle Rationalität, 59 Kakutani’s fixed point theorem, 17 Kemeny-Young-Methode, 49 Kombinatorische Auktionen, 62 kritische Bezahlungen, 65 Kuchenverteilspiel, 39 Linear Complementarity Problem, 26 Lineares Programm, 25 Cauchy-Schwarzsche Ungleichung, 67 Clarke-Pivot-Funktion, 59 Clarke-Pivot-Regel, 59 Condorcet-Methoden, 48 Condorcet-Paradoxon, 46 Matching Pennies, 5 Maximinimierer, 11 Mechanismus, 58 für single-minded bidders, 64 Vickrey-Clarke-Groves-Mechanismus, 58 Mechanismusdesign, 57 Mehrheitsentscheidung, 46 Monotonie Diktator soziale Entscheidungsfunktion, 53 soziale Wohlfahrtsfunktion, 50 Dominanz schwache, 6 strikte, 5 10 INDEX einer sozialen Entscheidungsfunktion, 53 eines Mechanismus für single-minded bidders, 65 symmetrisches strategisches, 10 Spieler, 4, 31 Strategie evolutionär stabile, 21 gemischte, 15 in extensiven Spielen mit perfekter Information, 32 reine, 4 Strategische Manipulation, 53 strategy profile, 5 Nash-Gleichgewicht, 7 in gemischten Strategien, 16 Neutralität paarweise, 51 Nullsummenspiel, 10 Nutzen erwarteter, 16 Teilspiel, 34 Teilspielperfektes Gleichgewicht, 35 Top-Präferenz, 54 Transfer positiver, 59 Paarweise Neutralität, 51 Pluralitätsverfahren, 46 Pluralitätswahl, 47 in zwei Runden, 47 positiver Transfer, 59 Präferenzrelation, 46 Präferenzwahl mit übertragbaren Stimmen, 47 Unabhängigkeit von irrelevanten Alternativen, 50 Unterstützungsmenge, 16 Vickrey-Auktion, 57 Vickrey-Clarke-Groves-Mechanismus, 58 Satz von Arrow, 51 von Conitzer und Sandholm, 28 von Gibbard-Satterthwaite, 55 von Kakutani, 17 von Kuhn, 37 von Nash, 16 von Vickrey-Clarke-Groves, 58 Schritt-Abweichung, 35 Schulze-Methode, 48 Single-minded Bewertung, 63 bid, 63 bidder, 63 Soziale Effizienz, 62 Soziale Entscheidung, 46 Soziale Entscheidungsfunktion, 46 Soziale Wohlfahrt einer Allokation, 62 Soziale Wohlfahrtsfunktion, 46 Spiel extensives mit perfekter Information, 31 und simultanen Zügen, 39 und Zufallszügen, 38 strategisches, 4 Wahlverfahren Condorcet-Methoden, 48 Kemeny-Young-Methode, 49 Pluralitätswahl, 47 in zwei Runden, 47 Präferenzwahl mit übertragbaren Stimmen, 47 Schulze-Methode, 48 11