전략(게임 이론)

게임 이론에서, 플레이어의 전략은 자신의 행동뿐만 아니라 다른 사람의 행동에 따라 결과가 달라지는 환경에서 그들이 선택하는 옵션들 중 하나이다.^[1] 그 규율은 주로 다른 선수들의 행동이나 행동에 영향을 미치는 게임에서의 선수의 행동에 관한 것이다. "게임"의 예로는 체스, 브릿지, 포커, 독점, 외교 또는 전함을 들 수 있다.^[2] 플레이어의 전략은 플레이어가 경기의 어떤 단계에서 어떤 행동을 취할지 결정할 것이다. 게임 이론을 연구할 때 경제학자들은 서로 다른 분야에서 둘 이상의 당사자 사이의 결정을 분석할 때 취해지는 심리적 또는 사회학적 관점보다는 결정을 분석하는 데 더 합리적인 렌즈를 사용한다.

전략 개념은 때때로 움직임의 개념과 혼동된다. 이동은 플레이어가 게임을 하는 동안(예: 체스에서, 백인의 비숍 a2를 b3으로 이동) 어느 시점에 취하는 행동이다. 반면에 전략은 게임을 하는 완전한 알고리즘으로, 게임 내내 가능한 모든 상황에 대해 무엇을 해야 하는지 선수에게 알려준다. 방향목록으로 '전략'을, 방향목록 자체를 한바퀴 돌면서 '움직임'을 생각해보면 도움이 된다. 이 전략은 각 행동의 성과나 결과에 기초한다. 각 에이전트의 목표는 경쟁자의 행동에 근거하여 그들의 보상을 고려하는 것이다. 예를 들어, 경쟁자 A는 경쟁자 B가 시장에 진입한다고 가정할 수 있다. 여기서부터, 경쟁자 A는 참가와 참가하지 않음으로 그들이 받는 보상을 비교한다. 다음 단계는 선수 B가 참가하지 않는다고 가정하고, 선수 A가 참가 여부를 결정하는 경우에 따라 어떤 보상이 더 나은지 고려하는 것이다. 이 기법은 선수가 어떤 행동을 취하든 어떤 행동을 취하든 간에 선수가 어떤 행동을 취하든 그 행동을 식별하여 보상을 극대화할 수 있는 지배적인 전략을 식별할 수 있다. 이것은 또한 선수들이 아래에서 더 자세히 논의되는 내시 평형을 식별하는 데 도움이 된다.

A strategy profile (sometimes called a strategy combination) is a set of strategies for all players which fully specifies all actions in a game. A strategy profile must include one and only one strategy for every player.

Strategy set

A player's strategy set defines what strategies are available for them to play. A strategy profile is a list of strategy sets, ordered from most to least desirable.

A player has a finite strategy set if they have a number of discrete strategies available to them. For instance, a game of rock paper scissors comprises a single move by each player—and each player's move is made without knowledge of the other's, not as a response—so each player has the finite strategy set {rock paper scissors}.

A strategy set is infinite otherwise. For instance the cake cutting game has a bounded continuum of strategies in the strategy set {Cut anywhere between zero percent and 100 percent of the cake}.

In a dynamic game, games that are played over a series of time, the strategy set consists of the possible rules a player could give to a robot or agent on how to play the game. For instance, in the ultimatum game, the strategy set for the second player would consist of every possible rule for which offers to accept and which to reject.

In a Bayesian game, or games in which players have incomplete information about one another, the strategy set is similar to that in a dynamic game. It consists of rules for what action to take for any possible private information.

Choosing a strategy set

In applied game theory, the definition of the strategy sets is an important part of the art of making a game simultaneously solvable and meaningful. The game theorist can use knowledge of the overall problem, that is the friction between two or more players, to limit the strategy spaces, and ease the solution.

For instance, strictly speaking in the Ultimatum game a player can have strategies such as: Reject offers of ($1, $3, $5, ..., $19), accept offers of ($0, $2, $4, ..., $20). Including all such strategies makes for a very large strategy space and a somewhat difficult problem. A game theorist might instead believe they can limit the strategy set to: {Reject any offer ≤ x, accept any offer > x; for x in ($0, $1, $2, ..., $20)}.

Pure and mixed strategies

A pure strategy provides a complete definition of how a player will play a game. Pure strategy can be thought about as a plan subject to the observations they make during the course of the game of play. In particular, it determines the move a player will make for any situation they could face. A player's strategy set is the set of pure strategies available to that player.

A mixed strategy is an assignment of a probability to each pure strategy. When enlisting mixed strategy, it is often because the game doesn't allow for a rational description in specifying a pure strategy for the game. This allows for a player to randomly select a pure strategy. (See the following section for an illustration.) Since probabilities are continuous, there are infinitely many mixed strategies available to a player. Since probabilities are being assigned to strategies for a specific player when discussing the payoffs of certain scenarios the payoff must be referred to as "expected payoff".

Of course, one can regard a pure strategy as a degenerate case of a mixed strategy, in which that particular pure strategy is selected with probability 1 and every other strategy with probability 0.

A totally mixed strategy is a mixed strategy in which the player assigns a strictly positive probability to every pure strategy. (Totally mixed strategies are important for equilibrium refinement such as trembling hand perfect equilibrium.)

Mixed strategy

Illustration

In a soccer penalty kick, the kicker must choose whether to kick to the right or left side of the goal, and simultaneously the goalie must decide which way to block it. Also, the kicker has a direction they are best at shooting, which is left if they are right-footed. The matrix for the soccer game illustrates this situation, a simplified form of the game studied by Chiappori, Levitt, and Groseclose (2002).^[3] It assumes that if the goalie guesses correctly, the kick is blocked, which is set to the base payoff of 0 for both players. If the goalie guesses wrong, the kick is more likely to go in if it is to the left (payoffs of +2 for the kicker and -2 for the goalie) than if it is to the right (the lower payoff of +1 to kicker and -1 to goalie).

		Goalie
		Lean Left	Lean Right
Kicker	Kick Left	0, 0	+2, -2
	Kick Right	+1, -1	0, 0


Payoff for the Soccer Game (Kicker, Goalie)

This game has no pure-strategy equilibrium, because one player or the other would deviate from any profile of strategies—for example, (Left, Left) is not an equilibrium because the Kicker would deviate to Right and increase his payoff from 0 to 1.

The kicker's mixed-strategy equilibrium is found from the fact that they will deviate from randomizing unless their payoffs from Left Kick and Right Kick are exactly equal. If the goalie leans left with probability g, the kicker's expected payoff from Kick Left is g(0) + (1-g)(2), and from Kick Right is g(1) + (1-g)(0). Equating these yields g= 2/3. Similarly, the goalie is willing to randomize only if the kicker chooses mixed strategy probability k such that Lean Left's payoff of k(0) + (1-k)(-1) equals Lean Right's payoff of k(-2) + (1-k)(0), so k = 1/3. Thus, the mixed-strategy equilibrium is (Prob(Kick Left) = 1/3, (Prob(Lean Left) = 2/3).

Note that in equilibrium, the kicker kicks to their best side only 1/3 of the time. That is because the goalie is guarding that side more. Also note that in equilibrium, the kicker is indifferent which way they kick, but for it to be an equilibrium they must choose exactly 1/3 probability.

Chiappori, Levitt, and Groseclose try to measure how important it is for the kicker to kick to their favored side, add center kicks, etc., and look at how professional players actually behave. They find that they do randomize, and that kickers kick to their favored side 45% of the time and goalies lean to that side 57% of the time. Their article is well-known as an example of how people in real life use mixed strategies despite not being mathematically sophisticated.

Significance

In his famous paper, John Forbes Nash proved that there is an equilibrium for every finite game. One can divide Nash equilibria into two types. Pure strategy Nash equilibria are Nash equilibria where all players are playing pure strategies. Mixed strategy Nash equilibria are equilibria where at least one player is playing a mixed strategy. While Nash proved that every finite game has a Nash equilibrium, not all have pure strategy Nash equilibria. For an example of a game that does not have a Nash equilibrium in pure strategies, see Matching pennies. However, many games do have pure strategy Nash equilibria (e.g. the Coordination game, the Prisoner's dilemma, the Stag hunt). Further, games can have both pure strategy and mixed strategy equilibria. An easy example is the pure coordination game, where in addition to the pure strategies (A,A) and (B,B) a mixed equilibrium exists in which both players play either strategy with probability 1/2.

Interpretations of Mixed Strategies

During the 1980s, the concept of mixed strategies came under heavy fire for being "intuitively problematic", since they are weak Nash equilibria, and a player is indifferent about whether to follow their equilibrium strategy probability or deviate to some other probability.^[4] ^[5] game theorist Ariel Rubinstein describes alternative ways of understanding the concept. The first, due to Harsanyi (1973),^[6] is called purification, and supposes that the mixed strategies interpretation merely reflects our lack of knowledge of the players' information and decision-making process. Apparently random choices are then seen as consequences of non-specified, payoff-irrelevant exogenous factors.^[5] A second interpretation imagines the game players standing for a large population of agents. Each of the agents chooses a pure strategy, and the payoff depends on the fraction of agents choosing each strategy. The mixed strategy hence represents the distribution of pure strategies chosen by each population. However, this does not provide any justification for the case when players are individual agents.

Later, Aumann and Brandenburger (1995),^[7] re-interpreted Nash equilibrium as an equilibrium in beliefs, rather than actions. For instance, in rock paper scissors an equilibrium in beliefs would have each player believing the other was equally likely to play each strategy. This interpretation weakens the descriptive power of Nash equilibrium, however, since it is possible in such an equilibrium for each player to actually play a pure strategy of Rock in each play of the game, even though over time the probabilities are those of the mixed strategy.

Behavior strategy

While a mixed strategy assigns a probability distribution over pure strategies, a behavior strategy assigns at each information set a probability distribution over the set of possible actions. While the two concepts are very closely related in the context of normal form games, they have very different implications for extensive form games. Roughly, a mixed strategy randomly chooses a deterministic path through the game tree, while a behavior strategy can be seen as a stochastic path. The relationship between mixed and behavior strategies is the subject of Kuhn's theorem, a behavioral outlook on traditional game-theoretic hypotheses. The result establishes that in any finite extensive-form game with perfect recall, for any player and any mixed strategy, there exists a behavior strategy that, against all profiles of strategies (of other players), induces the same distribution over terminal nodes as the mixed strategy does. The converse is also true.

A famous example of why perfect recall is required for the equivalence is given by Piccione and Rubinstein (1997)^{[full citation needed]} with their Absent-Minded Driver game.

Outcome Equivalence

Outcome equivalence combines the mixed and behavioral strategy of Player i in relation to the pure strategy of Player i’s opponent. Outcome equivalence is defined as the situation in which, for any mixed and behavioral strategy that Player i takes, in response to any pure strategy that Player I’s opponent plays, the outcome distribution of the mixed and behavioral strategy must be equal. This equivalence can be described by the following formula: (Q^(U(i), S(-i)))(z) = (Q^(β(i), S(-i)))(z), where U(i) describes Player i's mixed strategy, β(i) describes Player i's behavioral strategy, and S(-i) is the opponent's strategy.^[8]

Strategy With Perfect Recall

Perfect recall is defined as the ability of every player in game to remember and recall all past actions within the game. Perfect recall is required for equivalence as, in finite games with imperfect recall, there will be existing mixed strategies of Player I in which there is no equivalent behavior strategy. This is fully described in the Absent-Minded Driver game formulated by Piccione and Rubinstein. In short, this game is based on the decision-making of a driver with imperfect recall, who needs to take the second exit off the highway to reach home but does not remember which intersection they are at when they reach it. Figure [2] describes this game.

Without perfect information (i.e. imperfect information), players make a choice at each decision node without knowledge of the decisions that have preceded it. Therefore, a player’s mixed strategy can produce outcomes that their behavioral strategy cannot, and vice versa. This is demonstrated in the Absent-minded Driver game. With perfect recall and information, the driver has a single pure strategy, which is [continue, exit], as the driver is aware of what intersection (or decision node) they are at when they arrive to it. On the other hand, looking at the planning-optimal stage only, the maximum payoff is achieved by continuing at both intersections, maximized at p=2/3 (reference). This simple one player game demonstrates the importance of perfect recall for outcome equivalence, and its impact on normal and extended form games.^[9]

참고 항목

참조

^ 벤 폴락 게임 이론: 강의 1 성적표 ECON 159, 2007년 9월 5일, 오픈 예일 코스.
^ Aumann, R. (22 March 2017). Game Theory. In: Palgrave Macmillan. London: Palgrave Macmillan. ISBN 978-1-349-95121-5.
^ Chiappori, P. -A.; Levitt, S.; Groseclose, T. (2002). "Testing Mixed-Strategy Equilibria when Players Are Heterogeneous: The Case of Penalty Kicks in Soccer" (PDF). American Economic Review. 92 (4): 1138. CiteSeerX 10.1.1.178.1646. doi:10.1257/00028280260344678.
^ Aumann, R. (1985). "What is Game Theory Trying to accomplish?" (PDF). In Arrow, K.; Honkapohja, S. (eds.). Frontiers of Economics. Oxford: Basil Blackwell. pp. 909–924.
^ ^a ^b Rubinstein, A. (1991). "Comments on the interpretation of Game Theory". Econometrica. 59 (4): 909–924. doi:10.2307/2938166. JSTOR 2938166.
^ Harsanyi, John (1973). "Games with randomly disturbed payoffs: a new rationale for mixed-strategy equilibrium points". Int. J. Game Theory. 2: 1–23. doi:10.1007/BF01737554.
^ Aumann, Robert; Brandenburger, Adam (1995). "Epistemic Conditions for Nash Equilibrium". Econometrica. 63 (5): 1161–1180. CiteSeerX 10.1.1.122.5816. doi:10.2307/2171725. JSTOR 2171725.
^ "Outcome-equivalence of self-confirming equilibrium and Nash equilibrium". Games and Economic Behavior. 75 (1): 441–447. 2012-05-01. doi:10.1016/j.geb.2011.09.010. ISSN 0899-8256.
^ Kak, Subhash (2017). "The Absent-Minded Driver Problem Redux" (PDF). Retrieved 22 April 2021.

[1] 벤 폴락 게임 이론: 강의 1 성적표 ECON 159, 2007년 9월 5일, 오픈 예일 코스.

[2] Aumann, R. (22 March 2017). Game Theory. In: Palgrave Macmillan. London: Palgrave Macmillan. ISBN 978-1-349-95121-5.

[3] Chiappori, P. -A.; Levitt, S.; Groseclose, T. (2002). "Testing Mixed-Strategy Equilibria when Players Are Heterogeneous: The Case of Penalty Kicks in Soccer" (PDF). American Economic Review. 92 (4): 1138. CiteSeerX 10.1.1.178.1646. doi:10.1257/00028280260344678.

[Aumann1985-4] Aumann, R. (1985). "What is Game Theory Trying to accomplish?" (PDF). In Arrow, K.; Honkapohja, S. (eds.). Frontiers of Economics. Oxford: Basil Blackwell. pp. 909–924.

[Rubinstein1991-5] Rubinstein, A. (1991). "Comments on the interpretation of Game Theory". Econometrica. 59 (4): 909–924. doi:10.2307/2938166. JSTOR 2938166.

[6] Harsanyi, John (1973). "Games with randomly disturbed payoffs: a new rationale for mixed-strategy equilibrium points". Int. J. Game Theory. 2: 1–23. doi:10.1007/BF01737554.

[7] Aumann, Robert; Brandenburger, Adam (1995). "Epistemic Conditions for Nash Equilibrium". Econometrica. 63 (5): 1161–1180. CiteSeerX 10.1.1.122.5816. doi:10.2307/2171725. JSTOR 2171725.

[8] "Outcome-equivalence of self-confirming equilibrium and Nash equilibrium". Games and Economic Behavior. 75 (1): 441–447. 2012-05-01. doi:10.1016/j.geb.2011.09.010. ISSN 0899-8256.

[9] Kak, Subhash (2017). "The Absent-Minded Driver Problem Redux" (PDF). Retrieved 22 April 2021.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

v t 게임 이론의 주제
정의들	혼잡 게임 협동 게임 결정성 약속의 에스컬레이션 포브스폼 게임 1번과 2번 우승 게임 복잡성 게임 설명 언어 그래픽 게임 믿음의 위계 정보 세트 노멀 폼 게임 선호 순차 게임 동시 게임 동시 동작 선택 해결된 게임 간결한 게임
평형 개념	나시 평형 서브게임 완성도 메르텐스-안정성 평형 베이시안 나시 평형 완벽한 베이시안 평형 떨리는 손 적정 평형 엡실론 평형화 상관평형 순차 평형 준완벽 평형 진화적으로 안정된 전략 위험 우위 코어 샤플리 값 파레토 효율 깁스 평형 양자 반응 평형 자기 확인 평형 강한 나시 평형 마르코프 완전 평형
전략들	우세한 전략 순수전략 혼합 전략 전략-스틸링 인수 Tit for tat 그림 트리거 공모 후진 유도 전진 유도 마르코프 전략 입찰 셰이딩
반 사냥감의	협상문제 싸구려 말씨 글로벌 게임 자동 게임 평균 필드 게임 메커니즘 설계 n-플레이어 게임 완벽한 정보 대형 포아송 게임 포텐셜 게임 반복 게임 스크리닝 게임 신호 게임 엄격하게 결정된 게임 확률 게임 대칭 게임 제로섬 게임
게임.	가다 체스 무한 체스 체커스 틱택토 죄수의 딜레마 선물 교환 게임 선택형수의 딜레마 여행자의 딜레마 코디네이션 게임 치킨 지네 게임 루이스 시그널 게임 자원봉사자의 딜레마 달러 경매 성 전투 사슴 사냥 매칭 페니 얼티메이텀 게임 가위바위보 해적 게임 독재자 게임 공공재 게임 블로토 게임 소모전 엘 파롤 바 문제 공정분할 페어 케이크 커팅 쿠르노 게임 교착 상태 다이너의 딜레마 평균의 2/3을 추측하라. 쿤 포커 나시 흥정 게임 유도 퍼즐 트러스트 게임 공주와 괴물 게임 랑데부 문제
정리	화살의 불가능 정리 오만의 합의 정리 민속 정리 미니맥스 정리 내시의 정리 정화 정리 계시의 원리 제르멜로의 정리
키 수치	앨버트 W. 터커 아모스 트베르스키 앙투안 아우구스틴 쿠르노 아리엘 루빈스타인 클로드 섀넌 대니얼 카너 데이비드 K. 레빈 데이비드 M. 크렙스 도널드 B. 길리스 드루 푸덴베르크 에릭 마신 해럴드 쿤 허버트 사이먼 헤르베 물랭 존 콘웨이 장 티롤 장프랑수아 메르텐스 제니퍼 투어 체이스 존 하사니 존 메이너드 스미스 존 나시 존 폰 노이만 케네스 애로우 케네스 빈모어 레오니드 후르비츠 로이드 샤플리 멜빈 드레스허 메릴 M. 홍수 올가 본다레바 오스카르 모겐스턴 폴 밀그롬 페이턴 영 라인하르트 셀턴 로버트 액슬로드 로버트 아우만 로버트 B. 윌슨 로저 마이어슨 새뮤얼 보울스 수잔 스카치머 토머스 셸링 윌리엄 비크리
잡다한	올페이 경매 알파-베타 가지치기 베르트랑 역설 한정적 합리성 콤비네이터 게임 이론 대립분석 쿠페티션 진화 게임 이론 체스의 첫 동작의 이점 게임 설명 언어 게임 역학 게임 이론 용어집 게임 이론가 목록 게임 이론의 게임 목록 승리가 없는 상황 체스 풀기 위상 게임 공동체의 비극 작은 결정의 횡포

권한통제
일반	통합 권한 파일(독일)
기타	마이크로소프트 어학

Search