Stochastic Stability of Mixed Equilibria: An Application of Aspiration Levels to Zero-Sum Games

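For two-player zero-sum games with a unique mixed Nash equilibrium, we introduce aspiration levels and construct a supergame model. In each period, each player selects a strategy and plays the zero-sum game N times with it; play continues in this way for infinitely many periods. Each player decides whether to keep her current strategy or pick a new one by checking whether its performance in the previous period reached her aspiration level. When no strategy can satisfy the aspiration level, the player lowers it. With a small probability, players experiment or make mistakes. Using the stochastic stability approach of Young (1993, 1998), we prove that the state in which both players have an aspiration level of zero and play the mixed equilibrium strategies is one of the stochastically stable states. We thereby partially resolve the Crawford puzzle. Using the Battle of the Sexes as an example, we also show that a mixed equilibrium can be as stable as a pure one.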

Keywords

zero-sum game; mixed Nash equilibrium; aspiration level; stochastic stability

Parallel Abstract

This paper aims to provide a theoretical foundation for how players learn to play mixed strategies and to analyze the stochastic stability of the unique mixed Nash equilibrium in zero-sum games. We construct a supergame in which, in each period, each player selects a strategy and uses it to play a zero-sum game for N rounds. Each player then compares the average payoff received over the N rounds with her aspiration. If the former exceeds the latter, the player is satisfied and keeps the same strategy for the next period; otherwise she randomly selects a new strategy from the set of feasible strategies that have not yet been tried. If all strategies have been tried and none of them fulfills the player's current aspiration, the player lowers her aspiration. Players also make mistakes with a small probability when adjusting their strategies or aspirations. We apply the stochastic stability approach proposed by Young (1993), combined with the aspiration hypothesis, and show that the unique mixed Nash equilibrium outcome is stochastically stable in a zero-sum game. We also use the Battle of the Sexes to illustrate that a mixed equilibrium can be as stable as a pure one.
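The adjustment rule described in the abstract can be sketched as a small simulation. This is only an illustrative sketch, not the paper's model: the game (matching pennies), the finite grid of mixed strategies, the starting aspiration, the size of the downward aspiration step, the treatment of a mistake as a uniformly random strategy choice, the use of "at least" rather than "strictly above" for satisfaction, and all names (`Player`, `average_payoff`, `simulate`) are assumptions introduced here.

```python
import random

# Assumed example game: matching pennies. Entries are the row player's
# payoffs; the column player receives the negative (zero-sum).
PAYOFF = [[1, -1], [-1, 1]]


def average_payoff(p, q, n_rounds, rng):
    """Row player's average payoff over n_rounds.

    p and q are the probabilities that the row and column player,
    respectively, choose action 0 in each round.
    """
    total = 0
    for _ in range(n_rounds):
        i = 0 if rng.random() < p else 1
        j = 0 if rng.random() < q else 1
        total += PAYOFF[i][j]
    return total / n_rounds


class Player:
    """Aspiration-based learner over a finite strategy set (hypothetical)."""

    def __init__(self, strategies, aspiration, step, rng):
        self.strategies = strategies
        self.aspiration = aspiration
        self.step = step              # assumed size of each aspiration cut
        self.rng = rng
        self.current = rng.choice(strategies)
        self.tried = {self.current}   # strategies tried at this aspiration

    def update(self, avg_payoff, epsilon):
        # Assumption: a mistake or experiment is a uniformly random draw.
        if self.rng.random() < epsilon:
            self.current = self.rng.choice(self.strategies)
            self.tried = {self.current}
            return
        # Satisfied: keep the current strategy (">=" is an assumption).
        if avg_payoff >= self.aspiration:
            return
        untried = [s for s in self.strategies if s not in self.tried]
        if untried:
            # Dissatisfied: switch to a strategy not yet tried.
            self.current = self.rng.choice(untried)
            self.tried.add(self.current)
        else:
            # Every strategy failed: lower the aspiration and start over.
            self.aspiration -= self.step
            self.tried = {self.current}


def simulate(periods=200, n_rounds=50, epsilon=0.01, seed=0):
    """Run the period-by-period dynamics and return the state history."""
    rng = random.Random(seed)
    grid = [0.0, 0.25, 0.5, 0.75, 1.0]   # assumed finite set of mixtures
    row = Player(grid, aspiration=1.0, step=0.25, rng=rng)
    col = Player(grid, aspiration=1.0, step=0.25, rng=rng)
    history = []
    for _ in range(periods):
        u = average_payoff(row.current, col.current, n_rounds, rng)
        row.update(u, epsilon)
        col.update(-u, epsilon)          # zero-sum: column gets -u
        history.append((row.current, row.aspiration,
                        col.current, col.aspiration))
    return history
```

Calling `simulate()` returns one tuple per period with each player's current mixture and aspiration, which can be inspected to see how often the state sits at the mixture 0.5 for both players. The sketch makes no claim about which states are stochastically stable; that is the subject of the paper's proof.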

Parallel Keywords

zero-sum game; mixed Nash equilibrium; aspiration; stochastic stability

References

Pangallo, Marco, Torsten Heinrich, and J. Doyne Farmer (2019), “Best Reply Structure and Equilibrium Convergence in Generic Games,” Science Advances, 5(2), DOI: 10.1126/sciadv.aat1328.

Börgers, Tilman and Rajiv Sarin (2000), “Naïve Reinforcement Learning with Endogenous Aspirations,” International Economic Review, 41(4), 921–950.

Brown, George William (1951), “Iterative Solution of Games by Fictitious Play,” in Tjalling Charles Koopmans (ed.), Activity Analysis of Production and Allocation, Cowles Commission Monograph, New York: John Wiley and Sons, Inc., 374–376.

Chong, Juin-Kuan, Teck-Hua Ho, and Colin Camerer (2016), “A Generalized Cognitive Hierarchy Model of Games,” Games and Economic Behavior, 99, 257–274.

Conlisk, John (1993a), “Adaptation in Games: Two Solutions to the Crawford Puzzle,” Journal of Economic Behavior and Organization, 22(1), 25–50.
