混合均衡之演化：期許水準於零和賽局之應用

本文研究在特定類型的兩人零和賽局裡—雙方互有輸贏、具有唯一且為混合的Nash均衡，當玩家皆為追求滿足者(satisfier)時，其混合均衡如何被演化出來。我所設定的超賽局結構為：每期玩家選擇策略，重複玩N次賽局，N夠大，重複玩無限多期。我所提出的學習法為：玩家盡量嘗試各種方法來達到期許水準，如果不能滿足則降低期許水準，並有微小的機率允許玩家嘗試實驗或犯錯。我只假設理性策略為有限個，且包含混合均衡策略。在上述設定之下，我應用Young(1993, 1998)的方法證明雙方期許為零，且皆採用混合均衡策略，必為隨機穩定狀態，但同時存在其他隨機穩定狀態。這個結論部分解決了Crawford難題，且玩家不只是「看起來」像使用混合均衡行動，而是真的採用混合均衡行動。我並以兩性戰爭賽局為例，說明純粹均衡未必比混合均衡穩定。

關鍵字

零和賽局；演化；期許水準；混合均衡

並列摘要

無資料

並列關鍵字

zero-sum game ； aspiration ； stochastically stable state ； mixed Nash equilibrium

參考文獻

Börgers, Tilman, and Rajiv Sarin (2000)：”Naïve reinforcement learning with endogenous aspirations” International Economic Review 41, 921-950

Crawford, Vincent P. (1974)：”Learning the optimal strategy in a zero-sum game” Econometrica 42, 885-891

Crawford, Vincent P. (1985)：”Learning behavior and mixed-strategy Nash equilibria” Journal of Economic Behavior anf Organization 6, 69-78.

Karandikar, R., D. Mookherjee, D. Ray, and F. Vega-Redondo (1998)：“Evolving aspirations and cooperation” Journal of Economic Theory 80, 292-331

Lambson, Val E., and Daniel A. Probst (2004)：”Learning by matching patterns” Games and Economic Behavior 46, 398-409

被引用紀錄

鄔廸嘉（2015）。以委託代理人模型探討空間文化資產：以金門為例〔博士論文，國立臺灣大學〕。華藝線上圖書館。https://doi.org/10.6342/NTU.2015.00781

國際替代計量

混合均衡之演化：期許水準於零和賽局之應用

全文下載

主題瀏覽