透過您的圖書館登入
IP:3.139.86.108
  • 期刊

以增強學習模型與貝氏因子探討酬賞預測誤差的男女差異及性格相關

Using a Reinforcement Learning Model and Bayes Factors to Study Gender Differences and Personality Correlations in Reward Prediction Errors

摘要


本研究使用動態酬賞作業與三向度性格量表以大學生為受試探討酬賞預測誤差的男女差異及性格相關。我們用一增強學習模型作適配,並藉貝氏因子作參數組間比較。配對法則分析結果顯示男性酬賞敏感度比女性高,表示男性選擇優勢牌的傾向比女性佳。增強學習模型參數估計發現女性的學習速率較男性略高,亦即當預期與實際經驗不符時,女性傾向給予預測誤差較大權重,較快改變信念。而女性的選擇固著性較男性小,代表女性在作業中探索的程度高於男性。另外,相較於男性組,女性受試者的學習速率與作業總分呈負相關,而男女受試的選擇固著性皆與作業總分呈正相關。這些結果顯示男性相對於女性,平均而言有較接近「最佳化」的學習速率與較大的選擇固著性,為此作業男性較佳的選擇優勢牌傾向提供了一可能解釋。男女性在三向度性格量表各向度分數間無甚差異,然而,相關分析發現量表部分次向度的分數與增強學習模型參數間存有性別差異:女性受試學習速率與「新奇追求」分向度「揮豁性」呈負相關;類似負相關亦存在於女性受試選擇固著性與「酬賞依賴」分向度「親近性」。男性組則沒有發現任何顯著相關。這些結果顯示女性相對於男性,部分性格特徵與學習表現有關,然確切關聯有賴後續研究。

並列摘要


We used a dynamic reward task to study gender differences in reward prediction errors in college students. Experimental data were fit by a reinforcement learning model using the Bayesian estimation approach. We also correlated subjects' performance with their personality traits assessed by the Tridimensional Personality Questionnaire (TPQ). Matching law analysis revealed that males are more sensitive than females, meaning that males are better than females in choosing the advantageous card. Model fitting together with Bayes factors revealed that females update their values slightly faster than males, indicating that females have the tendency to give the trial-by-trial prediction error more weight. Further, females showed less perseveration than males, indicating that females tend to explore the uncertainty of future events more often. We also found that females' learning rate is negatively correlated with their task performance, whereas both males and females' degree of perseveration is positively correlated with their task scores. These results indicate that on average, males have a more 'optimal' learning rate and a larger degree of perseveration than females, providing a plausible explanation of why males performed slightly better than females in the task. Regarding the relation of task performance and personality traits, correlational analysis revealed that females with higher scores in "extravagance" (which is in the novelty-seeking dimension of TPQ) have lower value-updating rates. Moreover, females with higher scores in "attachment" (which is in the reward-dependence dimension of TPQ) have lower degrees of perseveration. No such findings were found in males. Further work is needed to verify the findings.

參考文獻


Batchelder, W. H.,Anders, R.(2012).Cultural consensus theory: Comparing different concepts of cultural truth.Journal of Mathematical Psychology.56,316-332.
Baum, W. M.(1974).On two types of deviation from the matching law: Bias and undermatching.Journal of the Experimental Analysis of Behavior.22,231-242.
Chen, W. J.,Chen, H. M.,Chen, C. C.,Yu, W. Y.,Cheng, A. T. A.(2002).Cloninger's tridimensional personality questionnaire: Psychometric properties and construct validity in Taiwanese adults.Comprehensive Psychiatry.43,158-166.
Chen, Y. C.,Chen, Y. W.,Hsu, Y. F.,Chang, W. T.,Hsiao, C. K.,Min, M. Y.(2012).Akt1 deficiency modulates reward learning and reward prediction error in mice.Genes, Brain and Behavior.11,157-169.
Cloninger, C. R.(1987).Neurogenetic adaptive mechanisms in alcoholism.Science.236,410-416.

延伸閱讀