In this thesis, we present a system architecture that improves the accuracy of speech emotion recognition. Using maximum a posteriori (MAP) adaptation with speaker-dependent or speaker-independent adaptation utterances, speaker and emotion information is incorporated into the original Gaussian mixture model (GMM) and universal background model (UBM), yielding emotion models that represent the different emotional states more precisely and thus achieve higher recognition accuracy. Two common speech features, Mel-frequency cepstral coefficients (MFCC) and perceptual linear predictive cepstral coefficients (PLPCC), are used to validate the efficacy of the proposed adaptive structure. Experiments conducted on the widely used emotion corpus Emotional Prosody Speech and Transcripts show that the recognition accuracy of the MAP-adapted emotion models is clearly higher than that of the original GMMs without adaptation, confirming the improvement provided by the proposed architecture.
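The abstract does not give the adaptation formulas, but the standard mean-only MAP adaptation used in GMM-UBM systems can be sketched as follows. This is a minimal illustration, not the thesis's actual implementation: the function name, the diagonal-covariance assumption, and the relevance factor of 16 are all assumptions chosen for the example.

```python
import numpy as np

def map_adapt_means(means, covs, weights, X, relevance=16.0):
    """MAP-adapt the means of a diagonal-covariance GMM (UBM) toward data X.

    means: (K, D) component means; covs: (K, D) diagonal covariances;
    weights: (K,) mixture weights; X: (N, D) adaptation frames
    (e.g. MFCC or PLPCC vectors). Returns the adapted (K, D) means.
    """
    # Log-likelihood of each frame under each diagonal Gaussian.
    diff = X[:, None, :] - means[None, :, :]                  # (N, K, D)
    log_prob = -0.5 * (np.sum(diff ** 2 / covs, axis=2)
                       + np.sum(np.log(2 * np.pi * covs), axis=1))
    log_post = np.log(weights) + log_prob                     # (N, K)
    log_post -= log_post.max(axis=1, keepdims=True)           # numerical safety
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)                   # responsibilities

    n_k = post.sum(axis=0)                                    # soft counts (K,)
    E_k = post.T @ X / np.maximum(n_k, 1e-10)[:, None]        # first-order stats
    alpha = n_k / (n_k + relevance)                           # data-dependent weight
    # Components with much adaptation data move toward E_k;
    # components with little data stay close to the UBM means.
    return alpha[:, None] * E_k + (1 - alpha[:, None]) * means
```

The relevance factor controls how strongly the prior (UBM) resists the adaptation data: components that see many frames are pulled toward the data mean, while unobserved components remain essentially unchanged, which is what makes MAP adaptation robust with small emotion- or speaker-specific corpora.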