透過您的圖書館登入
IP:18.219.215.178

並列摘要


The importance of automatically recognizing emotions in human speech has grown with the increasing role of spoken language interfaces in human-computer interaction applications. In this paper, a Mandarin speech based emotion classification method is presented. Five primary human emotions, including anger, boredom, happiness, neutral and sadness, are investigated. Combining different feature streams to obtain a more accurate result is a well-known statistical technique. For speech emotion recognition, we combined 16 LPC coefficients, 12 LPCC components, 16 LFPC components, 16 PLP coefficients, 20 MFCC components and jitter as the basic features to form the feature vector. Two corpora were employed. The recognizer presented in this paper is based on three cl4ssification techniques: LDA, K-NN and HMMs. Results show that the selected features are robust and effective for the emotion recognition in the valence and arousal dimensions of the two corpora. Using the HMMs emotion classification method, an average accuracy of 88.7% was achieved.

並列關鍵字

Mandarin emotion recognition LPC LFPC PLP MFCC

參考文獻


Ata, B .S.(1974).Effectiveness of Linear Prediction Characteristics of the Speech Wave for Automatic Speaker Identification and Verification.(Journal of the Acoustical Society of America).
Cheng, P.Y.(2002).Automated Recognition of Emotion in Mandarin.National Cheng Kung University.
Chuang, Z.J.,C.H. Wu(2004).Multi-Modal Emotion Recognition from Speech and Text.(International Journal of Computational Linguistics and Chinese Language Processing).
Cowie, R.,F. Douglas-Cowie,N. Tsapatsoulis,G. Votsis,S. Kollias,W. Fellenz,J.G. Taylor(2000).Emotion Recognition in Human-Computer Interaction.(IEEE Signal Processing Magazine).
Davis, S.,P. Mermelstein(1980).Comparison of Parametric Representations of Monosyllabic Word Recognition in Continuously Spoken Sentences.(IEEE Transactions on Acoustics).

被引用紀錄


Yeh, L. Y. (2010). 使用時頻變化調變於強健語音情緒辨識 [master's thesis, National Chiao Tung University]. Airiti Library. https://doi.org/10.6842/NCTU.2010.00683
Chang, Y. H. (2005). 運用權重式D-KNN分類法則之中文語音情緒辨識及評估 [master's thesis, Tatung University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0081-0607200917234331
Lin, Y. Y. (2007). 雜訊對語音情緒辨識影響之研究 [master's thesis, Tatung University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0081-0607200917241794
Liao, W. Y. (2009). 聽視覺特徵擷取在中文數字語音辨識之研究 [doctoral dissertation, Tatung University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0081-3001201315103781
林秋煌(2009)。語音情緒辨識在VoIP客服系統上的應用〔碩士論文,大同大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0081-0607200917250380

延伸閱讀