  • Thesis

改良式梅爾倒頻譜參數應用於關鍵字萃取

Improved Mel-scale Frequency Cepstral Coefficients for Keyword Spotting Technique

Advisor: 莊堯棠

Abstract


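In speech recognition systems, Mel-frequency cepstral coefficients (MFCCs) are commonly used feature parameters, and as their use has spread, many methods for improving them have been proposed. This thesis adjusts the weights applied to the energies of the triangular band-pass filter bank, using particle swarm optimization (PSO) to find the optimal weights. The fitness function is the difference between the energy statistics curve of the speech corpus and the envelope curve of the filter bank, so that the filter bank better matches the sensitivity of the human ear and recognition improves. Experimental results show that the improved MFCCs achieve better recognition than conventional MFCCs and are also more robust to high-frequency noise.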
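As an illustration of what weighting a triangular filter bank can look like, here is a minimal NumPy sketch. It is not the thesis's implementation: the function name `weighted_mel_filterbank`, its parameters, and the common `2595 * log10(1 + f / 700)` mel formula are assumptions chosen for the example.

```python
import numpy as np

def hz_to_mel(f):
    # A widely used mel-scale formula (assumed here; the thesis may use another).
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def weighted_mel_filterbank(n_filters, n_fft, sample_rate, weights=None):
    """Build triangular mel filters, each scaled by its own weight.

    Hypothetical helper: with weights=None every filter peaks at 1,
    which corresponds to an unweighted triangular filter bank.
    """
    if weights is None:
        weights = np.ones(n_filters)
    weights = np.asarray(weights, dtype=float)

    # n_filters + 2 edge frequencies, equally spaced on the mel scale.
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0),
                             n_filters + 2)
    hz_points = mel_to_hz(mel_points)

    # Centre frequency of each FFT bin from 0 Hz up to the Nyquist frequency.
    freqs = np.linspace(0.0, sample_rate / 2.0, n_fft // 2 + 1)

    bank = np.zeros((n_filters, freqs.size))
    for i in range(n_filters):
        left, center, right = hz_points[i:i + 3]
        rising = (freqs - left) / (center - left)
        falling = (right - freqs) / (right - center)
        # Triangle = min of the two slopes, floored at zero, then weighted.
        bank[i] = weights[i] * np.clip(np.minimum(rising, falling), 0.0, None)
    return bank
```

Under these assumptions, `weighted_mel_filterbank(20, 512, 16000).max(axis=0)` gives one possible notion of the filter bank's envelope, and changing `weights` raises or lowers individual triangles without moving their centres.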

Parallel Abstract


In speech recognition systems, Mel-frequency cepstral coefficients (MFCCs) are widely used feature parameters, and because they are applied so broadly in audio signal processing, many methods for improving them have been proposed. In this study, we use the particle swarm optimization (PSO) algorithm to optimize the weights of the MFCC filter bank. The fitness function is the difference between the energy statistics curve of the training speech corpus and the envelope of the MFCC filter bank. Experimental results show that the proposed MFCC method improves the recognition rate, and that it also improves recognition performance in noisy environments.
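The optimization idea described above can be sketched as follows. This is a simplified illustration, not the thesis's code: it uses an inertia-weighted PSO variant, the target curve is synthetic stand-in data rather than real corpus statistics, and the envelope is approximated by the filter weights themselves (the peak height of each triangle). All names and parameter values are assumptions.

```python
import numpy as np

def pso_minimize(fitness, dim, n_particles=30, n_iters=300,
                 bounds=(0.0, 2.0), inertia=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimize `fitness` with a basic inertia-weighted particle swarm."""
    rng = np.random.default_rng(seed)
    lo, hi = bounds
    pos = rng.uniform(lo, hi, size=(n_particles, dim))
    vel = np.zeros_like(pos)

    # Personal bests and the swarm-wide (global) best.
    best_pos = pos.copy()
    best_val = np.array([fitness(p) for p in pos])
    g = np.argmin(best_val)
    g_pos, g_val = best_pos[g].copy(), best_val[g]

    for _ in range(n_iters):
        r1 = rng.random(pos.shape)
        r2 = rng.random(pos.shape)
        # Pull each particle toward its own best and the global best.
        vel = (inertia * vel
               + c1 * r1 * (best_pos - pos)
               + c2 * r2 * (g_pos - pos))
        pos = np.clip(pos + vel, lo, hi)

        vals = np.array([fitness(p) for p in pos])
        improved = vals < best_val
        best_pos[improved] = pos[improved]
        best_val[improved] = vals[improved]

        g = np.argmin(best_val)
        if best_val[g] < g_val:
            g_pos, g_val = best_pos[g].copy(), best_val[g]
    return g_pos, g_val

# Synthetic stand-in for the corpus energy statistics curve, sampled at
# the filter centres (hypothetical data for illustration only).
n_filters = 20
centers = np.linspace(0.0, 1.0, n_filters)
target_curve = 0.5 + np.exp(-((centers - 0.3) ** 2) / 0.05)

def fitness(weights):
    # Squared difference between the target curve and the envelope,
    # where the envelope is approximated by the per-filter peak weights.
    return float(np.sum((weights - target_curve) ** 2))

best_weights, best_error = pso_minimize(fitness, dim=n_filters)
```

In a fuller setup the fitness would compare a measured energy curve against the envelope of an actual weighted filter bank. The per-centre comparison here only keeps the sketch short.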

Keywords


MFCC; PSO; keyword spotting


Cited by


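唐曲亮 (2015). 改良式梅爾倒頻譜係數混合多種語音特徵之研究 [A study of improved Mel-frequency cepstral coefficients combined with multiple speech features] [Master's thesis, National Central University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0031-0412201512055340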
