語音特徵參數擷取之濾波器改良

本論文研究之主題為針對語音關鍵詞辨識系統中的特徵參數擷取部分進行改良。在整個關鍵詞辨識系統的架構中，擷取語音特徵參數主要是想凸顯每段不同聲音個別的特性，並且在擷取的過程又可達到減低資料量的效果，很多學者都曾在文獻中提出不同的方式來擷取出語音特徵參數，或是對其中的擷取方法來進行改良。　　本論文主要為討論在梅爾倒頻譜係數中數種改良後的濾波器組，將效果最好的濾波器組取代原本的梅爾三角濾波器組，經實驗結果發現，應用此改良後的濾波器組能夠提升關鍵詞萃取系統的辨識率，故證明此濾波器組能有效的加強擷取出之語音的特性。

關鍵字

梅爾濾波器組；語音特徵；關鍵詞萃取

並列摘要

The theme of this thesis is to improve the part of feature extraction in the speech keyword recognition. In the framework of the entire keyword recognition system, feature extraction is to highlight the individual features of different voices, and can reduce the amount of data by means of the extract process. Many researchers have presented different ways to extract the speech features in the literature, or on which making improvements at extracting feature coefficient method. 　　This thesis discusses several improved filter bank in mel-frequency cepstral coefficients (MFCC). The best filter bank is used to replace the original mel-triangular filter set. The experimental results showed that the application of this improved filter bank can effectively improve the recognition rate of the keyword extraction system.

並列關鍵字

mel-filterbank ； speech feature ； keyword spotting

參考文獻

[1] Juang B. H., “Speech recognition in adverse environment,” Computer Speech and language, 5, pp275-294, 1991.

[3] Mansour, D. and Juang, B.H., “The short-time modified coherence representation and noisy speech recognition, ” Acoustics, Speech and Signal Processing, IEEE Transactions on, vol.37, no.6, pp.795-804, 1989.

[5] Shannon, B. J. and Paliwal, K. K., “Feature extraction from higher-lag autocorrelation coefficients for robust speech recognition,” Science Direct Speech Communication, Vol.48, pp. 1458-1485, 2006.

[9] Davis, S. and Mermelstein, P., “Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences,” Acoustics, Speech and Signal Processing, IEEE Transactions on, vol.28, no.4, pp.357-366, 1980.

[13] Wilpon, J. G., Rabiner, L., Chin-Hui, Lee. and Goldman, E.R., “Automatic recognition of keywords in unconstrained speech using hidden Markov models,” Acoustics, Speech and Signal Processing, IEEE Transactions on, vol.38, no.11, pp.1870-1878, 1990.

被引用紀錄

唐曲亮（2015）。改良式梅爾倒頻譜係數混合多種語音特徵之研究〔碩士論文，國立中央大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0031-0412201512055340

國際替代計量

語音特徵參數擷取之濾波器改良

未授權

主題瀏覽