類神經網路及隱藏式馬可夫理論應用於ＰＤＡ之研究

本研究以PDA為平台之語音控制系統，討論以類神經網路為主之向量量化過程，對語音系統辨識率的影響。使用的方法包括利用數位訊號處理技術擷取語音特徵參數，向量量化方法作前處理，以及隱藏式馬可夫模型為主的辨識及訓練演算法。　　特徵擷取使用梅爾倒頻譜係數(MFCC，Mel-Frequency Cepstrum Coefficient)。向量量化採用改良k means的二分法，類神經網路的自我組織特徵映射網路(Self-Organizing Feature Map network)，與頻率感應競爭式學習網路(Frequency-Sensitive Competitive Learning)三種方法，並對此三種做法逐一探討。在訓練階段，語音的特徵參數透過Baum-Welch演算法來訓練各個隱藏式馬可夫模型(Hidden Makov Model)內的參數。在辨識階段，使用維特比演算法(Viterbi algorithm)快速的求出機率的近似值，並透過Windows API程式介面，來執行辨識後的指令動作。其功能包括預約行程，以及連線上網…等，經由語音輸入指令，使得操作PDA更加便利。

關鍵字

PDA ；數位信號處理；梅爾倒頻譜；隱藏式馬可夫模型；類神經網路

並列摘要

This research is on the speech control system constructed on Personal Digital Assistant (PDA), and discuss the process of vector quantization how to affect the recognition rate under this system. Process methods include how to extract speech feature vector, preprocess of vector quantization, and hidden Markov model for training and recognition algorithm. The feature vector extraction use Mel Frequency Cepstrum Coefficient. Vector quantization use three methods, including binary splitting improved by k-mean clustering algorithm, neural network’s self organizing map and frequency sensitive competitive learning. And then discuss the three methods sequentially. During training stage, feature parameters in hidden Markov model are trained by Baum-Welch algorithm. During recognition stage, use Viterbi algorithm to find out the approximate value of probability quickly. And then via program interface of Windows Application Programming Interface to execute the instruction after recognition. The functions include making appointment and exploring Internet, etc. Via speech to input command makes more convenient to operate PDA.

並列關鍵字

Hidden Markov Model ； Digital Signal Process ； Mel Frequency Cepstrum ； PDA ； Neural Network

參考文獻

[21] 李建平，語音辨認應用於PDA之作業控制研究，中原大學資訊工程所碩士論文，2001。

[4] E.O. Brigham, The Fast Fourier Transform, Prentice-Hall, 1974.

[5] L.R. Rabiner and B.H. Juang, Fundamentals of Speech Recognition, Prentice Hall, 1993.

[7] R.M. Gray, “Vector Quantization” IEEE ASSP Magazine, pp. 4-29, Apr. 1984.

[9] J.G. Wilpon and L.R. Rabiner, “A Modified K-Means Clustering Algorithm for Use Isolated Word Recognition,” IEEE Trans. on Acoustics, Speech, and Signal Proc., Vol. 33, No. 3, pp. 587-594, June 1985.

被引用紀錄

張云箐（2007）。最小均方演算法以及功率頻譜密度差異值用於雜訊消除的分析〔碩士論文，國立臺北科技大學〕。華藝線上圖書館。https://doi.org/10.6841/NTUT.2007.00019

國際替代計量

類神經網路及隱藏式馬可夫理論應用於ＰＤＡ之研究

未授權

主題瀏覽