應用小波封包轉換於PDA之語音辨認研究

本論文在建立一套以PDA為操作介面之語音控制系統，本系統在Window CE作業系統下，以Pocket PC為發展平台，利用eMbedded Visual C++ 3.0和MFC等工具開發。首先利用能量與越零率的語音訊號切割技術，把語者聲音的部分擷取出來。在作時頻分析時利用小波包(Wavelet Packet)多頻帶解析的能力，架構適當出小波包分解樹來進行特徵參數抽取。訓練時經過二元分裂法(Binary splitting)建立向量量化碼本，並以離散型隱藏式馬可夫模型(DHMM, Discrete Hidden Markov Models)建立語音模型後，再使用波氏演算法(BaumWelch Algorithm)做調適。辨識時採用維特比演算法(Viterbi algorithm)來計算最佳辨認的機率。本系統透過Windows API與POOM(Pocket Outlook Object Model)程式介面，來執行辨識後的指令動作。其功能包括查詢聯絡人的各種資訊、預約行程，以及連線上網等…動作。

關鍵字

隱藏式馬可夫模型；小波封包轉換；數位訊號處理

並列摘要

In this thesis, a speech recognition for controlling PDA is implemented. The System is build with components that include Pocket PC platform, Windows CE, eMbedded Visual C++ 3.0, and MFC. First, the speech of speaker segmented by utilized the energy detecting and zero crossing technology. During Time-Frequency analysis stage, Utilizing the ability of Multi-Band resolution of wavelet packet Constructed the wavelet packet decomposition tree to extract speech feature. During training stage, to build vector quantization codebook used Binary splitting ,and then to build speech model use Discrete Hidden Markov Models, and then to adapt speech model employ BaumWelch Algorithm. During recognition stage,the system Used Viterbi algorithm to calculate the best probability. The system used the interface of the Windows API and POOM to implement the instructs recognized. The function of system include inquiry, making appointments and exploring internet, etc.

並列關鍵字

Wavelet Packet Transform Hidden Markov Models ； Digital Signal Process

參考文獻

[1] C.S. Burrus , DFT/FFT and Convolution Algorithm, Wiley , 1985.

[8] D. O’Shaughnessy, Speech Communication: Human and Machine, Addison

Wesley, 1987.

[9] J.R. Deller, Discrete-Time Processing of Speech Signals, Macmillan, 1993.

[11] A.V. Oppenheim and R.W. Schafer, Discrete-Time Signal Processing,

被引用紀錄

林昱廷（2010）。以HHT研究氣候變遷對於濁水溪流域降雨之影響〔碩士論文，淡江大學〕。華藝線上圖書館。https://doi.org/10.6846/TKU.2010.00147

黃漢強（2007）。具備多重辨識器並以PDA為平台之車牌辨識系統〔碩士論文，中原大學〕。華藝線上圖書館。https://doi.org/10.6840/cycu200700469

廖信樵（2006）。以特徵值分解法分離單通道之母體心電圖與胎兒心電圖〔碩士論文，中原大學〕。華藝線上圖書館。https://doi.org/10.6840/cycu200600676

國際替代計量

應用小波封包轉換於PDA之語音辨認研究

未授權

主題瀏覽