
以微機電麥克風陣列實現強健性語音辨識

Implement of MEMS Microphone Array for Robust Speech Recognition

Advisor: 廖元甫

Abstract


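As demand for audio technology grows, applications of microphone arrays are becoming increasingly diverse, for example noise and echo suppression, sound source tracking, and speaker recognition on handheld devices. This thesis designs a microphone array that captures data synchronously and builds a complete system architecture for verifying a variety of algorithms. To this end, we use a microphone array based on micro-electro-mechanical systems (MEMS). The MEMS fabrication process integrates readily with integrated circuits, offering advantages such as low power consumption and small size. We built the microphone array and processed its output with several algorithms to confirm that the system functions and operates correctly. We implemented sound source tracking, a delay-and-sum beamformer, an adaptive beamformer, and a subband maximum-likelihood beamformer. In the sound source tracking application, we obtained excellent tracking performance. In validating the speech recognition algorithms, however, the results fell short of expectations. Analysis showed that circuit noise lowered the signal-to-noise ratio, which in turn degraded the recognition results. Further analysis and improvement of circuit performance are still needed so that the microphone array can reach its full potential across its various applications.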

English Abstract


As the demand for voice technology grows, microphone array applications are becoming more diverse, for example noise and echo suppression, sound source tracking, and speaker recognition in handheld devices. In this paper we design a microphone array for synchronous data capture and establish a complete system architecture for testing various algorithms. We therefore use a MEMS-based microphone array. The MEMS manufacturing process is easy to combine with integrated circuits, giving advantages such as low power consumption and small size. We implement the microphone array and process its signals with a variety of algorithms to confirm that the system functions and operates correctly. We implement sound source tracking, a delay-and-sum beamformer, an adaptive filter beamformer, and a subband maximum likelihood beamformer. In the sound source tracking application, we obtain excellent tracking results. In the validation of speech recognition algorithms, the results are not as expected. Analysis shows that circuit noise reduces the SNR, thus affecting the recognition results. In the future, we will continue to analyze and improve the circuit performance so that the microphone array can achieve its maximum effectiveness across its various applications.

Keywords

microphone array; speech recognition

