透過您的圖書館登入
IP:3.227.239.9
  • 學位論文

基於聯合近似對角化之即時語音分離系統

A Real-time Speech Separation System based on Joint Approximate Diagonlization Algorithm

指導教授 : 王小川

摘要


本論文主要探討的問題為真實環境下的即時分離摺積混合盲聲源之研究。在一般環境中的聲源混合過程,因混合通道具有記憶性,混合過程為摺積混合,其運算量遠大於瞬時混合,所以處理摺積混合的即時系統文獻是比較少見的,也是可以說專門為了分離語音訊號的討論。為了使系統有較佳的效率,我們採用了在頻域執行盲聲源分離,但伴隨發生之不確定因素也是本篇討論的重點。運用獨立成份分析演算法達成盲聲源分離,核心演算法為聯合對角化演算法。其中聯合近似對角化演算法其利用二階統計的特性使得分離後訊號有最大的獨立性值,應用此方法解決盲訊號分離問題並不用對混合訊號做集中化和白色化的前處理,能避免訊號因前處理而造成統計特性的改變。在即時盲聲源分離系統,系統的輸入僅是少量的語音片段,如何利用少輸入訊求出準確之分離資訊也是本篇論文的重點。我們應用線上處理的架構來實現即時分離系統,也利用線上處理的架構,設計出以遞迴架構解決排列問題。利用Simulink來實現即時系統,實驗時利用兩支麥克風在真實環境分離兩位語者的混合聲音,立用錄音界面將兩個麥克風訊號輸入電腦計算分離資訊,並立用累加分離資訊來得到即時分離的音檔,語音分離的成果也可接受和利用批次處理的系統並無太大差別。

並列摘要


Real-time blind source separation (BSS) is a technique to recover independent sources from the mixed signals in online system without any prior knowledge of the sources and the mixing channels. This thesis is a study of BSS problem for speech signals recorded in real environment. The research can be divided into three parts. One is to decorrelate mixing signals in time-frequency domain, which use covariance matrix to measure the independence components. The ideal has to derive the algorithm that we can get the clean speech from different channel. The other is to use that characteristic of human hearing. By this way, we can change the algorithm by adding more parameter that can make the algorithm speed the computing time via low complexity, so we can went the system be implemented at real-time system. Obviously, it is not suitable in realistic environment implementation. Therefore, last is to make our system become an on-line algorithm by using speech signals are non-stationary in time domain. The processing runs accurately learning optimal values in the complicated space for these delays and attenuations with the previous and moment input. Our BSS system for acoustic source separation can implement in a realistic environment and the result show it has better performance with decreased computational complexity.

參考文獻


[30] 陳世勛, “針對摺積混合的加速聯合近似對角化盲訊號分離方法,”P.29-32,(2008).
[15] Shuxue Ding, Jie Huang, Daming Wei and Andrzej Cichocki, “A Near Real-Time Approach for Convolutive Blind Source Separation,” IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—I: REGULAR PAPERS, VOL. 53, NO. 1, JANUARY (2006).
[1] Te-Won Lee and Gil-Jin Jang. “The statistical structures of male and female speech signals.” ICASSP'01 Vol:1, Page:105-108 (2001)
[2] A.Hyvärinen and E. Oja, “A fast fixed-point algorithm for independent component analysis,” Neural Comput., vol. 9, pp. 1483–1492, (1997).
[3] Pau Bofill and Enric Monte. “Underdetermined convoluted source reconstruction using LP and SOCP, and a neural approximator of the optimizer.” ICA (2006)

被引用紀錄


梁翰銘(2012)。利用粒子濾波器與麥克風陣列進行直角座標上多聲源之追蹤〔碩士論文,國立清華大學〕。華藝線上圖書館。https://doi.org/10.6843%2fNTHU.2012.00699

延伸閱讀