本論文討論了在聲場分析中兩個重要的議題:聲源定位及分離。演算法基於以平面波分解的基礎來實現定位與分離聲源訊號的目的。在定位的階段,以最小變異無失真響應法(Minimum Variance Distortionless Response, MVDR)與多重信號分類法(multiple signal classification, MUSIC)來測定平面波的方向。對於寬頻的訊號的應用上,同調(coherent)與非同調(incoherent)的技術被應用在定位處理上。在聲源分離階段,使用過定與未定設置的兩個方法被使用。過定的方法,使用Tikhonov正規化來實現聲源訊號的重建;而在未定的狀況下,轉向(steering)矩陣以定位出之方位再做增廣。因此,分離問題可以壓縮感知(compressive sensing, CS)的問題來表示且可以有效地以凸集合最佳化來求解。數值模擬與實驗以一24顆麥克風所組成的環形陣列來實現。客觀的語音品質感知試驗以及主觀的聆聽測試比較原始混雜訊號與分離出之訊號在語音品質上的提升。
This thesis examines two fundamental issues in sound field analysis: acoustic sources localization and separation. Algorithms are developed to locate and separate acoustic signals on the basis of plane-wave decomposition. In the localization stage, directions of plane waves are determined using either minimum variance distortionless response (MVDR) method or multiple signal classification (MUSIC) method. For broadband scenarios, coherent and incoherent techniques are utilized in the localization procedure. In the separation stage, two approaches with overdetermined and underdetermined settings can be employed. In the overdetermined approach, Tikhonov regularization (TIKR) is utilized to recover the source signals. In the underdetermined approach, the steering matrix is augmented by including the directions that have been determined in the localization stage. Hence, the separation problem is formulated into a compressive sensing (CS) problem which can be effectively solved by using convex (CVX) optimization. Simulation and experiments are conducted for a 24-element circular array. Sources Objective tests using perceptual evaluation of speech quality (PESQ) tests and subjective listening tests demonstrate that the proposed methods yield speech signals with well separated and improved quality, as compared to the mixed signals.
為了持續優化網站功能與使用者體驗,本網站將Cookies分析技術用於網站營運、分析和個人化服務之目的。
若您繼續瀏覽本網站,即表示您同意本網站使用Cookies。