基於多模型架構之語者辨認系統

本論文我們提出了新的多模型語者確認架構，主要的研究方向是結合高斯混合模型(Gaussian Mixture Model, GMM)、支撐向量機模型(Support Vector Machine, SVM)與模糊模型(Fuzzy Model)以對傳統的單一模型方式之語者確認系統做進一步的辨識性能改良。在高斯混合模型與支撐向量機模型之整合的多模型之語者辨認架構中，我們提出了平行式整合及序列式整合兩種機制，其分別為Voting-GMMSVM及GMM-dependent SVM 等兩種語者辨識方法。所提出之兩個方法經過三種語音資料庫的實驗測試得以驗證以有效性。與傳統高斯混合模型及支撐向量機分類器相較之下，Voting-GMMSVM之76.27%及GMM-dependent SVM之77.41%的辨識效能皆有大幅度的上升且具備競爭力。在支撐向量機模型、模糊模型與高斯混合模型之整合的多模型之語者辨認架構中，我們提出了FDoMV&ID-SVM辨識方法，該方法藉由模糊控制器之依據合法語者與仿冒語者之兩類高斯混合模型之模型平均向量差異及相似度分數差異來調整SVM的邊界大小進而提昇SVM分類器的辨識準確度。實驗結果證實所提出之FDoMV&ID-SVM方法得以再改善傳統SVM方法之辨識性能。所發展之強化SVM的FDoMV&ID-SVM方法更進一步再導入於前述所提出之Voting-GMMSVM與GMM-dependent SVM等高斯混合模型與支撐向量機模型混合方法以達到最大化的辨識效能。所導入之方式即是直接將此兩種混合方法中之SVM分類器直接替換為FDoMV&ID-SVM分類器。由實驗結果得知在強化SVM後之新的多模型之語者確認架構於強化之Voting-GMMSVM及強化之GMM-dependent SVM各有著78.51% 和81.84%之較為準確的辨識率，實驗結果亦證實了此類經強化SVM分類器後之多模型架構的有效性。

關鍵字

語者確認；高斯混合模型；支撐向量機；模糊模型；支撐向量機邊界

並列摘要

In this thesis, we present a new multi-model speaker recognition framework. The main purpose of this thesis is to combine the GMM model, the SVM model and the fuzzy model to enhance the performance of the conventional single model speaker verification scheme. In the speaker verification framework of GMM and SVM dual-model combination, we present the parallel-style and serial-style model combination methods, which are Voting-GMMSVM and GMM-dependent SVM, respectively. Both of the proposed methods can be validated to be effective from the experimental results. Compared with conventional SVM-based and GMM-based speaker verification, the recognition rats of the developed Voting-GMMSVM and GMM-dependent SVM are a little more satisfactory, which achieve the performance of 76.27% and 77.41%, respectively. In addition, we present the FDoMV&ID-SVM method to combine SVM, GMM and fuzzy models. This developed method is to use the distance of mean vectors and the difference of likelihood scores between valid speakers and invalid speakers GMM models to build the fuzzy model. The experimental results show that the proposed FDoMV&ID-SVM can improve the recognition performance of conventional SVM. Furthermore, the improved SVM classifier, the FDoMV&ID-SVM, is further integrated into the multi-model speaker verification framework. The SVM classifiers of Voting-GMMSVM and GMM-dependent SVM are directly replaced with the FDoMV&ID-SVM classifiers. The recognition performances of improved Voting-GMMSVM and GMM-dependent SVM multi-model frameworks are improved, which are 78.51% and 81.84%, respectively. The experiment results prove the validness of the developed multi-model speaker verification.

並列關鍵字

Speaker Verification ； Gaussian Mixture Model ； Support Vector Machines ； Fuzzy Model ； SVM Margin

參考文獻

[3] C. -S. Jung, M. Y. Kim and H.-G. Kang, “Selecting feature frames for automatic speaker recognition using mutual information,” IEEE Transactions on Audio, Speech, and Language Processing, Vol. 18, No. 6, pp. 1332–1340, 2010.

[4] N. Wang, P. C. Ching, N. Zheng and T. Lee, “Robust speaker recognition using denoised vocal source and vocal tract features,” IEEE Transactions on Audio, Speech, and Language Processing, Vol. 19, No. 1, pp.196–205, 2011.

[5] R. Saeidi, J. Pohjalainen, T. Kinnunen and P. Alku, “Temporally weighted linear prediction features for tackling additive noise in speaker verification,” Signal Processing Letters, Vol.17, No. 6, pp. 599–602, 2010.

[6] T. H. Falk and W. -Y. Chan, “Modulation spectral features for robust far-field speaker identification,” IEEE Transactions on Audio Speech, and Language Processing, Vol. 18, No. 1, pp. 90–100, 2010.

[7] K. Kim and M. Y. Kim, “Robust speaker recognition against background noise in an enhanced multi-condition domain,” IEEE Transactions on Consumer Electronics, Vol. 56, No. 3, pp. 1684–1688, 2010.

被引用紀錄

許晏銘（2013）。基於動態規劃之機器學習方法於小字彙DTW語音辨識系統之研究〔碩士論文，國立虎尾科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0028-3007201315014600

劉建廷（2013）。基於音訊方式之人類活動辨識〔碩士論文，國立虎尾科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0028-2708201422411900

歐大誠（2013）。應用於語者確認之支撐向量機參數最佳化研究〔碩士論文，國立虎尾科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0028-3007201315370100

吳宗桂（2015）。運用KINECT姿態辨識的使用者辨識研究〔碩士論文，國立虎尾科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0028-3107201501460900

國際替代計量

基於多模型架構之語者辨認系統

未授權

主題瀏覽