改善最小錯誤鑑別式之語者辨認方法

在語者辨認中，能夠有效的訓練語料是非常重要的，因為這對辨識的效果是有很大的影響。到目前為止，傳統的語者模型都還是以最大相似度為準則，這在擁有大量訓練語料之下確實是有很好的效果，但在極少量訓練語料下卻不然，並且最大相似度估計的方法是，利用同一個語者的訓練語料去訓練出這個語者的模型，跟其它語者的訓練語料並無相關。，而此種模型訓練並沒有考慮到語者辨認時模型間彼此的關係，在模型參數訓練完成後有可能使得語音特徵向量落在對應的聲學模型與非相關模型的相似度值同時變大，產生辨識上的混淆。因此近十幾年來有所謂的鑑別式聲學模型訓練方法被提出來，不以最大化訓練聲學語料的相似度為目標，而以最小化分類(或辨識)錯誤為目標。在本論文中，我們使用最小錯誤鑑別式法則重新去訓練語者模型，並提出了三個改善傳統最小錯誤鑑別式法則的方法。此外，還把最小錯誤鑑別式使用在特徵語音調適法上，因為最小錯誤鑑別式受劣質近似模型的影響比最大相似度小。於是我們提出一個結合最小錯誤鑑別式和特徵語音調適法的方法，增加在極少語料時的強健性，以及降低建構聲學空間時造成劣質近似模型的影響性。

關鍵字

最小錯誤鑑別式；語者辨認

並列摘要

In the speaker identification, the data that can be effective training is very important, because this has very great influence on identification rate. Up to now, traditional speaker model use maximum likelihood. There is a very good result in a large amount of training data, but not good in a small amount of training data. The method of maximum likelihood is, use the training data for this speaker to train model for this speaker and not relevant with other speaker’s training data. This kind of training model which does not consider mutual relation among the models to verification.After the parameters are trained to finish,it may make the likelihood value of feature vectors leave the corresponding acoustics model and non- relevant model which become great at the same time,then produce the obscurity in verifying.So the so-called Discriminative Acoustic Model Training has been proposed in recent ten years.Do not regard maximizing to train acoustic data of likelihood as the goal, but regard minimizing classification(or identificaion) error as the goal. In this thesis, we use minimum classification error to train speaker model again, and propose three method of improved traditional minimum classification error. In addition, also use minimum classification error in eigenvoices, because minimum classification error is smaller of mistake distinguishing than maximum likelihood. Then we purpose a method of to combine minimum classification error and eigenvoices, increase robust in a few data, and reduce influence of mistake distinguishing when construct acoustics space.

並列關鍵字

Speaker Identification ； Minimum Classifiaction Error

參考文獻

[23] 莊智顯， “結合聲學與韻律訊息之強健性語者辨認” ，國立臺北科技大學電腦通訊與控制研究所碩士論文，民國九十四年。

[3] G.R. Doddington: Speaker Recognition-Identifying People by Their Voices. Proceedings of IEEE, Vol. 73, No. 11, 1986, pp. 1651-1644.

[4] J. L. Gauvain and C. H. Lee, “Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains,”IEEE Trans. Speech and Audio Processing, vol. 2, no. 2, pp. 291-298,April 1994.

[6] B.H Juang, W. Hou, C.H Lee, “Minimum classification error rate methods for speech recognition:’ IEEE Trans. on Speech and Audio Processing. vol. 5, pp. 257-265, May 1997.

[7] O. Siohan, A. E. Rosenberg, and S. Parthasarathy, “Speaker identification using minimum classification error training,” ICASSP-98, vol.1, pp.109–112, May 1998.

被引用紀錄

朱映霖（2007）。利用支撐向量機改善最小錯誤鑑別式之語者辨識方法〔碩士論文，國立中央大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0031-0207200917344534

黃夢晨（2008）。最小錯誤鑑別式應用於語者辨識之競爭語者探討〔碩士論文，國立中央大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0031-0207200917352439

邢凱婷（2009）。基於隱藏式條件隨機域語者模型之語者識別演算法〔碩士論文，元智大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0009-2807200914245700

劉維宸（2011）。基於隱藏式條件隨機域模型調適之語者識別演算法〔碩士論文，元智大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0009-2801201414583635

國際替代計量

改善最小錯誤鑑別式之語者辨認方法

未授權

主題瀏覽