Telephony Based Speaker-Independent Continuous Mandarin Syllable Recognition

Abstract


This paper studies speaker-independent continuous Mandarin syllable recognition in telephone environments. It compares several cepstral bias removal techniques for compensating telephone channel effects: cepstral mean subtraction (CMS), signal bias removal (SBR), and stochastic matching (SM). Modifications and combinations of these techniques are then investigated to further improve environmental robustness over the telephone. To better model contextual acoustics and co-articulation in spontaneous Mandarin telephone speech, between-syllable context-dependent phone-like units (such as triphones, biphones, and demiphones) are used to train the speech models. In addition, the discriminative capability of the speech models is further enhanced using minimum classification error (MCE) algorithms. Experimental results show that the recognition rate for Mandarin base syllables reaches as high as 59.53%, an improvement of 27.81% in the error rate.
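Of the three bias removal techniques named in the abstract, CMS is the simplest to illustrate. A stationary telephone channel acts as a convolution in the time domain, which becomes an approximately constant additive offset in the cepstral domain, so subtracting the per-utterance mean of each cepstral coefficient removes most of that offset. The following is a minimal sketch of the general technique, not the paper's implementation; the function name `cepstral_mean_subtraction` and the toy data are assumptions made for illustration, and SBR and SM are not shown.

```python
from typing import List


def cepstral_mean_subtraction(frames: List[List[float]]) -> List[List[float]]:
    """Subtract the utterance-level mean of each cepstral coefficient.

    `frames` is a list of cepstral vectors, one per analysis frame.
    (Hypothetical helper for illustration only.)
    """
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    # Mean of each coefficient over all frames of the utterance.
    means = [sum(f[k] for f in frames) / n_frames for k in range(n_coeffs)]
    # Remove that mean from every frame.
    return [[f[k] - means[k] for k in range(n_coeffs)] for f in frames]


# Toy data (assumed values): three frames, two cepstral coefficients each.
clean = [[1.0, -2.0], [3.0, 0.0], [-1.0, 2.0]]
# A constant cepstral offset standing in for a fixed channel.
channel_bias = [0.5, -0.25]
observed = [[c + b for c, b in zip(f, channel_bias)] for f in clean]

normalized_clean = cepstral_mean_subtraction(clean)
normalized_observed = cepstral_mean_subtraction(observed)
```

In this toy case `normalized_observed` matches `normalized_clean` up to floating-point rounding, because a constant offset shifts only the mean. Real channels are only approximately stationary, and the subtraction also removes the mean of the speech itself, which is one reason more refined methods such as SBR and SM are studied.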

Cited by

鍾譯賢 (2009). Noise cancellation algorithms for hearing aids [Master's thesis, National Chiao Tung University]. Airiti Library. https://doi.org/10.6842/NCTU.2009.01191
