Automatic Pronunciation Assessment for Mandarin Chinese: Approaches and System Overview

Abstract


This paper presents the algorithms used in a prototype software system for automatic pronunciation assessment of Mandarin Chinese. The system uses HMM (hidden Markov model) forced alignment to locate each syllable and obtain its log probability, from which a phoneme score is derived through a ranking-based confidence measure. The pitch vector of each syllable is then passed to a GMM (Gaussian mixture model) for tone recognition and assessment. We also compute similarity scores for intensity and rhythm between the target and test utterances. Each of the four scores (phoneme, tone, intensity, and rhythm) is a parametric function with free parameters, and the overall scoring function is formulated as a linear combination of the four. Because the overall scoring function involves both linear and nonlinear parameters, we employ downhill simplex search to tune them so that the system's output approximates the scores assigned by a human expert. Experimental results show that the system gives consistent scores that are close to a human expert's subjective evaluation.
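The parameter-tuning step described in the abstract can be illustrated with a small sketch. The abstract does not give the form of the four scoring functions, so everything concrete here is an assumption made for illustration: the sub-score shape `100 / (1 + a * distance)`, the function names, the synthetic "human expert" scores, and the simplified downhill simplex routine (reflection, expansion, inside contraction, shrink). It is not the authors' implementation.

```python
import random

def sub_score(distance, a):
    """Hypothetical sub-score: 100 at zero distance, decaying as distance grows."""
    return 100.0 / (1.0 + abs(a) * distance)

def overall_score(theta, distances):
    """Linear combination of four nonlinear sub-scores.

    theta = 4 linear weights followed by 4 nonlinear shape parameters,
    ordered as phoneme, tone, intensity, rhythm (assumed layout).
    """
    weights, shapes = theta[:4], theta[4:]
    return sum(w * sub_score(d, a) for w, d, a in zip(weights, distances, shapes))

def mse(theta, data):
    """Mean squared error between system scores and reference scores."""
    return sum((overall_score(theta, d) - h) ** 2 for d, h in data) / len(data)

def nelder_mead(f, x0, step=0.5, max_iter=2000, tol=1e-10):
    """Simplified downhill simplex search; returns (best_point, best_value)."""
    n = len(x0)
    simplex = [list(x0)]
    for i in range(n):                      # one perturbed vertex per dimension
        p = list(x0)
        p[i] += step
        simplex.append(p)
    values = [f(p) for p in simplex]
    for _ in range(max_iter):
        order = sorted(range(n + 1), key=lambda i: values[i])
        simplex = [simplex[i] for i in order]
        values = [values[i] for i in order]
        if values[-1] - values[0] < tol:
            break
        worst = simplex[-1]
        centroid = [sum(p[j] for p in simplex[:-1]) / n for j in range(n)]

        def along(t):
            # Point on the line through the centroid, away from the worst vertex.
            return [c + t * (c - w) for c, w in zip(centroid, worst)]

        refl = along(1.0)
        f_refl = f(refl)
        if f_refl < values[0]:              # try expanding further
            expd = along(2.0)
            f_expd = f(expd)
            if f_expd < f_refl:
                simplex[-1], values[-1] = expd, f_expd
            else:
                simplex[-1], values[-1] = refl, f_refl
        elif f_refl < values[-2]:           # accept plain reflection
            simplex[-1], values[-1] = refl, f_refl
        else:
            con = along(-0.5)               # inside contraction toward centroid
            f_con = f(con)
            if f_con < values[-1]:
                simplex[-1], values[-1] = con, f_con
            else:                           # shrink all vertices toward the best
                best = simplex[0]
                simplex = [best] + [[b + 0.5 * (x - b) for b, x in zip(best, p)]
                                    for p in simplex[1:]]
                values = [values[0]] + [f(p) for p in simplex[1:]]
    best_i = min(range(n + 1), key=lambda i: values[i])
    return simplex[best_i], values[best_i]

# Synthetic stand-in for expert-rated utterances (not real data).
random.seed(0)
true_theta = [0.4, 0.3, 0.2, 0.1, 0.8, 1.5, 0.5, 2.0]
data = []
for _ in range(40):
    d = [random.uniform(0.0, 2.0) for _ in range(4)]
    data.append((d, overall_score(true_theta, d)))

start = [0.25] * 4 + [1.0] * 4
initial_loss = mse(start, data)
fitted_theta, fitted_loss = nelder_mead(lambda t: mse(t, data), start)
print(f"MSE before tuning: {initial_loss:.3f}, after tuning: {fitted_loss:.3f}")
```

A derivative-free search such as this is a natural fit when the objective mixes linear weights with nonlinear shape parameters, since it needs only function evaluations; in practice a library routine (for example a Nelder-Mead option in a numerical optimization package) would likely replace the hand-written loop.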

