
英文初學者發音自動評分之研究

The Research of Automatic Pronunciation Evaluation for Beginners

Advisor: 李忠謀

Abstract


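Computer Assisted Pronunciation Training (CAPT) is a widely used approach to language learning: it gives beginners feedback on their English pronunciation so that they can practice repeatedly. This study uses speech recognition and string-similarity matching to build a recognition model suited to beginners' English pronunciation, in order to support their pronunciation practice. The study has two parts. The first part builds the speech recognition model: an initial model is built from the JTES corpus recorded for this study, and speech from the better-performing beginners in JTJS is then selected to adapt it, yielding the overall speech recognition model. The second part is evaluation, which uses string matching: the Levenshtein Distance-Like measure proposed in this study serves as the similarity score, and a cubic polynomial fit is used to find the thresholds for four grades (good, fair, needs improvement, re-record). The experimental results show that, with four grades, system scores agree with human scores 75% of the time, which indicates a reasonable level of accuracy. The Pearson coefficient between human and system scores is 0.71, showing that the two are correlated. The feedback given by the system is therefore reasonably credible for beginners, who can use it to improve their speaking skills.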
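The string-matching step can be illustrated with a minimal sketch. The abstract does not define the Levenshtein Distance-Like measure, so the code below substitutes the standard Levenshtein distance, normalized to a similarity in [0, 1]. The function names, the phoneme symbols, and the threshold values are all hypothetical; in the study the thresholds come from a cubic polynomial fit, which is not reproduced here.

```python
from typing import Sequence


def levenshtein(a: Sequence[str], b: Sequence[str]) -> int:
    """Standard edit distance between two token sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def similarity(reference: Sequence[str], recognized: Sequence[str]) -> float:
    """Normalized similarity: 1.0 means identical, 0.0 means nothing matches."""
    longest = max(len(reference), len(recognized))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(reference, recognized) / longest


# Hypothetical cutoffs, NOT the values found in the study.
THRESHOLDS = [(0.85, "good"), (0.60, "fair"), (0.35, "needs improvement")]


def grade(score: float) -> str:
    """Map a similarity score to one of the four grades."""
    for cutoff, label in THRESHOLDS:
        if score >= cutoff:
            return label
    return "re-record"


if __name__ == "__main__":
    reference = ["AE", "P", "AH", "L"]   # assumed target phonemes
    recognized = ["AE", "P", "L"]        # assumed recognizer output
    score = similarity(reference, recognized)
    print(score, grade(score))
```

Comparing token sequences rather than raw characters lets the same functions work on phoneme strings or word strings, whichever the recognizer emits.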

Parallel Abstract


Computer Assisted Pronunciation Training (CAPT) programs are primarily designed to assist students in language learning. Such a program provides feedback based on each individual's needs and helps beginners practice proper pronunciation repeatedly. This research uses speech recognition and string matching to build a speech recognition model with which beginners can practice pronunciation. The research consists of two main parts. The first part builds the speech recognition model: an initial model is built from the recorded JTES corpus, and the top speech samples in the JTJS corpus are then selected for model adaptation. The second part evaluates speech with a string-matching method. We propose a Levenshtein Distance-Like approach and use a cubic polynomial fit to find the thresholds that separate the evaluation into four levels (excellent, average, inferior, and re-recording). The experimental results show that the agreement between human and system evaluation is around 75% when scores are divided into four levels. Pearson correlation analysis gives a correlation of 0.71 between human and system evaluation, which means the two are correlated. Therefore, the system is credible for beginners who want to learn and improve their speaking skills.
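The two agreement figures reported above (exact-match accuracy over four levels and the Pearson correlation) can be computed as in the following sketch. The encoding of levels as integers 0 to 3 and the sample scores are assumptions made for illustration only; they are not data from the study.

```python
from math import sqrt
from typing import Sequence


def accuracy(human: Sequence[int], system: Sequence[int]) -> float:
    """Fraction of utterances where the system level equals the human level."""
    if len(human) != len(system) or not human:
        raise ValueError("score lists must be non-empty and of equal length")
    return sum(h == s for h, s in zip(human, system)) / len(human)


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 items")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / sqrt(var_x * var_y)


if __name__ == "__main__":
    # Made-up levels: 3 = excellent, 2 = average, 1 = inferior, 0 = re-recording.
    human_levels = [3, 2, 1, 0, 3, 2, 1, 0]
    system_levels = [3, 2, 0, 0, 3, 1, 1, 0]
    print(accuracy(human_levels, system_levels))
    print(pearson(human_levels, system_levels))
```

Accuracy counts only exact level matches, while the Pearson coefficient also gives credit when the system is off by one level in a consistent direction, which is why the two numbers are reported separately.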

