透過您的圖書館登入
IP:18.188.39.178
  • 學位論文

以深層卷積神經網路對中文語調進行分類

Mandarin Tone Classification Using CNN/DNN

指導教授 : 張智星

摘要


在華語系統中,聲調扮演十分重要之角色,同樣的一個音節,只要聲調的不同,即會產生完全不同的意義。母語是否為中文,常常可藉由講出來字詞之聲調辨認。為此,本論文提出一個對語音聲調進行分類的方法:先將聲音訊號轉為頻譜,將頻譜視為圖片,輸入至現有之影像識別卷積神經網路架構中,訓練出聲調分類模型,比較現成之影像辨識模型對處理聲調分類的效果如何。最後以此建立出不需對音訊進行過多處理步驟,即可達到一定程度之聲調分類架構。此聲調分類架構可套用至華語教學系統之中,為語言教學之方式提供新的選擇。

並列摘要


In Mandarin Chinese system, the tone plays an important role. Different tone patterns of the same syllable may result in different meanings. People whose native language aren’t Mandarin can be distinguished by their tone patterns. Therefore, we propose a method for tone classification. First, we convert the audio signal into the spectrogram. We treat the spectrogram as images, apply them as the image inputs for image recognition convolutional neural networks, and create tone classification models. We compare different image recognition models for tone classification. This approach can achieve good accuracy without too many processes on the audio signal. The tone classification architecture can be applied to Chinese teaching methods which will lead to educational success.

參考文獻


[1] Y. Chao, “A system of tone letters,” Le maitre phonetique, vol. 45, 1980.
[2] XU Li, ZHANG Wenle, ZHOU Ning, LEE Chaoyang, LI Yongxin, CHEN Xiuwu, ZHAO Xiaoyan, “Mandarin Chinese Tone Recognition with an Artificial Neural Network,” Journal of Otology, Volume 1, Issue 1, Pages 30-34, June 2006.
[3] Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel, “Backpropagation applied to handwritten zip code recognition,” Neural Computation, vol. 1, no. 4, pp. 541–551, Dec. 1989.
[4] Charles Chen, Razvan C. Bunescu, Li Xu, Chang Liu, “Tone Classification in Mandarin Chinese Using Convolutional Neural Networks,” Interspeech, 2016.
[5] Y. Lecun, L. Bottou, Y. Bengio, P. Haffner, "Gradient-based learning applied to document recognition", Proc. IEEE, vol. 86, no. 11, pp. 2278-2324, Nov. 1998.

延伸閱讀