
Articulatory Features' Representativeness of Phones Across Languages – Case of Mandarin & English

Advisor: Lin-shan Lee (李琳山)

Abstract


Most speech processing techniques are designed for a single language. Work that involves cross-language data must therefore combine the information of every language it contains. With the large multilingual corpora available in today's globalized world, the amount of data becomes too large and complex, which degrades both performance and processing speed. However, if we return to the nature of speech and describe the phones of each language in terms of articulatory features, a single feature system can represent all phones of all languages, so monolingual and multilingual corpora can be handled in a uniform way. In this research, a set of articulatory features is proposed to represent the phones of Mandarin and English, based on three corpora: TIMIT, COSPRO and DSP. Artificial neural networks are then trained to learn the mapping from acoustic features to articulatory features, and their accuracy is evaluated with a view to future applications.
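The idea of describing phones from different languages with one shared feature system can be illustrated with a small sketch. The feature names and the handful of IPA phones below are textbook-style assumptions chosen for illustration; the abstract does not list the feature set actually proposed in the thesis, and no neural network is involved here.

```python
# Hypothetical feature inventory (an assumption for illustration only;
# the feature set defined in the thesis is not given in the abstract).
FEATURES = ["voiced", "bilabial", "alveolar", "velar",
            "stop", "fricative", "nasal", "aspirated"]

# A few IPA phones with standard articulatory descriptions.
# /m/, /s/ and /ŋ/ occur in both English and Mandarin; /z/ occurs in
# English but not in Standard Mandarin. One table covers both languages.
PHONES = {
    "p":  {"bilabial", "stop"},
    "pʰ": {"bilabial", "stop", "aspirated"},
    "b":  {"voiced", "bilabial", "stop"},
    "m":  {"voiced", "bilabial", "nasal"},
    "s":  {"alveolar", "fricative"},
    "z":  {"voiced", "alveolar", "fricative"},
    "ŋ":  {"voiced", "velar", "nasal"},
}


def encode(phone):
    """Return the binary articulatory feature vector of a phone."""
    active = PHONES[phone]
    return [1 if feature in active else 0 for feature in FEATURES]


def decode(vector):
    """Return the phone whose feature vector is closest in Hamming distance."""
    def distance(phone):
        return sum(a != b for a, b in zip(encode(phone), vector))
    return min(PHONES, key=distance)


if __name__ == "__main__":
    for phone in PHONES:
        print(phone, encode(phone))
```

In a setup like the one the abstract describes, a classifier would predict such a feature vector from acoustic input, and a lookup similar to `decode` could map the prediction back to a phone regardless of language.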

Keywords

articulatory features, IPA, phones, ANN, TIMIT, COSPRO

