Statistical Analysis of Mandarin Acoustic Units and Automatic Extraction of Phonetically Rich Sentences Based upon a Very Large Chinese Text Corpus

Abstract


Automatic speech recognition offers one of the most convenient ways for people to communicate with computers. Because Chinese is not alphabetic and entering Chinese characters into a computer is difficult, Mandarin speech recognition is highly desirable. High-performance speech recognition systems have recently begun to emerge from research institutes. An adequate speech database for training acoustic models and evaluating performance is nevertheless critical for deploying such systems successfully in realistic operating environments. Designing a set of phonetically rich sentences for efficiently training and evaluating a speech recognition system has therefore become very important. This paper first presents a statistical analysis of various Mandarin acoustic units based on a very large Chinese text corpus collected from daily newspapers. It then presents an algorithm that automatically extracts phonetically rich sentences from that corpus for use in training and evaluating a Mandarin speech recognition system.
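The abstract does not describe the extraction algorithm itself, so the following is only a minimal sketch of one common way to approach the problem: count acoustic units over a corpus, then greedily pick the sentences that add the most units not yet covered. It should not be read as the paper's method. The function names, the toy corpus, the use of toneless pinyin syllables as units, and the tie-breaking rule are all assumptions made for illustration.

```python
from collections import Counter


def unit_counts(sentences):
    """Count how often each acoustic unit occurs across all sentences.

    `sentences` maps a sentence id to its list of units (hypothetical format).
    """
    counts = Counter()
    for units in sentences.values():
        counts.update(units)
    return counts


def select_rich_sentences(sentences, max_sentences):
    """Greedily pick sentences that add the most not-yet-covered units."""
    covered = set()
    selected = []
    remaining = dict(sentences)
    while remaining and len(selected) < max_sentences:
        # Score = number of distinct new units; ties go to the shorter sentence.
        best_id = max(
            remaining,
            key=lambda sid: (len(set(remaining[sid]) - covered),
                             -len(remaining[sid])),
        )
        gain = set(remaining[best_id]) - covered
        if not gain:
            break  # no remaining sentence contributes a new unit
        selected.append(best_id)
        covered |= gain
        del remaining[best_id]
    return selected, covered


# Toy corpus (assumed data): each sentence is a list of toneless syllables.
corpus = {
    "s1": ["jin", "tian", "tian", "qi", "hao"],
    "s2": ["tian", "qi"],
    "s3": ["ming", "tian", "xia", "yu"],
    "s4": ["jin", "tian", "xia", "yu"],
}

counts = unit_counts(corpus)
selected, covered = select_rich_sentences(corpus, max_sentences=3)
print(counts.most_common(3))
print(selected, sorted(covered))
```

A real design would likely weight units by their corpus frequency and work with finer-grained units than whole syllables; the sketch uses plain coverage only to keep the idea visible.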

