透過您的圖書館登入
IP:3.145.97.248
  • 期刊

基於權重式有限狀態機之中文大詞彙連續語音辨識介紹與中文語音辨識之挑戰

Introduction to Weighted Finite-state Transducer Based Mandarin Large Vocabulary Continuous Speech Recognition and Unique Challenges from Mandarin Chinese

摘要


大詞彙連續語音辨識為語音辨識技術的極致,以其為基礎搭配語言理解與對話管理,能夠發展出許多功能性和趣味性的智慧型應用和服務,提升人機互動的便利性。本文將介紹權重式有限狀態機中文大詞彙連續語音辨識系統幾項必要的功能模組,內容涵蓋聲學特徵參數萃取、聲學模型訓練演算法、語言模型訓練演算法、以及解譯器架構。除此之外,尚會提及GALE計劃裡中文大詞彙連續語音辨識系統的效能,以及中文辨識所要面對的特殊問題和挑戰。

並列摘要


Large vocabulary continuous speech recognition (LVCSR) is the ultimate goal of speech recognition. When incorporated with other techniques such as language understanding and spoken dialogue management, a variety of speech-based intelligent applications and services are realized to enrich interaction between human and machine. This article gives readers a brief introduction to modules in weighted finite-state transducer (WFST) based LVCSR system, including acoustic feature extraction, acoustic model training algorithms, language model training algorithms, and decoder architecture. In addition, recent performance of Mandarin LVCSR systems in GALE project and some issues specific to Mandarin are addressed.

延伸閱讀