透過您的圖書館登入
IP:3.142.119.241
  • 期刊
  • OpenAccess

A Survey on Automatic Speech Recognition with an Illustrative Example on Continuous Speech Recognition of Mandarin

並列摘要


For the past two decades, research in speech recognition has been intensively carried out worldwide, spurred on by advances in signal processing, algorithms, architectures, and hardware. Speech recognition systems have been developed for a wide variety of applications, ranging from small vocabulary keyword recognition over dial-up telephone lines, to medium size vocabulary voice interactive command and control systems on personal computers, to large vocabulary speech dictation, spontaneous speech understanding, and limited-domain speech translation. In this paper we review some of the key advances in several areas of automatic speech recognition. We also illustrate, by examples, how these key advances can be used for continuous speech recognition of Mandarin. Finally we elaborate the requirements in designing successful real-world applications and address technical challenges that need to be harnessed in order to reach the ultimate goal of providing an easy-to-use, natural, and flexible voice interface between people and machines.

參考文獻


Bourlard, H.(1994).Connectionist Speech Recognition-A Hybrid Approach.
Bourlard, H.,Wellekens, C. J.(1992).Links between Markov Models and Multi-Layer Perceptron.IEEE Transactions On Pattern Analysis and Machine Intelligence.12,1167-1178.
Brown, P. F.,De Souza, P. V.,Bahl, L. R.,Mercer, R. L.(1986).Proc. IEEE ICASSP - 86.
Brown, P. F.,De Souza, P. V.,Bahl, L. R.,Mercer, R. L.(1989).Tree-Based Language Model for Natural Language Speech Recognition.IEEE transactions on acoustics, speech, and signal processing .37,1001-1008.
Digalakis, V.,Weintraub, M.,Weintraub, M.,Murveit, H.,Butzberger, J.(1993).Proc. IEEE ICASSP.

被引用紀錄


Lin, C. Y. (2011). 基於隨機森林法之爆發起始偵測及其在嗓音起始時間預估之應用 [doctoral dissertation, National Tsing Hua University]. Airiti Library. https://doi.org/10.6843/NTHU.2011.00057
杜明桓(2009)。利用加速演算法之大詞彙連續語音辨識系統〔碩士論文,國立交通大學〕。華藝線上圖書館。https://doi.org/10.6842/NCTU.2009.00621
Kuo, Y. C. (2003). 數位訊號處理在臨床上的應用-數位腦波計數器 [master's thesis, Chung Yuan Christian University]. Airiti Library. https://doi.org/10.6840/cycu200300740
邱政湧(2003)。標記傳遞模式應用於中文連續語音關鍵詞辨認系統〔碩士論文,中原大學〕。華藝線上圖書館。https://doi.org/10.6840/cycu200300299
Chung, C. T. (2017). 無監督式結構化語音模型和語音特徵及其在語音檢索的運用 [doctoral dissertation, National Taiwan University]. Airiti Library. https://doi.org/10.6342/NTU201702854

延伸閱讀