透過您的圖書館登入
IP:3.146.221.204
  • 學位論文

針對口頭演講自動推薦演講停頓點

Automatic Determination of Speech Pause in Oral Presentation

指導教授 : 張智星 張俊盛

摘要


我們時常將標點符號視為語句上可呼吸停頓的位址,然而,並不是所有停頓都發生在標點符號的位址,也不是所有標點符號都會停頓。本篇論文中,我們介紹一個可以針對英文語言學習者輸入的演講文稿自動推薦適當的停頓點的系統。在使用的方法中,我們必須將演講文稿裡面的標點符號去除,並且產生適當的特徵。其中包括自動產生標記停頓點的訓練資料、自動針對訓練資料產生文字上的特徵值,並且自動訓練分類器協助判斷停頓點。最終的評估顯示我們提出的方法在針對標記停頓點上有相當不錯表現。

關鍵字

停頓點推薦

並列摘要


Punctuation marks in text usually tend to be taken as breath pauses. However, not all pauses occur at punctuation marks, and, in fact, not all punctuations are designed to be pauses. In this paper, we introduce a method for suggesting speech pauses for a given script submitted by English language learners. In our approach, a text is transformed into a non-punctuated text with features aimed at suggesting appropriate pauses in speech. The method involves automatically generating training data annotated with pauses, automatically transform the training data into linguistic features, and automatically training a discriminative classifier. Evaluation shows that the proposed method achieves a satisfactory performance in suggesting pauses in given speech.

並列關鍵字

Pause suggestion

參考文獻


[2] Bosker, H. R., Pinget, A.-F., Quene, H., Sanders, T., & de Jong, N. H. (2013, April). What makes speech sound fluent? the contributions of pauses, speed and repairs. Language Testing, 30(2), 159-175.
[3] Chiang, C.-Y., Wang, Y.-R., & Chen, S.-H. (2012, March). Punctuation generation inspired linguistic features for mandarin prosodic boundary prediction. In Acoustics, speech and signal processing (icassp), 2012 ieee international conference on (p. 4597-4600). doi: 10.1109/ICASSP.2012.6288942
[4] Derwing, T. M., Rossiter, M. J., Munro, M. J., & Thomson, R. I. (2004, December). Second language fluency: Judgments on different tasks. Language Learning, 54, 655-679.
[5] Hirschberg, J., & Prieto, P. (1996). Training intonational phrasing rules automatically for english and spanish text-to-speech. Speech Communication, 18.3, 281-290.
[8] Koehn, P., Abney, S., Hirschberg, J., & Collins, M. (2000, ). Improving intonational phrasing with syntactic information. In Acoustics, speech, and signal processing, 2000. icassp ’00. proceedings. 2000 ieee international conference on (Vol. 3, p. 1289-1290 vol.3). doi: 10.1109/ICASSP.2000.861813

延伸閱讀