透過您的圖書館登入
IP:18.191.202.45
  • 期刊
  • OpenAccess

Locating Boundaries for Prosodic Constituents in Unrestricted Mandarin Texts

並列摘要


This paper proposes a three-tier prosodic hierarchy, including prosodic word, intermediate phrase and intonational phrase tiers, for Mandarin that emphasizes the use of the prosodic word instead of the lexical word as the basic prosodic unit. Both the surface difference and perceptual difference show that this is helpful for achieving high naturalness in text-to-speech conversion. Three approaches, the basic CART approach, the bottom-up hierarchical approach and the modified hierarchical approach, are presented for locating the boundaries of three prosodic constituents in unrestricted Mandarin texts. Two sets of features are used in the basic CART method: one contains syntactic phrasal information and the other does not. The one with syntactic phrasal information results in about a 1% increase in accuracy and an 11% decrease in error-cost. The performance of the modified hierarchical method produces the highest accuracy, 83%, and lowest error cost when no syntactic phrasal information is provided. It shows advantages in detecting the boundaries of intonational phrases at locations without breaking punctuation. 71.1% precision and 52.4% recall are achieved. Experiments on acceptability reveal that only 26% of the mis-assigned break indices are real infelicitous errors, and that the perceptual difference between the automatically assigned break indices and the manually annotated break indices are small.

並列關鍵字

無資料

參考文獻


Chu, M.,Chang, E.,Yang, H.,Peng, H.(2001).Proceeding of the 2001 International Conference on Acoustics, Speech and Signal Processing.
Chu, M.,Lu, S. N.(1996).A Text-to-speech system with High Intelligibility and High Naturalness for Chinese.Chinese Journal of Acoustics.15(1),81-90.
Chu, M.,Peng, H.,Qian, Y.(2001).Proceeding of the 2001 International Conference on Acoustics, Speech and Signal Processing.
Dutoit, T.,Pagel, V.,Pierret, N.,Bataille, F.,Verchen, O.(1996).Proceeding of the Fourth International Conference on Spoken Language Processing.
Grosjean, L.,Lane, H.,Grosjean, F.(1979).The patterns of silence: performance structures in sentence production.Cognitive Psychology.11,58-81.

被引用紀錄


Chiang, C. Y. (2009). 非監督式中文語音韻律標記及韻律模式 [doctoral dissertation, National Chiao Tung University]. Airiti Library. https://doi.org/10.6842/NCTU.2009.00093

延伸閱讀