透過您的圖書館登入
IP:18.221.154.151
  • 期刊
  • OpenAccess

Modeling Taiwanese Southern-Min Tone Sandhi Using Rule-Based Methods

並列摘要


A sizable corpus of Taiwanese text in Latin script has been accumulated over the past two hundred or so years. However, due to the special status of Taiwan, few people can read these materials at present. It is regrettable that the utilization of these plentiful materials is very low. This paper addresses problems raised in the Taiwanese Southern-Min tone sandhi system by describing a set of computational rules to approximate this system, as well as the results obtained from its implementation. Using the romanized Taiwanese Southern-Min text as source, we take the sentence as the unit, translate every word into Chinese via an online Taiwanese-Chinese dictionary (OTCD), and obtain the part-of-speech (POS) information from the Chinese Electronic Dictionary (CED) made by the Chinese Knowledge and Information Processing (CKIP) group of Academia Sinica. By using the POS data and tone sandhi rules based on linguistics, we then tag each syllable with its post-sandhi tone marker. Finally, we implement a Taiwanese Southern-Min tone sandhi processing system which takes a romanized sentence as an input and then outputs the tone markers. Our system achieves 97.39% and 88.98% accuracy rates with training and test data, respectively. Finally, we analyze the factors influencing error for the purpose of future improvement.

參考文獻


Cheng, R.(2002).Tone Sandhi on the Grammar Template-Cognition and Testing.Proceeding of 2002 International Conference on Teaching and Researching of Taiwanese Romanization.(Proceeding of 2002 International Conference on Teaching and Researching of Taiwanese Romanization).
Iunn, U.-G.(2003).Taiwanese-Chinese On-line Dictionary-Discussion of Building Technique and its Utilization.Proceeding of 3rd International Conference on Internet Chinese Education.(Proceeding of 3rd International Conference on Internet Chinese Education).
Iunn, U.-G.,H.-K. Tiunn(1999).Review and Analysis of Taiwan Ho-lo Language non-Han Character Spelling Symbols.Proceedings of 1st Conference on the Regeneration and Rebuild of Taiwan Mother Tongue Culture.(Proceedings of 1st Conference on the Regeneration and Rebuild of Taiwan Mother Tongue Culture).
Iunn, U.-G.,H. H.Tan-Tenn.A Survey of Media and Data Processing Development for Written Taiwanese.(Accepted by International Journal of the Sociology of Language, Special Issues on Taiwanese).
Liang, M.-S.,J.-C. Yang,Y.-C. Chiang,R.-Y. Lyu(2004).A Taiwanese Text-to-Speech System with Applications to Language Learning.Proceedings of the 4th IEEE International Conference on Advanced Learning Technologies.(Proceedings of the 4th IEEE International Conference on Advanced Learning Technologies).

延伸閱讀