Parallel Abstract


The Chinese language has many characteristics that differ substantially from those of Western languages, causing conventional language processing methods to fail on Chinese. For example, Chinese sentences are strings of characters with no spaces marking word boundaries, so word segmentation and unknown word identification techniques must be used to identify words. In addition, Chinese has very few inflectional or grammatical markers, which makes purely syntactic approaches to parsing almost impossible; a unified approach that draws on both syntactic and semantic information is needed. For this reason, the parsing model proposed here adopts a lexical feature-based grammar formalism called Information-based Case Grammar. This formalism stipulates that the lexical entry for a word contains both semantic and syntactic feature structures. Relaxing the constraints on lexical feature structures allows even ill-formed input to be accepted, broadening the coverage of the grammar. A priority-controlled chart parser is proposed which, in conjunction with a mechanism for dynamic grammar extension, addresses the problems of (1) syntactic ambiguity, (2) under-specification and limited coverage of grammars, and (3) ill-formed sentences. The model does so without making parsing inefficient for sentences that require neither relaxation of constraints nor dynamic extension of the grammar.

Cited By


Su, W. P. (2004). 台灣閩南語:講、提出、建議之語意及功能研究 [master's thesis, National Taiwan Normal University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0021-2004200709290503
