透過您的圖書館登入
IP:3.133.109.211
  • 期刊
  • OpenAccess

Computational Tools and Resources for Linguistic Studies

並列摘要


This paper presents several useful computational tools and available resources to facilitate linguistic studies. For each computational tool, we demonstrate why it is useful and how can it be used for research. In addition, linguistic examples are given for illustration. First, a very useful searching engine, Key Word in Context (KWIC), is introduced. This tool can automatically extract linguistically significant patterns from large corpora and help linguists discover syntagmatic generalizations. Second, Dynamic Clustering and Hierarchical Clustering are introduced for identifying natural clusters of words or phrases in distribution. Third, statistical measures which could be used to measure the degree of cohesion and correlation among linguistic units are presented. These tools can help linguists identify the boundaries of lexical units. Fourth, alignment tools for aligning parallel texts at the word, sentence and structure levels are presented for linguists who do comparative studies of different languages. Fifth, we introduce Sequential Forward Selection (SFS) and Classification and Regression Tree (CART) for automatic rule ordering. Finally, some available electronic Chinese resources are described to provide reference purposes for those who are interested.

參考文獻


黃居仁,陳克健 Keh-Jiann, Keh-Jiann(1996).Proceedings of the 16th International Conference on Computational Linguistics.
Su, Keh-Yih,Chiang, Tung-Hui,Chang, Jing-Shin(1996).An Overview of Corpus-Based Statistics-Oriented (CBSO) Techniques for Natural Language.Journal of Computational Linguistics and Chinese Language Processing (CLCLP).1(1),101-157.
Aho, A. V.(1996).Compilers: Principles, Techniques, and Tools.
Breiman, L.(1984).Classification And Regression Trees.
Brown, P. F.(1991).Proceedings of 29th Annual Meeting of the Association for Computational Linguistics (ACL-29).

延伸閱讀