透過您的圖書館登入
IP:18.119.131.72
  • 學位論文

推導文法規則下名詞參數之語義分類

Inducing Semantic Categories of Arguments of Grammar Patterns

指導教授 : 張俊盛

摘要


本論文提出一個透過Collins COBUILD的文法規則推導名詞參數的語義分類的方法,以協助英語學習者在寫作過程獲得文法規則相關的提示。 我們將文法規則轉換成語料搜尋引擎的查詢式並檢索N-gram,再透過WordNet取得名詞參數之岐義資訊,計算文法規則之名詞參數 語料搜索引擎的N-gram與WordNet的詞義為每個文法規則的名詞參數產生語義分類。 此方法涉及把文法規則轉換成語料搜尋引撆的查詢式、檢索N-gram與詞義、篩選候選字,以及透過演算法計算分數。 我們提出了一個寫作輔助系統Composer,把此方法應用在Collins COBUILD文法規則及Google Web 1T語料上。 實驗結果顯示,本系統能推導出有效且對學習者有用的語義及文法資訊。

關鍵字

語義分類 文法規則

並列摘要


This paper describes a method for deriving semantic categories for noun argument in a given grammar pattern of a head word. In our approach, we use ngrams retrieved from Web-scale ngram and WordNet supersenses to generates all possible candidates. The method involves converting the grammar pattern into an effective regular expression query, retrieving ngrams from the given grams, generates and sense disambiguating norminal argument. We present a prototype system, Composer that applies the proposed method to a set of manually compiled grammar patterns and Google Web 1T. The preliminary evaluation shows the system derives reasonably well semantics categories, which are useful for learning vocabulary and grammar.

並列關鍵字

Grammar Pattern Semantic Category

參考文獻


1. Naoki Abe and Hang Li. Learning word association norms using tree cut pair models. arXiv preprint cmp-lg/9605029, 1996.
2. Omri Abend and Ari Rappoport. Fully unsupervised core-adjunct argument classification. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 226-236, 2010.
3. Carsten Brockmann and Mirella Lapata. Evaluating and combining approaches to selectional preference acquisition. In 10th Conference of the European Chapter of the Association for Computational Linguistics, 2003.
4. Stephen Clark and David Weir. Class-based probability estimation using a semantic hierarchy. Computational Linguistics, 28(2):187-206, 2002.
5. Katrin Erk. A simple, similarity-based model for selectional preferences. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 216-223, 2007.

延伸閱讀