透過您的圖書館登入
IP:3.145.36.10
  • 期刊
  • OpenAccess

Chinese Main Verb Identification: From Specification to Realization

並列摘要


Main verb identification is the task of automatically identifying the predicate-verb in a sentence. It is useful for many applications in Chinese Natural Language Processing. Although most studies have focused on the model used to identify the main verb, the definition of the main verb should not be overlooked. In our specification design, we have found many complicated issues that still need to be resolved since they haven't been well discussed in previous works. Thus, the first novel aspect of our work is that we carefully design a specification for annotating the main verb and investigate various complicated cases. We hope this discussion will help to uncover the difficulties involved in this problem. Secondly, we present an approach to realizing main verb identification based on the use of chunk information, which leads to better results than the approach based on part-of-speech. Finally, based on careful observation of the studied corpus, we propose new local and contextual features for main verb identification. According to our specification, we annotate a corpus and then use a Support Vector Machine (SVM) to integrate all the features we propose. Our model, which was trained on our annotated corpus, achieved a promising F score of 92.8%. Furthermore, we show that main verb identification can improve the performance of the Chinese Sentence Breaker, one of the applications of main verb identification, by 2.4%.

參考文獻


Banko, M.,E. Brill(2001).Scaling to very very large corpora for natural language disambiguation.(Proceedings of the 39th Annual Meeting and 10th, Conference of the European Chapter of the Association for Computational Linguistics).
Chen, X. H.,D. Y. Shi(1997).To Mark Topic and Subject in Chinese Sentences.(Proceedings of 4th National Computational Linguistics).
Ding, S. S.,S. X. Lv,R. Chen,D. X. Sun,X. C. Guan,J. Fu,S. Z. Huang,Z. W. Chen(1961)."xiandai hanyu yufa jianghua," (Modem Chinese Grammar Talk).
Fan, X.(1995)."sange pingmian de yufa guan," (The Grammar View of Three Levels).
Gong, X. J.,Z. S. Luo,W. H. Luo(2003).Recognizing the Predicate Head of Chinese Sentences.Journal of Chinese Information Processing.17(2),7-13.

被引用紀錄


林晏僖(2010)。中文名詞組的辨識:規則式判別、監督式、半監督式與非監督式學習法的實驗〔碩士論文,國立臺灣大學〕。華藝線上圖書館。https://doi.org/10.6342/NTU.2010.00550
侯慧如(2004)。漢英視譯主要動詞之選取及非主要動詞之轉換〔碩士論文,國立臺灣師範大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0021-2004200709413854

延伸閱讀