
近體詩自動分類研究

The Study of Chinese Jintishi Categorization

Advisor: 梁婷

Abstract


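Jintishi is an important cultural asset of Chinese societies. However, many poems contain metaphors, which makes their meaning hard for students to grasp. In this thesis, we propose several effective methods for automatically categorizing Jintishi, in order to help learners understand the poems. We use a rule-based method together with Tongyici Cilin (同義詞詞林) for semantic tagging, and an SVM classification model for poem categorization. Seven kinds of features are mined from the poem corpus as classification features, and the Forward Sequential Selection Algorithm is used to select among them. In an experiment categorizing 217 five-character quatrains into six categories of Jintishi, the proposed method reaches 72.35% accuracy.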

Parallel Abstract


Jintishi is an important cultural heritage of Chinese societies. However, many poets use metaphors in their poems, which makes Jintishi hard for high school students to understand. In this thesis, we present an effective approach to automatic Jintishi categorization, with the aim of facilitating poem comprehension. We propose a method for semantic labeling based on Tongyici Cilin and an SVM-based model for poem categorization. The categorization employs seven kinds of features mined from the training corpus, and the best feature set is selected with a forward sequential selection algorithm. In an experiment categorizing 217 five-character quatrains into six types of Jintishi, the approach achieves 72.35% accuracy.
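The feature-selection step named in the abstract can be illustrated with a minimal sketch of greedy forward sequential selection. The abstract does not give the stopping rule, the evaluation protocol, or the features themselves, so everything below is an assumption: selection stops when no remaining feature improves the score, the score is leave-one-out accuracy, a nearest-centroid classifier stands in for the thesis's SVM so that the example needs no external library, and the six toy samples are invented for illustration and are not data from the thesis.

```python
from typing import Callable, List, Sequence


def forward_sequential_selection(
    n_features: int, score: Callable[[List[int]], float]
) -> List[int]:
    """Greedily add the feature that most improves `score`.

    Assumed stopping rule: stop as soon as no remaining feature
    strictly improves the best score seen so far.
    """
    selected: List[int] = []
    remaining = set(range(n_features))
    best = float("-inf")
    while remaining:
        # Score every subset formed by adding one unused feature.
        candidates = [(score(selected + [f]), f) for f in sorted(remaining)]
        # Highest score wins; ties go to the lowest feature index.
        cand_score, cand_feature = max(candidates, key=lambda c: (c[0], -c[1]))
        if cand_score <= best:
            break
        selected.append(cand_feature)
        remaining.remove(cand_feature)
        best = cand_score
    return selected


def loo_accuracy(
    samples: Sequence[Sequence[float]], labels: Sequence[str], subset: List[int]
) -> float:
    """Leave-one-out accuracy of a nearest-centroid classifier.

    This classifier is a dependency-free stand-in for an SVM.
    """
    correct = 0
    for i, (x, y) in enumerate(zip(samples, labels)):
        sums: dict = {}
        counts: dict = {}
        # Build per-class centroids from every sample except sample i.
        for j, (xj, yj) in enumerate(zip(samples, labels)):
            if j == i:
                continue
            acc = sums.setdefault(yj, [0.0] * len(subset))
            for k, f in enumerate(subset):
                acc[k] += xj[f]
            counts[yj] = counts.get(yj, 0) + 1

        def dist(label: str) -> float:
            # Squared Euclidean distance from x to the class centroid.
            return sum(
                (x[f] - sums[label][k] / counts[label]) ** 2
                for k, f in enumerate(subset)
            )

        predicted = min(sorted(sums), key=dist)
        correct += predicted == y
    return correct / len(samples)


# Invented toy data: features 0 and 2 are informative, feature 1 is noise.
SAMPLES = [(0, 3, 0), (0, 7, 0), (1, 3, 0), (1, 7, 0), (1, 3, 1), (1, 7, 1)]
LABELS = ["A", "A", "B", "B", "C", "C"]

selected = forward_sequential_selection(
    3, lambda subset: loo_accuracy(SAMPLES, LABELS, subset)
)
print(selected)  # prints [0, 2]
```

On this toy data the procedure keeps the two informative features and rejects the noisy one, because adding it lowers the leave-one-out accuracy. With seven feature kinds, as in the abstract, the same loop would evaluate at most 7 + 6 + ... + 1 = 28 candidate subsets.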


Cited By


林孟儒 (2017). 確認影響手機遊戲App內消費之關鍵因素 [Master's thesis, Chaoyang University of Technology]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0078-2712201714440692
