透過您的圖書館登入
IP:3.16.81.94
  • 期刊
  • OpenAccess

Assessing Chinese Readability using Term Frequency and Lexical Chain

並列摘要


This paper investigates the appropriateness of using lexical cohesion analysis to assess Chinese readability. In addition to term frequency features, we derive features from the result of lexical chaining to capture the lexical cohesive information, where E-HowNet lexical database is used to compute semantic similarity between nouns with high word frequency. Classification models for assessing readability of Chinese text are learned from the features using support vector machines. We select articles from textbooks of elementary schools to train and test the classification models. The experiments compare the prediction results of different sets of features.

並列關鍵字

Readability Chinese Text Lexical Chain TF-IDF SVM

參考文獻


Lin, S.-Y.,Su, C.-C.,Lai, Y.-D.,Yang, L.-C.,Hsieh, S.-K.(2009).Assessing text readability using hierarchical lexical relations retrieved from WordNet.Computational Linguistics and Chinese Language Processing.14(1),45-84.
Chang, C.-C., & Lin, C.-J. (n.d.). LIBSVM: A library for support vector machines. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
CKIP Group. (n.d.). A Chinese word segmentation system, http://ckipsvr.iis.sinica.edu.tw/
(Dong, Z. (n.d.). HowNet knowledge database. http://www.keenage.com/).

延伸閱讀