An Assessment of Character-based Chinese News Filtering Using Latent Semantic Indexing

Parallel Abstract


We assess the Latent Semantic Indexing (LSI) approach to Chinese information filtering, specifically for Chinese news filtering agents that use a character-based, hierarchical filtering scheme. The traditional vector space model serves as the information filtering model: each document is converted into a vector of term weights. Instead of using words as terms, as is traditional in IR, the terms are individual Chinese characters. LSI captures the semantic relationship between documents and Chinese characters. We use the Singular Value Decomposition (SVD) technique to compress the term space into a lower dimension, which captures latent associations between documents and terms. The experimental results show that the recall and precision rates of Chinese news filtering using the character-based approach with LSI are satisfactory.
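The pipeline described in the abstract (character-level term vectors, SVD compression, matching in the reduced space) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the toy corpus, the raw character counts used as weights, the choice of `k = 2`, and the helper names `char_vector`, `project`, and `cosine` are all hypothetical, and the hierarchical filtering scheme is not modeled. The fold-in step uses the standard LSI projection q_k = S_k^{-1} U_k^T q.

```python
import numpy as np

# Hypothetical toy corpus (not from the paper): two finance and two sports headlines.
docs = [
    "股市上漲",
    "股市下跌",
    "球隊獲勝",
    "球隊落敗",
]

# Character-based terms: every distinct Chinese character is one term.
chars = sorted({ch for d in docs for ch in d})
index = {ch: i for i, ch in enumerate(chars)}


def char_vector(text):
    """Raw character-count vector (assumed weighting); unknown characters are ignored."""
    v = np.zeros(len(chars))
    for ch in text:
        if ch in index:
            v[index[ch]] += 1.0
    return v


# Term-by-document matrix A: rows are characters, columns are documents.
A = np.column_stack([char_vector(d) for d in docs])

# Truncated SVD: keep only the k largest singular values and their left vectors.
k = 2
U, s, Vt = np.linalg.svd(A, full_matrices=False)
Uk, sk = U[:, :k], s[:k]


def project(text):
    """Fold a text into the k-dimensional latent space: S_k^{-1} U_k^T q."""
    return (Uk.T @ char_vector(text)) / sk


def cosine(a, b):
    """Cosine similarity, returning 0.0 when either vector is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


# A filtering profile is just another text folded into the latent space.
profile = project("股市")
scores = [cosine(profile, project(d)) for d in docs]
```

On this toy corpus the two stock-market headlines should score close to 1 against the profile and the two sports headlines close to 0, so a filtering agent could pass documents whose score exceeds some threshold. How that threshold and the term weights are actually chosen is not stated in the abstract.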

Parallel Keywords

No data

References


Armstrong, R., Freitag, D., Joachims, T., Mitchell, T. (1995). 1995 AAAI Spring Symposium on Information Gathering from Heterogeneous, Distributed Environments.
Belkin, N. J., Croft, W. B. (1992). Information filtering and information retrieval: Two sides of the same coin? Communications of the ACM, 35(12), 29-38.
Chien, L. F. (1996). Proceedings of the ROCLING IX.
Chien, L. F. (1995). Proceedings of 18th ACM SIGIR.
Cullum, J. K. (1985). Lanczos Algorithms for Large Symmetric Eigenvalue Computations.

Cited By


Hung, C. T. (2001). LSI-based Document Retrieval [master's thesis, National Taiwan Normal University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0021-2603200719120009
Chiu, H. W. (2014). Chinese Spell Checking Based on Noisy Channel Model [master's thesis, National Tsing Hua University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0016-2912201413553062

Further Reading