

Information Retrieval System Based on Similarity-Based Clustering Method

Advisor: 楊敏生




The Internet has spread rapidly and become an important tool in daily life. Because the range of information available online is so wide, a search for academic papers returns a very large volume of results, so an information system that can quickly retrieve related knowledge is essential. Most clustering algorithms used to derive association rules between words are hierarchical. In this thesis, we instead use a robust possibilistic clustering method to obtain better association rules between words. For experimental comparison, we use a sample of 60 papers retrieved from a website. We find that the association rules inferred by our method give better results in these comparisons.


[1] ComScore, ComScore Reports Global Search Market Growth of 46 Percent in 2009, http://www.comscore.com/Press_Events/Press_Releases/2010/1/Global_Search_Market_Grows_46_Percent_in_2009.
[3] G. Salton and C. Buckley, Term-weighting approaches in automatic text retrieval, Information Processing & Management 24 (1988) 513-523.
[4] C.H. Jiang, Building the fuzzy retrieval system based on data mining algorithm, Master thesis, I-Shou University, 2009.
[5] M.S. Yang and K.L. Wu, A similarity-based robust clustering method, IEEE Trans. on Pattern Analysis and Machine Intelligence 26 (2004) 434-438.
[6] J.A. Hartigan, Clustering Algorithms, New York: Wiley, 1975.
