以顯性評價為主之相似性推薦

推薦系統(Recommendation Systems)現今在網際網路的商用行為中，經常被用來協助解決資訊過載(Information Overload)、推薦及預測服務，由過去學者的研究發現推薦系統中最常運用相似度量測(Similarity Measure)方法，來找出相似使用者或尋找具有相似行為之鄰居群(Neighborhood)，而這類相似度量測方法，多半利用使用者對產品的評價(rating)通常為顯性評分值(explicit rating)，來找出使用者間的相似度，藉由相似使用者的資訊對目標使用者做預測推薦，因此一個良好的相似度量測方法，往往是決定推薦成效的關鍵所在。過去學者常用的相似度量測方法最常見的有皮爾森相關係數(Pearson Correlation Coefficient, PCC)、餘弦相似(Cosine Similarity, COS)、限制性皮爾森相關(Constrained Pearson’s Correlation, CPC)、修正餘弦相似(Adjusted Cosine, ACOS)、PIP相似性及歐幾里德距離(Euclidean Distance, ED)等，這些方法在計算使用者相似度時都會以顯性評分來做為計算的依據，但這些方法都有一些問題，例如：(1)未考慮到正向評分值在推薦時的重要性、(2)當資料量大時則計算相似度時間可能會過長、(3)當評分值與平均值相同時會發生除以零的問題等。為了改善上述所提的問題，本研究提出一個新的以顯性評分為主之相似度量測方法(Rating-based Similarity Measure, RBSM)，本研究提出以快速的二元相似度量測方法為基礎，以正向評分值做為分析相似度之要項，藉由此方法快速找出具推薦價值之相似使用者及其推薦項目，本研究同時也考量使用者的評分特性，將實驗資料依使用者特性分群之後進行相似度計算，以期求得更準確的使用者相似度。利用本研究所提的顯性評分為主之相似度量測方法，經由初步實驗驗證能提升預測的效能，本研究的初步貢獻為經由實驗驗證本研究所提之方法有較高之準確度，同時可以縮短推薦計算之系統執行時間。

關鍵字

相似度量測；顯性評分；推薦系統

並列摘要

Recommendation systems (RS) are usually used for handling the information overloading, for recommendation, and for prediction, especially under current Internet environment. According to previous studies, the most commonly applied method for RS is the similarity measure. Similarity measure is usually used for finding similarity user or neighbors. To apply similarity measure, one of the commonly approach is to used the user rating, which is also call explicit rating for calculation. The rating difference or distance between the active user and the similar user is used for prediction. Therefore, a good similarity measure can affect the result of RS. It is noticed that the similarity measures such as Pearson Correlation (PCC), Cosine Similarity (COS), Constrained Pearson’s Correlation (CPC), Adjusted Cosine (ACOS), PIP, or Euclidean Distance (ED) are highly used for finding similar users or similar items. These similarity measures usually rely on a user-item matrix in which the explicit ratings are used for calculation. The outcome of recommendation is usually made based on the information of the similar user (item). Hence, the similarity measure for finding the similar user (item) is critical for RS. But, if we examine the traditional similarity measurements, there exists some problems when applying to the RS. (1)They did not take the positive rating and the co-rating count into consideration. (2) When the rating value is equal to the average rating value, the similarity measure of PCC and COS will encounter the problem of division by zero problem, which will cause system failure. (3) The scalability problem is usually not discussed. In order to handle these problems, this research proposes A Rating-based Similarity Measure (RBSM). Our method transforms the explicit-ratings into binary and considers both the positive-rating and co-rating count for finding the similar user. A simple similarity computation is proposed to find the neighborhood. For finding similar neighbor efficiently, users are divided, based on the co-rating amount, into three groups i.e., high, medium, and low. From the experiments, it is proved that our method has better outcome in recall, F1 value, and MAE value if compare to the traditional methods. Our results also show that the proposed method can handle the scalability problem.

並列關鍵字

Similarity Measures ； Explicit Rating ； Recommendation Systems

參考文獻

[1] G. Adomavicius and Y. Kwon(2007), “New Recommendation Techniques for Multicriteria Rating Systems,” IEEE Intelligent Systems, vol. 22 , no. 3, pp. 48-55.

[2] H. J. Ahn(2008), “A New Similarity Measure for Collaborative Filtering to Alleviate the New User Cold-starting Problem,” International Journal of Information Sciences, vol. 178, no. 1, pp. 37-51.

[4] M. Balabanovic and Y. Shoham(1997), “Fab: Content-based, Collaborative Recommendation,” Communications of the ACM, vol. 40, no.3, pp.66-72.

[7] M. D’agostino, and V. Dardanoni(2009), “What’s so special about Euclidean distance?,” Social Choice and Welfare, vol. 33, no. 2, pp. 211-233.

[8] A. L. Deng, Y. Y. Zhu, and BL. A. Shi(2003), “Collaborative Filtering Recommendation Algorithm Based on Item Rating Prediction,” Journal of Software, vol. 14, no. 9, pp. 1621-1628.

被引用紀錄

曾國書（2012）。基於Android平台之個人化應用程式管理系統〔碩士論文，淡江大學〕。華藝線上圖書館。https://doi.org/10.6846/TKU.2012.00201

吳宗翰（2012）。具動態時段之適地性協同過濾推薦系統〔碩士論文，國立臺北科技大學〕。華藝線上圖書館。https://doi.org/10.6841/NTUT.2012.00093

李建宏（2012）。可改善推薦系統評價預測之使用者分群方法研究〔碩士論文，國立中正大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0033-2110201613520469

楊銘鴻（2013）。自動推薦於資源整合最佳化之研究-以健康休閒產業育成資源為例〔碩士論文，國立臺北大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0023-2607201314413300

楊昇翰（2016）。以社群網評論分析作為信任基礎之推薦系統〔碩士論文，朝陽科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0078-1108201714025314

國際替代計量

以顯性評價為主之相似性推薦

主題瀏覽