結合關聯法則與模糊叢聚之網際探勘架構

現今網際探勘領域中，統計叢聚技術常被用來分析網站瀏覽者對網頁之瀏覽偏好。然而此法只能將每一使用者瀏覽路徑歸類於單一群組中，即事先假設每一瀏覽路徑只包含單一種使用者偏好，卻忽略同一使用者瀏覽路徑可能包含數個網頁偏好。對此，另有學者提出模糊叢聚技術以彌補上述之不足。但此類型研究於分析瀏覽路徑相似程度方面，只能根據網頁距離計算。因此當網站瀏覽者以不同瀏覽路徑觀看相同網頁時，容易產生錯誤的分析結果。針對上述情況，本論文提出一結合模糊叢聚技術及關聯法則之網際探勘架構。此法首先過濾瀏覽路徑中可能造成分析誤差之超連結網頁，再利用關聯法則計算網頁間之關聯性。最後則擴充模糊叢聚技術於瀏覽路徑相似度之計算方式，以網頁關聯法則信心度取代網頁距離，藉由適當的分群以求得使用者真正之瀏覽偏好。

關鍵字

網際探勘；使用者瀏覽路徑；模糊叢聚分析；相似程度；關聯法則

並列摘要

Lately, most studies have relied on statistic clustering techniques to analyze web user profile data in web mining. However, this approach can only sort each user session into a single cluster. That is, it ignores a user session may contain several browsing prefers. According to this insufficiency, fuzzy clustering techniques were proposed instead. But those methods only can use similarity score of session to calculate the similarity between pages. Therefore, if users browse the same web page by different paths, that causes wrong results. This research proposes a framework which combines the fuzzy clustering and association rules. This approach filters out the noisy data, and employs association rules to calculate the confidence of the rule as the association between different URL addresses. Finally, an improved fuzzy clustering is adopted, which replaces the similarity score of session with the confidence between pages, to found out the user prefers effectively.

並列關鍵字

Web Mining ； User Session ； Fuzzy Cluster ； Similarity ； Association Rule

參考文獻

[3] 0M. Carl Drott, "Using Web Server Logs to Improve Site Design", SIGDOC 1998, Pages 43-50.

[4] 0 Myra Spiliopoulou , "Web Usage Mining for Web Site Evaluation", Commun. ACM 43, 8(Aug. 2000), Pages 127-134.

[5] 0Cooley, R., Mobasher, B., Srivastava, J., "Web Mining: Information and Pattern Discovery on the World Wide Web", Proceedings of the Ninth IEEE International Conference on Tools with Artificial Intelligence 1997, Pages 558-567.

[9] Sadaaki Miyamoto, "An Overview and New Methods in Fuzzy Clustering", 1998. Proceedings KES ’98. 1998 Second International Conference on, Volume:1, 1998, Pages 33-40.

[10] Frigui, H., Krishnapuram, R., "A Robust Competitive Clustering Algorithm with Applications in Computer Vision", IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume: 21 Issue:5, May 1999, Pages 450-465.

被引用紀錄

邱佳偉（2008）。以關聯式規則與序列型樣探勘網路瀏覽行為之研究－以國內某休閒旅遊服飾網站為例〔碩士論文，國立臺北科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0006-0407200814233800

國際替代計量

結合關聯法則與模糊叢聚之網際探勘架構

主題瀏覽