基於混合式監督學習法之線上社群網路異常偵測

隨著線上社群網路的迅速發展，世界各地都在各個社交平台上提供的大量信息。儘管大部分的資訊都是有用的，但在平台上仍然存在着一些實體利用異常用戶傳播惡意內容（如垃圾郵件或謠言以實現其金錢或政治目標）。本文中，我們提出了根據用戶的個人資料，以及過去發布的動態的資料去建立異常用戶偵測機制。除了以往文獻考慮的特徵之外，我們還設計了幾個與近似重複內容（包括詞彙相似度，語義相似度）相關的特徵，並結合以監督式學習方法以提高異常用戶檢測的準確性

關鍵字

線上社群網路；異常偵測；近似重複計算；監督式學習

並列摘要

With the rapid development of online social network (OSN), worldwide connections between each individual turn into much stronger. A huge number of information provided by some entities around the world are well dispersed in OSN every day. Most of those are useful but not all as anomalous entities utilize anomaly users to spread malicious content (like spam or rumors to achieve their pecuniary or political aims. In this paper, we propose a mechanism to detect such anomaly users according to the user profile and tweet content of each user. Besides the features considered in the past literature, we design several new features related to near-duplicate content (including lexical similarity, semantic similarity) to enhance the precision of detecting anomaly users. Utilizing the data by public honeypot dataset, the proposed approach deals with supervised learning approach to carry out the detection task.

並列關鍵字

Online Social Network ； Anomaly Detection ； Near-Duplicate Computing ； Supervised Learning Approach

參考文獻

[1] B.DYBWAD, “Twitter Drops ‘What are You Doing?’ Now Asks ‘What’s Happening?,’” Mashable.com. 2009.

Google Scholar

[2] Curalate, “Social Content is the New Storefront,” no. November, pp. 1–23, 2017.

Google Scholar

[3] E.GRIECO, “More Americans are turning to multiple social media sites for news,” Pew Search Center, 2017. .

Google Scholar

[4] Z.Chu, S.Gianvecchio, H.Wang, andS.Jajodia, “Who is Tweeting on Twitter: Human, Bot, or Cyborg?,” Acsac 2010, p. 21, 2010.

Google Scholar

[5] K.Thomas, C.Grier, J.Ma, V.Paxson, andD.Song, “Design and evaluation of a real-time URL spam filtering service,” Proc. - IEEE Symp. Secur. Priv., pp. 447–462, 2011.

Google Scholar

國際替代計量

基於混合式監督學習法之線上社群網路異常偵測

全文下載

主題瀏覽