分散式資訊網路資料下之隱私保護SimRank計算方法

資訊網路分析在最近幾年吸引了許多研究者的注意。而在各種已知的網路資料分析方法中，如何去計算各個節點之間的相似程度是一個很重要的問題。如果我們能夠計算各個節點之間的相似程度，對於像是資料分群、連結預測和群組辨識等等的應用都會有很大的幫助。然而在一個龐大的資料網路中，節點之間連結的分布通常都是非常稀疏的，需要將更多的資料結合在一起做分析才能得到更準確的節點相似程度計算結果。這也使得各個資料擁有者願意彼此合作來完成資料的分析。如何在不洩漏任何隱私的情況下結合各個陣營的資料來計算節點間相似程度便是一個很重要的問題。在這篇論文中，我們提出了一個解決方法，PP-SimRank。PP-SimRank利用完全同態加密來保護所有節點間連結的資訊，並且在不會洩漏任何隱私的情況下完成所有節點相似程度的計算。

關鍵字

隱私保護；節點相似程度計算

並列摘要

Information network analysis has drawn a lot attention in recent years. Among all the aspects of network analysis, similarity measure of nodes has been shown useful in many applications, such as clustering, link prediction and community identification, to name a few. As linkage data in a large network is inherently sparse, it is noted that collecting more data can improve the quality of similarity measure. This gives different parties a motivation to cooperate. In this paper, we address the problem of link-based similarity measure of nodes in an information network distributed over different parties. Concerning the data privacy, we propose a privacy-preserving SimRank protocol based on fully-homomorphic encryption to provide cryptographic protection for the links.

並列關鍵字

Privacy-preserving ； Similarity Measure

參考文獻

international conference on Knowledge discovery and data mining, pp. 797–806,

conference on Knowledge discovery and data mining, pp. 1298–1306, 2011.

works,” Journal of the American society for information science and technology,

[6] R.A. Jarvis and E.A. Patrick, “Clustering using a similarity measure based on shared

1034, 1973.

國際替代計量

分散式資訊網路資料下之隱私保護SimRank計算方法

全文下載

主題瀏覽