社群網路生成在社群網路分析領域是一個重要的問題。人工產生的隨機圖被用來模擬真實世界社群網路已經行之有年。隨著真實世界社群網路如Twitter, Facebook等快速膨脹,我們需要可以產生大規模隨機圖的方法。Google提出的MapReduce[1]運算架構提供了我們將社群網路生成演算法平行化的需要及儲存大型網路檔案的分散式檔案系統。這篇論文提出將BA model[5]當中preferential attachment機制用到的degree由實際值代換成期望值,藉此讓BA model的生成過程可以被獨立地分割利於平行化。我們以MapReduce做實驗的結果顯示我們設計的平行演算法有良好的可擴放性並且產生出來的圖保有BA model原本的特性如power-law degree分布。這篇論文也將由Sergei Vassilvitskii在XXL Graph Algorithms[9]提出數三角形問題中把高degree的點分開處理的想法應用到最大團問題的MapReduce演算法。實驗的結果顯示這個方法讓計算效率有明顯的改進。
Social network generation is an important problem in social network analysis. Artificially generated random graphs are used to model real world social networks for a long time. With the expanding size of real world social networks such as Twitter, Facebook, we need some methods to generate large scale random graphs. MapReduce[1] proposed by Google is the right framework to provide parallelism on social network generation algorithm and distributed file system to store large output graph data. In our paper we propose a method to replace the exact degree, which is used as the attachment preference in original BA model[5], with the expected value of degree. By this substitution BA model generation algorithm can be divided into several independent tasks to fit in parallel computing. And our experiment results on MapReduce shows that after this substitution, graphs generated by BA model keep their original properties such as power-law degree distribution with good scalability. In this paper we also apply the idea of differentiate high degree vertices in triangle counting algorithm proposed in XXL Graph Algorithms by Sergei Vassilvitskii[9], to finding maximal clique algorithm in MapReduce. As a result of our experiment, the computation efficiency is obviously improved.
為了持續優化網站功能與使用者體驗,本網站將Cookies分析技術用於網站營運、分析和個人化服務之目的。
若您繼續瀏覽本網站,即表示您同意本網站使用Cookies。