我們提出了一個主動式學習的方法來解決網路化資料的分類問題。在網路的領域,當資料間彼此有鏈結時他們可能會有相似的屬性。近年來,一些集體分類的方法利用鏈結的資訊來改善網路化資料的分類方法已經被提出。但集體分類的方法需要不斷迭代來推論出鄰居特徵,那將會花費大量的時間在訓練集體分類器上。因此,我們提出了一個主動式學習的方法來減少訓練的時間而且準確度較集體分類方法更佳。我們主要的想法是利用分群來聚集有鏈結以及有相似屬性的資料以增加訓練的效率。我們使用區域性網路分群的方法來對資料分群,接著在每個群獨立訓練分類器。在實驗部分,我們使用真實世界的網路資料來評量我們的方法以及顯示我們的方法勝過其他的基準線方法,包含了使用集體分類方法的ALFNET。
This paper presents a novel active learning approach to solve the classification of networked data. In network domain, the instances may have similar attributes when they are connected for each other. Recently, some collective classification methods used the link informations to solve the classification of networked data have been proposed. But collective classification need to infer the neighbor features iteratively, that will spend a lot of time learning a collective classifier. Hence, we present an active learning approach that reduce the learning time and the accuracy also be better than collective classification methods. Our main idea is to increase the learning efficiency by clustering the data and aggregate the instances which has links and similar attributes. We use community-based network clustering method to cluster the data, and then train the local classifiers independently for each cluster. In the experiments, we evaluate our approach on real-world networks and show that our method outperforms several baselines, including the ALFNET [6] which using the collective classifiers.
為了持續優化網站功能與使用者體驗,本網站將Cookies分析技術用於網站營運、分析和個人化服務之目的。
若您繼續瀏覽本網站,即表示您同意本網站使用Cookies。