
Author Disambiguation by Mining the Coauthor Graph (基於共同作者網路之作者名稱消歧異方法)

Advisor: Shou-De Lin (林守德)

Abstract


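Today we often rely on search engines to look up academic papers. Many mainstream search engines offer services dedicated to searching scholarly articles, such as Google Scholar and Microsoft Academic Search. Many forums also maintain their own online paper archives. One of the difficult problems these systems face is author name ambiguity: because an author may write their name differently in different places, a search engine must recognize the multiple written forms of one person's name in order to return accurate and complete results. Author names are among the main keywords users search by, so resolving author name ambiguity is an important capability for a search engine.

Author name ambiguity has many causes, and one of the main ones is that a name can be written in many ways. Solving the problem usually depends on auxiliary information about the individual author, but such information is not always easy to obtain. In this thesis, we design a system that attempts to resolve the author name ambiguity caused by authors writing their names in different ways. It needs only the names under which authors sign their papers and the coauthor network as auxiliary information. Experiments show that our system is more useful than some traditional approaches.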

Parallel Abstract


Nowadays, we rely mostly on search engines when surveying academic papers. Many mainstream search engines, such as Google Scholar and Microsoft Academic Search, provide searches specific to academic articles. Many conferences also host their own online paper archives. One of the challenging problems for these systems is author name disambiguation: an author may write their name differently in different places. Because the author name is one of the most common keywords used to search for papers, it is crucial for search engines to recognize different names that belong to the same person in order to return accurate and complete results. Though author name ambiguity has many causes, one of the most common is that a single name can be represented in different ways. Solving this problem generally requires much supplementary information about the authors that may not be available. In this paper, we propose a system that tries to resolve the author name ambiguity caused by the different ways authors write their names; the only information it requires is the authors' names and their coauthor relationships. Experiments show that our system is more effective than some traditional methods.
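The abstract names the two inputs (author name strings and coauthor relationships) but does not describe the algorithm itself. The sketch below is therefore only an illustration of how those two signals might be combined, not the thesis's method. It assumes two name strings are merge candidates when their surnames match, their given-name tokens are prefix-compatible (so an initial matches a full name), and they share at least one coauthor string. The function names, the `min_shared` threshold, and the sample names are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def name_key(name):
    """Return (lowercased surname, lowercased given-name tokens).

    Assumes "Given ... Surname" order; periods and commas are ignored.
    """
    parts = name.replace(".", " ").replace(",", " ").split()
    if not parts:
        return "", []
    return parts[-1].lower(), [p.lower() for p in parts[:-1]]


def compatible(a, b):
    """True when two name strings could plausibly denote one person.

    Surnames must match, and each aligned pair of given-name tokens must be
    prefix-compatible, so "J." matches "John" but "Jane" does not.
    """
    last_a, given_a = name_key(a)
    last_b, given_b = name_key(b)
    if last_a != last_b or not given_a or not given_b:
        return False
    return all(x.startswith(y) or y.startswith(x)
               for x, y in zip(given_a, given_b))


def coauthor_sets(papers):
    """Map each name string to the set of name strings it has published with."""
    coauthors = defaultdict(set)
    for authors in papers:
        for a in authors:
            coauthors[a].update(x for x in authors if x != a)
    return coauthors


def merge_candidates(papers, min_shared=1):
    """List pairs of compatible name strings sharing >= min_shared coauthors."""
    coauthors = coauthor_sets(papers)
    return [(a, b)
            for a, b in combinations(sorted(coauthors), 2)
            if compatible(a, b)
            and len(coauthors[a] & coauthors[b]) >= min_shared]


# Hypothetical toy data: each paper is its list of author name strings.
papers = [
    ["John Smith", "Wei Chen"],
    ["J. Smith", "Wei Chen", "Maria Garcia"],
    ["Jane Smith", "Ahmed Khan"],
]
print(merge_candidates(papers))
```

In this toy data, "J. Smith" is name-compatible with both "John Smith" and "Jane Smith", and the shared coauthor is what separates the two candidates. The sketch compares coauthors by exact string, so it would miss cases where the coauthors' own names also vary; a fuller system would likely need to iterate or propagate merges through the graph.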

