透過您的圖書館登入
IP:3.15.46.13
  • 學位論文

使用基因演算法推估Google搜尋引擎的網頁排名因素及其權重

Estimating Google’s Ranking Factors and Their Weights Using a Genetic Algorithm Approach

指導教授 : 陸承志

摘要


本研究旨在以機器學習方法來找出逼近Google搜尋引擎排名的可操作性排序因素以及其權重。所謂可操作性,指的是網站擁有者或者網路行銷業者可以據以來做搜尋引擎最佳化 (Search Engine Optimization, SEO),亦即適度調整網頁的內部或外部品質,以便在特定關鍵字的搜尋結果中獲得排名的提昇。我們關心的是那些可以從搜尋引擎提供的管理者工具或者客觀的第三方取得公開數據的排序因素,而非所有可能的排序因素。本研究以四類工業產品的關鍵詞 (query) ,蒐集 Google 搜尋結果前20筆網頁,且以不同排序因素分成三個階段進行實驗: (1) 外部連結與PageRank之間的關聯、 (2) Authority與PageRank之間的關聯、 (3) 綜合實驗。本研究實驗結果顯示在不同關鍵詞與多種因素組合下計算出的權重值,一致地呈現 PageRank 的權重值遠比其他因素來得高,增加外部連結或Authority等因素對排名預測結果的影響很少。

並列摘要


The study aims to approximate Google’s ranking factors and their weights by a genetic algorithm based method. The factors we are interested in are those whose data are publicly available from webmasters tools provided by search engines or other third-party providers, rather than all possible ranking factors. We collect the top 20 results from Google search results and divided three parts into ranking factors for four categories of industrial products' keywords as our dataset. Three experiments were conducted to find the : (1) Correlation between the External links and PageRank ; (2) Correlation between the Authority and PageRank ; (3) the weights of all factors considered. Experimental results indicated that, in all combinations of factors, PageRank consistently dominates the search results ranking in our experiment and adding other factors such as number of links and authority had little effect on the precision improvement of the new ranking results.

參考文獻


[6] Craswell, N., S. Robertson, H. Zaragoza, and M. Taylor (2005), Relevance Weighting for Query Independent Evidence, Proceedings of the 28th annual international ACM SIGIR, pp. 416-423
[3] Beel, J., B. Gipp, and Erik Wilde (2010), Academic Search Engine Optimization (ASEO): Optimizing Scholarly Literature for Google Scholar & Co., Journal of Scholarly Publishing, p. 176-190
[4] Bifet, A., C. Castillo, P. Chirita, and I. Weber (2005), An Analysis of Factors Used in Search Engine Ranking, In First International Workshop on Adversarial Information Retrieval on the Web, 2005, pp. 1-10
[8] Evans, M. P. (2007), Analysing Google rankings through search engine optimization data, Internet Research, Vol. 17 No. 1, p. 21-37
[9] Fortunato S., M. Boguna, A. Flammini and F. Menczer, (2008), Approximating PageRank from In-Degree, ALGORITHMS AND MODELS FOR THE WEB-GRAPH, Lecture Notes in Computer Science, 2008, Volume 4936/2008, 59-71

被引用紀錄


楊盛安(2013)。利用語意相關詞和基因演算法來逼近中文搜尋引擎排名〔碩士論文,元智大學〕。華藝線上圖書館。https://doi.org/10.6838/YZU.2013.00049

延伸閱讀