透過您的圖書館登入
IP:3.14.15.94
  • 學位論文

混合式進化演算法於微陣列資料分類法則探勘之研究

Using A Hybrid Meta-evolutionary Algorithm for Mining Classification Rules Through Microarray Data

指導教授 : 陳大正
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


隨著資訊技術的蓬勃發展,基因微陣列資料成為癌症分類研究中重要的研究領域。由於基因微陣列資料具有高維度基因屬性且樣本數少的特性,因而造成在分類時辨識度低,而且需要冗長的運算時間成本。因此,在面臨基因微陣列資料分類問題時,如何萃取出關鍵且具代表性的基因屬性以獲得較高的分類預測準確率並且有效的提升運算品質成為一項重要的研究議題。因此,本研究提出一個以基因演算法結合二元粒子群最佳化演算法之法則探勘模型,藉由混合式進化演算法在分類問題上進行樣本數值的評量,並同時萃取預測變數、相對應的隸屬函數及參數值之模糊分類法則,透過微陣列資料的屬性維度之調整及隸屬函數的選擇,僅需使用較少量的基因屬性即可達到高準確率的分類。此外,透過模糊法則可觀察出資料屬性與類別之間的相互關係。為了降低分類時所需的龐大運算,本研究結合了格網運算技術來有效降低分析運算所需耗費的時間成本,以建立高預測準確率之分類法則。將實驗結果與過去相關文獻中的方法以及商業探勘軟體進行比較,透過研究顯示本研究所提出的方法能獲得較佳的分類正確率並有效的降低運算的時間。

並列摘要


With the rapid development of information technology, microarray data is an important field of study for cancer research. However, microarray data is with high dimensional attributes and small sample size resulting in lengthy computation time and low classification accuracy. Due to gene microarray data classification issues, how to get more accurate prediction results with better quality becomes an important area of research. This thesis has proposed a hybrid evolutionary algorithm which combines a genetic algorithm and binary particle swarm optimization with fuzzy discriminate function. The proposed method is used to estimate the fitness value for classification, significant variables extraction, and parameters of fuzzy membership function in the meanwhile. Through the adjustment of the dimension of microarray data and the choice of membership function, fewer significantly characteristic attributes can reach high classification accuracy. Fuzzy rules can also be observed through data attributes and the relationship between categories. To reduce the vast computation time for classification process, this study integrates grid computing technology in the proposed approach. The experimental results show our proposed method can achieve higher classification accuracy and effectively reduces the computation time.

參考文獻


[39] 曹惠鈴,"運用混合式進化演算法於法則探勘之回應模型",國立虎尾科技大學資訊管理系碩士論文,2007。
[1] Fayyad, U. and Uthurusamy, R., "Data mining and knowledge discovery in databases", Communications of the ACM, 39(11), pp. 24-26, 1996.
[2] Gelatt C. D.and Kirkpatrick S. and Jr., and Vecch M. P. i, "Optimization by simulated annealing ", Science, Vol. 220, pp. 671-680, 1983.
[3] Cheung K. -W.and Kwok, J. T. and Law, M. H., and Tsui, K. -C, "Mining customer productratings for personalized marketing ", Decision Support Systems, Vol. 35, pp. 231-243, 2003.
[4] Ooi. P. Tan, "Genetic algorithms applied to multi -class prediction for the analysis of gene expression data ", Bioinformatics, Vol. 19, pp. 37-44, 2003.

被引用紀錄


李宜澤(2014)。應用以網格運算為基礎之免疫演算法改善高維度分數法於干擾控制之研究〔碩士論文,國立虎尾科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0028-1208201422271200

延伸閱讀