對於一份經由表面增強的雷射解析電離飛行時間質譜儀(Surface Enhance Laser Desorption/Ionization Time-Of-Flight Mass Spectrometry, SELDI-TOF MS)並且搭配該儀器生產公司所附的Ciphergen Protein Chip 軟體產生的我國國人臨床肝癌峰值資料,想要將其做患有癌症與否的分析是有著相當程度的困難。雖然已經藉由軟體大幅降低所需分析的資料量和雜訊,但是卻衍生出其他的問題。所以本文先提出由生物醫學之觀點,將資料轉換為有相同依據的特徵向量。再以此作為分析上的依據,使用特徵選取搭配支援向量機做資料的分類。 將基因法則(Genetic Algorithm)應用於特徵選取上則是近年來被發展出來的方法。而本文所提出的權重型基因法則(Gene Weighted Genetic Algorithm, GWGA),將染色體中的每一個基因給予一個交配的權重値,並以機率的方式來做交配的運算,改變了傳統基因法則交配的方式以解決容易陷入局部最佳解(local optimum)的問題,並且能夠適當的減少特徵數目。
It’s very difficult to analyse the clinical SELDI-TOF MS (Surface Enhance Laser Desorption/Ionization Time-Of-Flight Mass Spectrometry)data which is obtained form our countrymen. The data also had been pre-processed with the software which is appended with the instrument producer. Although the data have already been reduced the capacity of data and noise, come into another problems. In this way, it is proposed to transform the data into feature vectors on equal basis which according to the biomedical point in this dissertation. Using feature selection with SVM(Support Vector Machine) to classifer these feature vectors datas. Among the different categories of feature selection algorithms, the genetic algorithm (GA) is a rather recent development. In this dissertation, the Gene Weighted Genetic Algorithm, that is give a gene weighted value in each gene of chromosome, and make the crossover operation by the probability, this change can reduce local optimum when used traditional method of Genetic Algorithm.