進行資料探勘或機器學習時,過多的參數可能會影響學習的效率,而需花費大量的時間進行學習,但是在資料中並非每一個屬性參數對於整個資料都是必要的。粗糙集合理論(Rough Set Theory)在減少屬性參數上是一項有效的工具。本研究將使用粗糙集合理論參數折減方式(reduct),將從UCI資料庫與ROSE2軟體的範例檔案中,篩選出的十六組資料降低屬性參數,再以人工類神經網路(ANN)、支向機(SVM)、貝氏網路、ID3與C4.5決策樹,這五種學習方法檢驗參數折減對學習效率是否提高,並且觀查對學習準確度之影響程度。經由實驗顯示,使用約略集合理論找出最小參數屬性後,在學習效率上確實有提昇,而且在學習準確度,也存有提昇的空間。
For data mining or machine learning, the plethora of parameters that may affect the efficiency of learning, and spend a lot of time to study. Further more not all attributes are important for the data. Rough set theory is an effective tool to reduce the attributes. This study applies the reduced attributes of rough set theory approach. Using the UCI database and example of software ROSE2. 16 set of data were selected to reduce the attributes, and then the artificial neural network, Support vector machine, Bayesian network, ID3 and C4.5 decision tree learning methods are used for comparison. The result show that the application of rough set theory to find the smallest attribute subset for learning, indeed improves the learning efficiency, and accuracy is also improved.