透過您的圖書館登入
IP:3.145.16.90
  • 學位論文

隨機森林模型效力評估

Evaluating the Effectiveness of Random Forest Model

指導教授 : 洪慧念

摘要


隨機森林是一種在機器學習中熱門的演算法,它是由多棵決策樹所組成的模型(model),我們先生成指定的決策樹個數(ex:100),再由所有決策樹所估計的結果投票(類別型應變數)或取平均(連續型應變數)來做最後的結果預測。在R軟體的randomForest套件(package)中,隨機森林在使用上十分方便,只要決定決策樹的個數(ntry)和節點分枝的變數隨機選取的個數(mtry)就可以進行資料的分析,而其分析真實資料的結果(第3章)比一些統計模型還要好,且同時也有尋找重要變數的功能,是一個十分完整且方便的模型。

關鍵字

隨機森林

並列摘要


Random Forest is a popular machine learning algorithms. It is a decision tree model consists of multiple trees. First, we generate a specified number of tree (ex: 100), then we predict the final result by taking average of all the results (for continuous response) or by majority voting of the results (for categorical response). Random forests in R software package “randomForest” is very easy to use. As long as we choose the number of the decision tree (ntry) and the number of variables to be selected for node branching (mtry), then we can analyze the data by this model. Its analysis results of the real data (Chapter 3) are better than some of the statistical model. What’s more, our model also has the ability for finding important variables. Therefore, it is a very complete and convenient model.

並列關鍵字

Random Forest

參考文獻


[1] Breiman, L. (1996). Bagging predictors. Machine learning, 24(2), 123-140.
[3] Breiman, L. (2001). Random Forests. Machine learning, 45(1), 5-32.
[8] Hoerl A.E. and Kennard. R.W. Ridge regression: Biased estimation for nonorthogonal problems. Technometrics, 12(3):55-67, 1970.
[11] Svetnik, V., Liaw, A., Tong, C., Culberson, J. C., Sheridan, R. P., & Feuston, B. P. (2003). Random forest: a classification and regression tool for compound classification and QSAR modeling. Journal of chemical information and computer sciences, 43(6), 1947-1958
[12] Trevor Hastie,Robert Tibshirani,Jerome Friedman(2001),The Elements of Statistical Learning:Data Mining, Inference, and Prediction, New York , Springer

被引用紀錄


簡國棟(2016)。以本體知識結合資料探勘進行財報舞弊偵查〔碩士論文,國立中正大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0033-2110201614062703
黃裕仁(2017)。預測試管嬰兒成功率-使用隨機森林、RIPPER及決策樹資料探勘演算法〔碩士論文,國立虎尾科技大學〕。華藝線上圖書館。https://www.airitilibrary.com/Article/Detail?DocID=U0028-2906201722334000

延伸閱讀


國際替代計量