透過您的圖書館登入
IP:18.191.202.45
  • 期刊

Decision tree improvement algorithm and its application

摘要


Aiming at the problems of low efficiency and excessive fitting in data mining classification processing of decision tree algorithm. Therefore, in the process of data mining, the C4.5 algorithm was deeply studied and an improved algorithm, namely BC4.5 algorithm, was proposed. The main idea of the proposed algorithm is a branch of the improved C4.5 algorithm and the Pruning strategy measure and adjust the C4.5 algorithm in the attribute information gain rate scope, comparing the information gain and probability is obtained by bayesian classifier, use a simplified CCP (Cost-Complexity Pruning) method and evaluation standard, the procedure of the subtree root node has to generate the decision tree surface five check five gain value, to determine whether to remove the decision tree nodes and branches. Simulation experiments are conducted on the improved C4.5 algorithm and the traditional algorithm. The results showed that the improved C4.5 algorithm has a significant improvement in execution time, which is 8.75% shorter than the traditional algorithm. With the increase of the number of experiments, the accuracy rate of the improved algorithm reaches more than 90%.

關鍵字

C4.5 B-C4.5 Pruning strategies CCP Decision tree

參考文獻


J.Sanz, J.Fernandez, H.Bustince, C.Gradin,”A decision tree based approach with sampling techniques to predict the survival status of poly-trauma patients,”IJCIS,vol.10,pp.440–455,2017.
BERGMAN R N, KALABA R E, SPINGARN K. Optimizing Inputs for Diagnosis of Diabetes I. Fitting a Minimal Model to Data[J].Journal of Optimization Theory and Applications.2011, 20(9):317-320.
SONETHUNG D,SRIPANIDKULHALI. Improving type 2 diabetes mellitus risk prediction[C]. International Joint Conference Computer Science and Software Engineering(JCSSE),2016.
DEWAN MD,FARID. Improve the quality of supervised discretization of continuous valued attributes in Data Mining[C].Proceeding of 14th International Conference on Computer and Information Technology(ICCIT 2011),2011.
S.Hamali, R.P.NSuci,A.F.Utami,Hanisman and FArga,"Using analytic hierarchy process and Decision Tree for a production decision making,"2016 International Conference on Information Management and Technology (ICIMTech),Bandung,Indonesia,2016, pp.329-332.

延伸閱讀