透過您的圖書館登入
IP:3.146.255.113
  • 期刊

Research on Personal Credit Risk Assessment based on Machine Learning Algorithm

摘要


The main purpose of this paper is to use machine learning algorithm to establish different classification models to evaluate and predict personal credit risk. In this paper, we take the data of give me some credit in the kaggle competition as an example. We take the seriousdlqin2yrs variable which is overdue for more than 90 days or worse as the target variable, and take other characteristic variables of the data as independent variables for modeling and analysis. In the stage of data preprocessing, this paper first uses the k-nearest neighbor method to fill in the missing values in the data, then processes the outliers in the data, and tests the multicollinearity among variables. In the process of model construction, logistic regression model is constructed by step-by-step screening method, decision tree model is constructed by cart algorithm, classification prediction model is constructed by SVM algorithm, and integrated model is constructed based on three algorithms. With the help of AUC value and ROC curve, by comparing the prediction effect of different models in training data set and test data set, it is found that the integrated learning model performs better, has higher classification effect and has stability.

參考文獻


Fisher R A.The use of multiple measurement in taxonomic problems[J].Annuals of eugenics,1936(7):179-188.
Baesens B,Van Gestel T, Viaene S. Benchmarking state-of-the-art classification algorithms for credit scoring[J].Journal of the operational research society,2003(54): 627-635.
Makowski P. Credit scoring brancher out [J]. Credit world, 1985, 75: 30-37.
Kuang Nan Fang, Guijun Zhang, Huiying Zhang. An early warning method of personal credit risk based on lasso logistic model [J]. Research on quantitative economy, technology and economy, 2014,31 (02): 125-136.
Wenbing Xiao, qi Fei. Research on the personal credit evaluation model and optimal parameter selection based on support vector machine [J]. System engineering theory and practice, 2006 (10): 73-79.

延伸閱讀