Research on Credit Risk Evaluation and Forecast Method Based on Machine Learning Model

Assessing a borrower's potential default risk before a loan is issued, and estimating the borrower's probability of default, is a foundational step in credit risk management at modern financial institutions. This paper analyzes historical loan data from banks and other financial institutions, treats default prediction as an imbalanced (non-equilibrium) data classification problem, and builds a loan default prediction model with the random forest algorithm. The experimental results show that random forest outperforms both the decision tree and logistic regression classifiers in prediction performance. In addition, ranking features by random forest importance identifies the features with the greatest influence on eventual default, which helps financial institutions assess lending risk more effectively.
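
The workflow summarized above can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not state the dataset, the features, the hyperparameters, the evaluation metric, or how class imbalance was handled. The sketch assumes scikit-learn, a synthetic dataset with roughly 10% defaults standing in for real loan records, class weighting as the imbalance treatment, ROC AUC as the metric, and hypothetical feature names.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for historical loan data (assumption):
# label 1 = default, making up roughly 10% of the rows.
X, y = make_classification(
    n_samples=2000, n_features=10, n_informative=4, n_redundant=2,
    weights=[0.9, 0.1], random_state=0,
)
feature_names = [f"feature_{i}" for i in range(X.shape[1])]  # hypothetical names

# Stratified split keeps the default rate similar in train and test sets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# class_weight="balanced" reweights the rare default class; this is one
# possible imbalance treatment, not necessarily the one used in the paper.
models = {
    "random_forest": RandomForestClassifier(
        n_estimators=200, class_weight="balanced", random_state=0
    ),
    "decision_tree": DecisionTreeClassifier(
        class_weight="balanced", random_state=0
    ),
    "logistic_regression": LogisticRegression(
        class_weight="balanced", max_iter=1000
    ),
}

# Fit each model and score its predicted default probabilities with ROC AUC.
auc_scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    default_prob = model.predict_proba(X_test)[:, 1]
    auc_scores[name] = roc_auc_score(y_test, default_prob)

# Rank features by the random forest's impurity-based importance.
importances = models["random_forest"].feature_importances_
ranking = sorted(zip(feature_names, importances), key=lambda p: p[1], reverse=True)

for name, auc in auc_scores.items():
    print(f"{name}: AUC = {auc:.3f}")
for name, score in ranking:
    print(f"{name}: importance = {score:.3f}")
```

On real loan data the relative ordering of the three models depends on the features and the tuning, so the comparison should be rerun rather than assumed from this synthetic example.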

Keywords

Random forest; loan default forecast; data mining

