Research on Financial Default Risk Prediction Method based on Big Data Model

Issuing credit is the main source of income for banks, but borrower defaults can cause them heavy losses. Evaluating a borrower's potential default risk and estimating the probability of default before a loan is issued is therefore a foundational part of credit risk management in modern financial institutions. This paper applies imbalanced-data classification to the statistical analysis of historical data from banks and other financial institutions, and builds a loan default model using the random forest algorithm. Experimental results show that the neural network and random forest algorithms outperform the decision tree and logistic regression classifiers in predictive performance. Moreover, the random forest model can rank features by importance, which shows directly how much reference value each item of borrower data carries and so helps lenders judge lending risk in the financial field.

Keywords

Random Forest; Bank Credit Investigation; Loan Default Prediction; Data Mining
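The workflow summarized in the abstract (fit a random forest to imbalanced loan data, estimate default probabilities, then rank features by importance) can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: it assumes scikit-learn, uses a synthetic dataset in place of real bank records, invents the feature names, and handles class imbalance with `class_weight="balanced"`, which may differ from the technique the authors used.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for loan records: roughly 10% of rows are defaults (class 1).
X, y = make_classification(
    n_samples=2000, n_features=6, n_informative=3, n_redundant=0,
    weights=[0.9, 0.1], random_state=0,
)
feature_names = [f"feature_{i}" for i in range(X.shape[1])]  # hypothetical names

# Stratify so the rare default class appears in both splits.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# class_weight="balanced" reweights classes inversely to their frequency.
# This is one common way to handle imbalance, assumed here for illustration.
model = RandomForestClassifier(
    n_estimators=200, class_weight="balanced", random_state=0
)
model.fit(X_train, y_train)

# Estimated default probability for each held-out borrower.
default_prob = model.predict_proba(X_test)[:, 1]
auc = roc_auc_score(y_test, default_prob)

# Rank features by impurity-based importance, most important first.
order = np.argsort(model.feature_importances_)[::-1]
ranking = [(feature_names[i], float(model.feature_importances_[i])) for i in order]

print(f"ROC AUC: {auc:.3f}")
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

With real data, the synthetic `X` and `y` would be replaced by borrower attributes and observed default labels. ROC AUC is used here instead of accuracy because accuracy can look high on imbalanced data even when no defaults are detected.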

