透過您的圖書館登入
IP:216.73.216.60
  • 期刊

Research on Financial Default Risk Prediction Method based on Big Data Model

摘要


The issuing of credit is the main source of income for Banks, but the borrower's default will bring huge losses to Banks. How to effectively evaluate and identify the borrower's potential default risk and calculate the borrower's default probability before issuing loans is the foundation and important link of modern financial institutions' credit risk management. This paper mainly studies the statistical analysis of the historical data of Banks and other financial institutions by using the idea of non-equilibrium data classification, and establishes the model of loan default by using the random forest algorithm. Experimental results show that neural network and random forest algorithm outperform decision tree and logistic regression classification algorithm in predicting performance. Moreover, the stochastic forest model can be used to rank the importance of features, which intuitively reflects the important reference value of different data of borrowers, so as to effectively judge the risk of lending in the financial field.

參考文獻


Kumaresan.A, A case study on system integration efforts involved in achieving fast data in a financial industry.
Hsueh,S. Kuo,C, Effective Matching for P2P Lending by Mining Strong Association Rules. P30-33.
Eric W. Fox, Jay M. Ver Hoef, Anthony R. Olsen, Comparing spatial regression to random forests for large environmental data sets, March 23, 2020 https://doi.org/10.1371/journal.pone.0229509.
Donges, N, A complete guide to the random forest algorithm. 2020.
Well,K, Dec 28, 2017, Random Forest Simple Explanation. Retrieved from https://medium.com/ @williamkoehrsen/random-forest-simple-explanation-377895a60d2d.

延伸閱讀