Logistic模型在银行贷款中的应用
The Application of Logistic Model in Bank Loan
摘要: 本文使用Logistic模型对台湾客户是否违约支付建立预测模型,通过这个模型可以在银行给客户贷款时判断客户是否会违约。首先,由于数据中有23个变量,其中有些变量并不显著,遂采用最优子集的方法判断出模型最优的变量个数为8。再通过Forward Stepwise Selection方法选择出8个变量并对此建立Logistic模型。通过将数据分为训练集和测试集来得到模型的精准度:模型整体预测准确率为80.2%,总体精度还算可以,模型对客户不违约的预测还是非常准确,但对客户违约的预测非常不理想。同时,采用另一种可视化的方法衡量模型的优劣,即ROC曲线,计算出AUC的值为0.66。模型的结果优于我们随机猜测,具有预测价值。
Abstract:
In this paper, the Logistic model is used to establish a prediction model for the default payment of Taiwan customers. Through this model, the bank can judge whether the customer will default when lending to the customer. First of all, since there are 23 variables in the data, some of which are not significant, the optimal number of variables in the model is judged to be 8 by the optimal subset method. Then the Forward Stepwise Selection method selects 8 variables and establishes the Logistic model. The accuracy of the model was obtained by dividing the data into training set and test set: the overall prediction accuracy of the model was 80.2%, and the overall accuracy was reasonable. The prediction of non-default by the model was still very accurate, but the prediction of default by the customer was very unsatisfactory. At the same time, another visual method was used to measure the merits of the model, namely the ROC curve, and the value of AUC was calculated as 0.66. The results of the model are better than our random guesses and have predictive value.
参考文献
|
[1]
|
宫宏宇, 谢艺观. 硅谷银行倒闭会引发新一轮金融危机吗? [J]. 宁波经济(财经视点), 2023, 589(4): 45-47.
|
|
[2]
|
石伊凡. 1929年经济危机、2008年金融危机的成因以及相似度分析[J]. 经济师, 2023, 407(1): 27-28.
|
|
[3]
|
任重. 美国次级贷危机的成因分析[J]. 杭州电子科技大学学报(社科版), 2009, 5(S1): 25-28. [Google Scholar] [CrossRef]
|
|
[4]
|
胡冰. 次贷危机对我国商业银行的启示[J]. 合作经济与科技, 2009, 376(17): 76-77.
|
|
[5]
|
彭建刚, 张丽寒, 刘波, 等. 聚合信用风险模型在我国商业银行应用的方法论探讨[J]. 金融研究, 2008, 338(8): 72-85.
|
|
[6]
|
彭佳琪, 徐璐. 武汉市在校生垃圾分类认知和行为调查研究——基于二元Logistic回归模型[J]. 科教导刊, 2022, 475(7): 151-154. [Google Scholar] [CrossRef]
|
|
[7]
|
李健民, 刘添文, 符思远, 等. 基于最优子集法建立肠道准备预测模型的研究[J]. 中国实用内科杂志, 2020, 40(3): 231-236. [Google Scholar] [CrossRef]
|