基于非平衡数据的机器学习企业财务风险预警模型研究
Research on Machine Learning Models for Enterprise Financial Risk Warning Based on Imbalanced Data
摘要: 制造业企业一般需要较高的研发投入与供应链协同要求,但由于当前国内整体行业的外部环境愈趋复杂、市场竞争激烈,其所面临的财务风险问题亦愈发严峻,因此构建适用于我国制造业企业的财务风险预警模型十分有必要。当前国内制造业的ST与非ST企业数量占比存在极大差距,所以多数研究运用的是非平衡数据。对此本文除了对选取的ST企业进行1:3匹配同等资产规模的非ST企业外,还利用了SMOTE过采样方法对非平衡的样本数据集进行处理。文中共采用了2020年到2023年的168家中国制造业上市企业数据,并综合12种机器学习模型进行了预警效果综合对比。结果表明Extra Trees模型的效果最优,且数据集在经过平衡处理后该模型的预测准确率得到了18%的提升。本研究期望所做的模型研究对于国内制造业企业在财务风险预警防控上具有实际的参考与应用价值,能够助力维护行业与经济的稳健发展。
Abstract: Manufacturing enterprises typically require high R&D investment and supply chain coordination. However, due to the increasingly complex external environment and intense market competition in the domestic industry, the financial risks they face are becoming more severe. Therefore, it is crucial to develop a financial risk warning model suitable for manufacturing enterprises in China. Given the significant disparity in the number of ST and non-ST enterprises in the domestic manufacturing sector, most studies utilize imbalanced data. In response, this paper matches selected ST enterprises with non-ST enterprises of the same asset size in a 1:3 ratio and employs the SMOTE oversampling method to address the imbalanced dataset. The study uses data from 168 Chinese manufacturing listed companies from 2020 to 2023 and compares the performance of 12 machine learning models for risk prediction. The results indicate that the Extra Trees model performs the best, with an 18% improvement in prediction accuracy after balancing the dataset. This research aims to provide practical reference and application value for financial risk warning and prevention in domestic manufacturing enterprises, contributing to the stable development of the industry and economy.
参考文献
|
[1]
|
丁琳. 基于公司治理的上市公司BP神经网络财务预警模型研究[J]. 当代经济, 2008(12): 116-117.
|
|
[2]
|
宋彪, 朱建明, 李煦. 基于大数据的企业财务预警研究[J]. 中央财经大学学报, 2015(6): 55-64.
|
|
[3]
|
龙志, 陈湘州. 企业财务风险预警模型的构建与检验[J]. 财会月刊, 2023, 44(24): 54-61.
|
|
[4]
|
马秀颖, 郝立宁, 木仁, 崔巍. 改进判别分析模型及其在上市公司财务风险预警问题中的应用[J]. 数理统计与管理, 2024(10): 1-25.
|
|
[5]
|
王德青, 薛守聪, 芦智昊, 郭梦霞, 侯伊雯. 基于函数型Logistic模型的企业财务困境预警研究[J]. 运筹与管理, 2025(1): 1-8.
|