基于贝叶斯加性回归树的风险保费建模方法研究
Research on Risk Premium Modeling Methods Based on Bayesian Additive Regression Trees
摘要: 车险索赔数据普遍存在索赔发生不均衡、索赔金额重尾分布及风险特征高度非线性等问题,给风险保费建模带来挑战。本文提出一种融合机器学习与期望分位数(Expectile)保费原理的风险保费建模框架。该框架由“两阶段纯保费模型”和“贝叶斯期望分位数加性回归树(BEART)风险附加模型”两部分构成,分别用于刻画车险的出险概率和条件累计索赔额情况及索赔分布的尾部风险特征。实证结果表明,该框架在风险识别、尾部风险刻画及预测稳定性方面具有良好表现,可为复杂车险风险环境下风险保费厘定提供可靠的建模依据。
Abstract: Auto insurance claims data are commonly characterized by imbalanced claim occurrence, heavy-tailed claim severity distributions, and highly nonlinear risk features, posing substantial challenges for risk premium modeling. This paper proposes a risk premium modeling framework that integrates machine learning techniques with the Expectile premium principle. The framework consists of two components: a two-stage pure premium model and a Bayesian Expectile Additive Regression Tree (BEART)-based risk loading model, which are employed to characterize claim occurrence probability, conditional cumulative claim amounts, and tail risk features of the claims distribution, respectively. Empirical results demonstrate that the proposed framework exhibits strong performance in risk discrimination, tail risk characterization, and predictive stability, providing a reliable modeling basis for risk premium determination in complex motor insurance risk environments.
文章引用:王艺霖. 基于贝叶斯加性回归树的风险保费建模方法研究[J]. 统计学与应用, 2026, 15(3): 55-65. https://doi.org/10.12677/sa.2026.153055

参考文献

[1] 杨亮, 孟生旺. 基于分位回归的风险保费预测[J]. 统计与信息论坛, 2016, 31(9): 83-88.
[2] Heras, A., Moreno, I. and Vilar-Zanón, J.L. (2018) An Application of Two-Stage Quantile Regression to Insurance Ratemaking. Scandinavian Actuarial Journal, 2018, 753-769. [Google Scholar] [CrossRef
[3] Yang, L., Li, Z. and Meng, S. (2020) Risk Loadings in Classification Ratemaking. arXiv: 2002.01798.
[4] 蔡超, 黄聪聪, 董皓天. 分位数回归提升树模型及应用[J]. 系统科学与数学, 2022, 42(5): 1216-1233.
[5] Cai, C., Dong, H. and Wang, X. (2022) Expectile Regression Forest: A New Nonparametric Expectile Regression Model. Expert Systems, 40, e13087. [Google Scholar] [CrossRef
[6] Jiang, C., Jiang, M., Xu, Q. and Huang, X. (2017) Expectile Regression Neural Network Model with Applications. Neurocomputing, 247, 73-86. [Google Scholar] [CrossRef
[7] Chipman, H.A., George, E.I. and McCulloch, R.E. (2010) BART: Bayesian Additive Regression Trees. The Annals of Applied Statistics, 4, 266-298. [Google Scholar] [CrossRef
[8] 郭哲琦, 高苏浩. 基于两阶段Expectile回归的风险保费定价[J]. 统计与决策, 2024, 40(3): 162-167.
[9] Newey, W.K. and Powell, J.L. (1987) Asymmetric Least Squares Estimation and Testing. Econometrica, 55, 819-847. [Google Scholar] [CrossRef
[10] 曹桃云, 张日权. 非对称误差分布的贝叶斯累加回归树模型研究及应用[J]. 系统科学与数学, 2022, 42(11): 3119-3133.
[11] 金娇, 管思晴, 扈灵嫣, 等. Expectile回归模型的贝叶斯统计推断研究[J]. 统计学与应用, 2021, 10(5): 929-939.