基于数据优化的保险客户承保预测

doi:10.12677/SA.2019.85089

期刊菜单

基于数据优化的保险客户承保预测
Insurance Customer Purchase Prediction Based on Data Optimization

DOI: 10.12677/SA.2019.85089, PDF, 科研立项经费支持
作者: 李莎莎：对外经济贸易大学，北京
关键词: 保险；逻辑回归；决策树；随机森林；组合模型；Insurance； Logistic Regression； Decision Tree； Random Forest； Combination Model

摘要: 近年来，人民生活水平的普遍提高，使得保险行业迎来了新的春天。一直以来的粗放式经营模式已经无法满足保险公司日益发展的要求。如何摆脱传统的营销方式，快速发掘出有价值的客户，在市场中不远远落后，对于保险公司来说越来越重要。本文使用某人寿保险公司的客户数据。首先，基于给定的客户基本信息、通话信息、投保信息、赠险信息等进行描述性统计分析，查看数据情况，对数据进行数据清洗提升数据质量；其次，使用单独的逻辑回归模型进行学习，生成可行性分析报告；然后，分别使用决策树与逻辑回归的组合模型以及随机森林与逻辑回归的组合模型进行预测；最后，将三种模型进行对比发现随机森林与逻辑回归的组合模型效果更好。

Abstract: In recent years, with the general improvement of people's living standards, the insurance industry ushered in a new spring. The extensive business model has been unable to meet the requirements of the increasing development of insurance companies. How to get rid of the traditional way of mar-keting, quickly discover valuable customers and keep up with the market, is becoming more and more important for insurance companies. This article uses customer data from a life insurance company. Firstly, descriptive statistical analysis was conducted based on the given basic information of customers, call information, insurance information and risk donation information, etc., to view the data situation, and data cleaning was carried out to improve the data quality. Secondly, a separate logistic regression model is used for learning to generate a feasibility analysis report. Then, the combined model of decision tree and logistic regression and the combined model of random forest and logistic regression were respectively used for prediction. Finally, a comparison of the three models shows that the combined model of random forest and logistic regression is more effective.

文章引用：李莎莎. 基于数据优化的保险客户承保预测[J]. 统计学与应用, 2019, 8(5): 784-796. https://doi.org/10.12677/SA.2019.85089

参考文献

[1]	苗东. 大都会保险公司客户关系管理研究[D]: [硕士学位论文]. 上海: 华东理工大学, 2013.
[2]	卞爱军. 基于信息化平台的寿险客户细分管理研究——以扬州寿险公司为例[D]: [硕士学位论文]. 南京: 南京理工大学, 2008.
[3]	柯新喜. 基于决策树模型的社会保险客户分类研究[J]. 福建电脑, 2016, 32(6): 105-107.
[4]	王贵龙. 基于关联向量机的保险客户识别研究[D]: [硕士学位论文]. 西安: 西安工业大学, 2011.
[5]	赵萍. 数据挖掘在寿险客户关系管理中的应用[D]: [硕士学位论文]. 天津: 天津大学, 2007.
[6]	董娜, 常建芳, 吴爱国. 基于贝叶斯模型组合的随机森林预测方法[J]. 湖南大学学报(自然科学版), 2019, 46(2): 123-130.
[7]	苏杭西子. 基于随机森林模型的个人信用风险评估研究[D]: [硕士学位论文]. 长沙: 湖南大学, 2018.
[8]	李航. 统计学习[M]. 北京: 清华大学出版社, 2012:77-79.
[9]	邴欣. 机器学习在推荐系统中的应用[D]: [硕士学位论文]. 济南: 山东大学, 2016.
[10]	钱超. 基于特征优化的逻辑回归模型在广告点击率问题中的应用研究[D]: [硕士学位论文]. 武汉: 华中师范大学, 2018.
[11]	宋天龙. Python数据分析与数据化运营[M]. 北京: 机械工业出版社, 2017: 99-102.
[12]	刘晨晨. 基于数据挖掘的通信客户流失预警模型研究[D]: [硕士学位论文]. 武汉: 华中师范大学, 2017.
[13]	王文敬. 基于SMOTE过抽样法的个人信用评分模型研究[D]: [硕士学位论文]. 上海: 上海师范大学, 2019.

为你推荐

友情链接