基于混合核函数SVM修正MIDAS模型的房地产景气指数预测研究
Study on Forecast of Real Estate Prosperity Index Based on MIDAS Model Corrected by Mixed Kernel Function SVM
摘要: 房地产市场具有影响因素多样性、非线性波动等特点,随着相关数据信息可获得性的进一步提高,因素间存在的频率差、相互干扰等问题不可忽视。在以往研究的基础上,综合考虑时效性强的百度指数以及关联度高的房地产市场相关指标,构建MIDAS (混频数据处理模型)模型解决百度指数与房地产景气指数间频率差的问题,建立随机森林的封装筛选模型实现重要特征的选择,并基于混合核的SVR (支持向量机)对MIDAS预测结果进行进一步修正。实证表明,本文模型充分利用了混频数据信息,也符合房地产市场非线性波动的特点,在预测精度与误差波动性上都得到了一定提升。
Abstract: The real estate market has the characteristics of multi influence factors and nonlinear fluctuations. With the further increase of the availability of relevant data information, the frequency difference and mutual interference between the factors cannot be ignored. Based on the existing research, we considered both the Baidu index with strong timeliness and related indicators of the real estate market with high relevance; we built the MIDAS model to solve the problem of frequency difference between Baidu index and real estate prosperity index, and established an optional packing and screening model random forests for the selection of features. Finally, based on the hybrid kernel SVR, we further modified the MIDAS prediction results. The empirical results showed that the model made full use of the mixed data information and also met the characteristics of the nonlinear fluctuations in the real estate market. The proposed model improved both the accuracy of forecasting and the volatility of the error.
文章引用:向灿. 基于混合核函数SVM修正MIDAS模型的房地产景气指数预测研究[J]. 应用数学进展, 2018, 7(12): 1554-1564. https://doi.org/10.12677/AAM.2018.712182

参考文献

[1] 章伟. 粗糙集BP神经网络在房地产价格预测中的应用[J]. 计算机仿真, 2011, 28(7): 365-368.
[2] 刘彩云, 姚俭. 基于多因素影响的房地产价格预测模型[J]. 统计与决策, 2017(17): 33-38.
[3] Del Giudice, V.D., Paola, P.D. and Forte, F. (2017) Using Genetic Algorithms for Real Estate Appraisals. Buildings, 2.
[4] Bae, S.W., et al. (2018) Pre-dicting the Real Estate Price Index Using Machine Learning Methods and Time Series Analysis Model. Housing Studies, 26, 107-133.
[5] Ginsberg, J., Mohebbi, M.H., Patel, R.S., et al. (2009) Detecting Influenza Epidemics Using Search Engine Query Data. Nature, 457, 1012-1014. [Google Scholar] [CrossRef] [PubMed]
[6] Askitas, N. and Zimmermann, K.F. (2009) Google Econometrics and Unemployment Forecasting. Discussion Papers of Diw Berlin, 55, 107-120.
[7] 董倩, 孙娜娜, 李伟. 基于网络搜索数据的房地产价格预测[J]. 统计研究, 2014, 31(10): 81-88.
[8] 孟雪井, 孟祥兰, 胡杨洋. 基于文本挖掘和百度指数的投资者情绪指数研究[J]. 宏观经济研究, 2016(1): 144-153.
[9] Ghysels, E., Santa-Clara, P. and Valkanov, R. (2004) The MIDAS Touch: Mixed Data Sampling Re-gressions. Cirano Working Papers, 5, 512-517.
[10] Pan, Y., Xiao, Z., Wang, X., et al. (2017) A Multiple Support Vector Machine Approach to Stock Index Forecasting with Mixed Frequency Sampling. Knowledge-Based Systems, 122, 90-102. [Google Scholar] [CrossRef
[11] 刘金全, 刘汉, 印重. 中国宏观经济混频数据模型应用——基于MIDAS模型的实证研究[J]. 经济科学, 2010, 32(5): 23-34.
[12] 李小琳, 孙玥, 刘洋. 基于SVM修正的模糊时间序列模型在沪指预测中的应用[J]. 中国科学技术大学学报, 2016(3): 238-246.
[13] 刘汉, 刘金全. 中国宏观经济总量的实时预报与短期预测——基于混频数据预测模型的实证研究[J]. 经济研究, 2011(3): 4-17.
[14] 蒋翠清, 王睿雅, 丁勇. 融入软信息的P2P网络借贷违约预测方法[J]. 中国管理科学, 2017(11): 12-21.
[15] 孙菲艳, 田雨波, 任作琳. 采用混合核支持向量机的DOA估计[J]. 电讯技术, 2016, 56(3): 302-307.