基于GBDT模型的贵州省GDP空间化研究
Spatial Analysis of Guizhou Province’s GDP Based on the GBDT Model
DOI: 10.12677/pm.2024.146245, PDF,    科研立项经费支持
作者: 王 耀, 杜 宁*, 王 莉, 张显云, 熊英杰:贵州大学矿业学院,贵州 贵阳
关键词: GDP空间化机器学习GBDT模型Spatialization of GDP Machine Learning GBDT Model
摘要: 传统的GDP统计数据仅能反映地区经济发展的总体水平,而GDP空间化可以反映出经济活动的空间特征。本文以贵州省为研究对象,用2000~2022年的长时间序列多源数据集作为模型训练数据,选用传统多元线性回归模型(MLR)与5种机器学习模型:DT、RF、AdaBoost、XGBoost和GBDT模型进行对比分析,结果表明:机器学习模型拟合结果精度和交叉验证精度均优于传统多元线性回归模型,表明在GDP与多元变量之间存在复杂关系时,线性回归模型往往具有局限性,机器学习模型通过不断地迭代计算,能够更好地处理非线性关系,从而提高模型的预测性能。其中以GBDT模型误差最小(拟合结果R2为0.90、MAE和RMSE分别为51.25亿元和76.32亿元;交叉验证R2为0.98、MAE和RMSE分别为0.04亿元和0.13亿元),相较于其他模型,该模型表现出最佳的拟合能力,模型的稳定性最高。贵州省经济空间分布特征主要是以城市为中心的经济发展圈层结构,结合现状分析了成因并提出了规划、建设及财政等方面建议。
Abstract: Traditional GDP statistics only reflect the overall level of regional economic development, while spatialization of GDP can unveil the spatial characteristics of economic activities. This paper takes Guizhou Province as the research object, utilizing a long-term time series multi-source dataset from 2000 to 2022 as the model training data. It compares the traditional multiple linear regression model (MLR) with five machine learning models: DT, RF, AdaBoost, XGBoost, and GBDT models. The results indicate that machine learning models exhibit superior fitting accuracy and cross-validation precision compared to the traditional multiple linear regression model. This suggests that linear regression models often have limitations when dealing with complex relationships between GDP and multiple variables. Machine learning models, through iterative computation, can better handle nonlinear relationships, thereby enhancing the predictive performance of the model. Among these, the GBDT model demonstrates the smallest error (with a fitting result R2 of 0.90, MAE and RMSE of 51.25 billion yuan and 76.32 billion yuan respectively; cross-validation R2 of 0.98, MAE and RMSE of 0.04 billion yuan and 0.13 billion yuan respectively), exhibiting the best fitting capability and highest stability compared to other models. The spatial distribution characteristics of Guizhou Province’s economy primarily manifest as an urban-centric economic development concentric structure. Combining current analysis, this paper elucidates the causes and proposes suggestions in terms of planning, construction, and finance.
文章引用:王耀, 杜宁, 王莉, 张显云, 熊英杰. 基于GBDT模型的贵州省GDP空间化研究[J]. 理论数学, 2024, 14(6): 242-254. https://doi.org/10.12677/pm.2024.146245

参考文献

[1] 涂雯. 贵州省区域经济发展差异性与差异化发展研究[J]. 商业经济, 2024(3): 41-43.
[2] 徐宗学, 唐清竹, 陈浩, 等. 基于精细化空间格局的城市承灾体脆弱性评估[J]. 水科学进展, 2024, 35(1): 38-47.
[3] 唐小辉, 蔡中祥, 刘宏建, 等. 基于NPP-VIIRS夜间灯光数据的产业结构估测——以河南省为例[J]. 河南大学学报(自然科学版), 2023, 53(3): 305-313.
[4] 闫梦川. 基于夜间灯光数据的长江三角洲地区GDP空间化分析[D]: [硕士学位论文]. 大连: 辽宁师范大学, 2022.
[5] 谢甫, 孙建国, 于明雪, 等. 基于珞珈一号和随机森林的兰州市GDP空间化[J]. 遥感信息, 2022, 37(2): 53-59.
[6] 王平云, 王晓艳, 相妮. 基于夜间灯光数据的山东省GDP预测及空间化[J]. 城市勘测, 2022(1): 20-23.
[7] 尹丽. 基于NPP/VIIRS灯光数据的贵州省GDP空间化模型研究[J]. 信阳师范学院学报(自然科学版), 2022, 35(1): 79-84.
[8] 魏凯艳, 孙九林, 张仲伍, 等. 基于NPP-VIIRS夜间灯光数据的山西省GDP空间化模拟[J]. 浙江大学学报(理学版), 2021, 48(6): 735-740, 749.
[9] 王俊华, 张廷斌, 易桂花, 等. DMSP/OLS夜间灯光数据的四川省GDP空间化分析[J]. 测绘科学, 2019, 44(8): 50-60.
[10] Li, C., Chen, G., Luo, J., et al. (2021) Port Economics Comprehensive Scores for Major Cities in the Yangtze Valley, China Using the DMSP-OLS Night-Time Light Imagery. In: Elvidge, C., Li, X., zhou, Y.Y., Cao, C. and Warner, T.A., Eds., Remote Sensing of Night-Time Light, Routledge, 153-175.
[11] Gu, Y., Shao, Z., Huang, X. and Cai, B. (2022) GDP Forecasting Model for China’s Provinces Using Nighttime Light Remote Sensing Data. Remote Sensing, 14, Article 3671. [Google Scholar] [CrossRef
[12] Han, X.D., Zhou, Y., Wang, S.X., et al. (2013) GDP Spatialization in China Based on DMSP/OLS Data and Land Use Data. Remote Sensing Technology and Application, 27, 396-405.
[13] Chen, Q., Hou, X., Zhang, X. and Ma, C. (2016) Improved GDP Spatialization Approach by Combining Land-Use Data and Night-Time Light Data: A Case Study in China’s Continental Coastal Area. International Journal of Remote Sensing, 37, 4610-4622. [Google Scholar] [CrossRef
[14] Ji, X., Li, X., He, Y. and Liu, X. (2019) A Simple Method to Improve Estimates of County-Level Economics in China Using Nighttime Light Data and GDP Growth Rate. ISPRS International Journal of Geo-Information, 8, 419. [Google Scholar] [CrossRef
[15] Doll, C.H., Muller, J. and Elvidge, C.D. (2000) Night-Time Imagery as a Tool for Global Mapping of Socioeconomic Parameters and Greenhouse Gas Emissions. AMBIO: A Journal of the Human Environment, 29, 157-162. [Google Scholar] [CrossRef