基于LightGBM算法的区域成矿预测—以夏河–合作为例
Region Mineralization Prediction Based on the LightGBM Algorithm—A Case Study of Xiahe-Hezuo
DOI: 10.12677/pm.2024.146250, PDF,   
作者: 高雅欣:成都理工大学,数学地质四川省重点实验室,四川 成都
关键词: 机器学习成矿定量预测多源地学数据Machine Learning Quantitative Mineral Prediction Multi-Source Geological Data
摘要: 本研究旨在利用LightGBM算法对夏河–合作地区进行矿产定量预测。LightGBM是一种集成学习方法,是随机森林的一种变体,在构建决策树时采用了随机划分。首先,我们收集了夏河–合作地区的相关地质和地球化学数据。然后,我们对数据进行预处理和特征工程,包括缺失值填充、数据标准化等。接下来,我们构建了LightGBM模型,并对模型进行了训练和调优。最后,我们使用训练好的模型对夏河–合作地区的矿产进行了定量预测,并评估了预测结果的准确性和可靠性。本研究的结果表明,LightGBM算法在夏河–合作地区矿产定量预测中具有较高的效果和潜力,为地质矿产勘查提供了一种有效的预测方法。
Abstract: This study aims to utilize the LightGBM algorithm for quantitative mineral prediction in the Xiahe-Hezuo region. LightGBM, an ensemble learning method and a variant of the random forest, uses random partitioning in decision tree construction. First, we collected relevant geological and geochemical data from the Xiahe-Hezuo region. Then, we performed data preprocessing and feature engineering, including missing value imputation and data normalization. Next, we constructed a LightGBM model and conducted training and tuning. Finally, we used the trained model to make quantitative predictions of mineral resources in the Xiahe-Hezuo region and evaluated the accuracy and reliability of the predictions. The results of this study indicate that the LightGBM algorithm has high effectiveness and potential in quantitative mineral prediction in the Xiahe-Hezuo region, providing an effective prediction method for geological mineral exploration.
文章引用:高雅欣. 基于LightGBM算法的区域成矿预测—以夏河–合作为例[J]. 理论数学, 2024, 14(6): 300-315. https://doi.org/10.12677/pm.2024.146250

参考文献

[1] 赵忠海, 陈俊, 乔锴, 等. 基于分形理论的遥感蚀变信息和构造分析研究: 以黑龙江多宝山地区为例[J]. 现代地质, 2023, 37(1): 153-163.
[2] 郭为民. 浅谈地球探测技术的几种方法与应用[J]. 中文科技期刊数据库(全文版)自然科学, 2022(8): 76-78.
[3] 叶成名. 基于高光谱遥感的青藏高原岩矿信息提取方法与应用研究[D]: [博士学位论文]. 成都: 成都理工大学, 2011.
[4] 李程. 深部地质地球化学三维定量矿产预测方法研究[D]: [博士学位论文]. 成都: 成都理工大学, 2022.
[5] 李康宁, 贾儒雅, 李鸿睿, 等. 西秦岭甘肃夏河——合作地区与中酸性侵入岩有关的金铜多金属成矿系统及找矿预测[J]. 地质通报, 2020, 39(8): 1191-1203.
[6] 张帅, 肖克炎, 朱裕生. 甘肃夏河——合作一带成矿预测及预测方法比较[J]. 地质学刊, 2018, 42(3): 393-400.
[7] 毛景文. 西秦岭地区造山型与卡林型金矿床[J]. 矿物岩石地球化学通报, 2001(1): 11-13.
[8] 张帅. 甘肃省合作-美武地区综合信息找矿预测研究[D]: [博士学位论文]. 北京: 中国地质大学, 2021.
[9] 张继荣. 甘肃省夏河地区成矿预测及找矿靶区研究[D]: [硕士学位论文]. 西安: 长安大学, 2016.
[10] Fix, E. and Hodges, J.L. (1952) Discriminatory Analysis. Nonparametric Discrimination: Small Sample Performance. International Statistical Review, 57, 238-247.
[11] Fix, E. (1985) Discriminatory Analysis: Nonparametric Discrimination, Consistency Properties (Vol. 1). USAF School of Aviation Medicine.
[12] Lloyd, S. (1982) Least squares quantization in PCM. IEEE Transactions on Information Theory, 28, 129-137. [Google Scholar] [CrossRef
[13] Breiman, L. (2001) Random Forests. Machine Learning, 45, 5-32. [Google Scholar] [CrossRef
[14] Egozcue, J.J., Pawlowsky-Glahn, V., Mateu-Figueras, G. and Barcelo-Vidal, C. (2003) Isometric Logratio Transformations for Compositional Data Analysis. Mathematical Geology, 35, 279-300.
[15] Debeljak, M. and Džeroski, S. (2011) Decision Trees in Ecological Modelling. In: Jopp, F., Reuter, H. and Breckling, B., Eds., Modelling Complex Ecological Dynamics: An Introduction into Ecological Modelling for Students, Teachers & Scientists, Springer, 197-209. [Google Scholar] [CrossRef
[16] Charbuty, B. and Abdulazeez, A. (2021) Classification Based on Decision Tree Algorithm for Machine Learning. Journal of Applied Science and Technology Trends, 2, 20-28. [Google Scholar] [CrossRef
[17] Friedman, J.H. and Popescu, B.E. (2008) Predictive Learning via Rule Ensembles. The Annals of Applied Statistics, 2, 916-954. [Google Scholar] [CrossRef
[18] Friedman, J.H. (2002) Stochastic Gradient Boosting. Computational Statistics & Data Analysis, 38, 367-378. [Google Scholar] [CrossRef
[19] Zuo, F., Memmolo, A., Huang, G. and Pirozzoli, S. (2019) Direct Numerical Simulation of Conical Shock Wave-Turbulent Boundary Layer Interaction. Journal of Fluid Mechanics, 877, 167-195. [Google Scholar] [CrossRef
[20] Aitchison, J. (1983) Principal Component Analysis of Compositional Data. Biometrika, 70, 57-65. [Google Scholar] [CrossRef