基于QDA-RF-MLP的矿井突水水源判识模型
Water Inrush Source Identification Model Based on QDA-RF-MLP
摘要: 受采动破坏与断裂构造影响,深部煤矿突水水源的水化学特征难以解释,具有混合性和重叠性,导致传统单一模型识别精度受限。本文以安徽祁南煤矿为研究背景,提出一种基于二次判别分析(QDA)、随机森林(RF)与多层感知机(MLP)的堆叠集成水源判识模型。在系统解析矿井主要含水层水化学演化规律的基础上,利用SMOTE技术重构非平衡数据集,并引入水化学离子比率等高阶特征。通过五折交叉验证优选基学习器参数,构建双层异构融合模型。实验结果表明:该集成模型在独立测试集上的识别准确率达95.83%,F1分数达0.96,显著优于单一最优基模型RF (92.7%),有效克服了小样本条件下的过拟合问题,对复杂混合水源具有较强的鲁棒性。
Abstract: Under the influence of mining disturbances and geological fault structures, the hydrochemical signatures of water inrush sources in deep coal mines often exhibit complex nonlinear mixing and overlapping characteristics. Consequently, traditional single-model approaches frequently suffer from limited identification accuracy. Taking the Qinan Coal Mine in Anhui Province as a case study, this paper proposes a stacking ensemble learning model for water source identification, integrating Quadratic Discriminant Analysis (QDA), Random Forest (RF), and Multilayer Perceptron (MLP). First, the hydrochemical evolution mechanisms of the primary aquifers were systematically analyzed. Subsequently, the Synthetic Minority Over-sampling Technique (SMOTE) was applied to address dataset imbalance, and high-order features—specifically hydrochemical ion ratios, were introduced to enhance discriminative capability. A two-layer heterogeneous fusion model was then constructed, with base learner parameters optimized via five-fold cross-validation. Experimental results demonstrate that the proposed ensemble model achieved an accuracy of 95.83% and an F1-score of 0.96 on an independent test set. This performance significantly outperforms the best single base model (RF, 92.7%). The study confirms that the proposed method effectively mitigates overfitting risks associated with small sample sizes and exhibits strong robustness in identifying complex, mixed water sources.
文章引用:张晨亮, 姚彬, 许明镜, 薛俊辰. 基于QDA-RF-MLP的矿井突水水源判识模型[J]. 地球科学前沿, 2026, 16(3): 273-284. https://doi.org/10.12677/ag.2026.163026

参考文献

[1] 黎良杰, 钱鸣高, 李树刚. 断层突水机理分析[J]. 煤炭学报, 1996, 21(2): 119-123.
[2] 程磊, 孙洁. 2016-2022年我国煤矿事故统计与规律分析[J]. 煤炭工程, 2023, 55(11): 125-129.
[3] 崔中良, 黄保胜, 龙向前. 矿井突水水源判别研究现状及发展趋势[J]. 中国钨业, 2018, 33(4): 42-50.
[4] 靳德武. 煤矿水害防治中的综合水文地质分析方法[J]. 煤田地质与勘探, 1998, 26(2): 54-56.
[5] 刘文明, 桂和荣, 孙雪芳, 等. 潘谢矿区矿井突水水源的QLT法判别[J]. 中国煤炭, 2001, 27(5): 31-34.
[6] 宋宝文. 根据水化学成份预测矿井突水水源几例[J]. 煤田地质与勘探, 1981, 44(2): 65-69.
[7] 贺亮亮, 吕广罗, 胡安焱, 等. 基于水化学特征分析的矿井突水水源判别[J]. 中国煤炭地质, 2022, 34(6): 34-39.
[8] 桑向阳, 林云, 刘保民, 等. 基于水位、水温、突水量和水质的充水水源识别神经网络模型[J]. 河南理工大学学报(自然科学版), 2024, 43(5): 36-42.
[9] 潘国营, 王素娜, 孙小岩, 等. 同位素技术在判别矿井突水水源中的应用[J]. 矿业安全与环保, 2009, 36(1): 32-34, 90.
[10] 宋立兵, 董东林, 王振荣, 等. 基于多方法耦合的矿井突水水源识别模型研究[J]. 煤炭工程, 2022, 54(2): 140-146.
[11] Yan, B., Ren, F., Cai, M. and Qiao, C. (2020) Bayesian Model Based on Markov Chain Monte Carlo for Identifying Mine Water Sources in Submarine Gold Mining. Journal of Cleaner Production, 253, Article ID: 120008. [Google Scholar] [CrossRef
[12] 申雄, 栗继祖, 赵德康. 基于PCA的Fisher多元统计方法识别矿井充水水源[J]. 矿业安全与环保, 2024, 51(3): 144-152.
[13] 黄平华, 陈建生. 基于多元统计分析的矿井突水水源Fisher识别及混合模型[J]. 煤炭学报, 2011, 36(S1): 131-136.
[14] 于小鸽, 裴富华, 刘燚菲, 等. 贝叶斯判别和Fisher判别在矿井突水水源判别中的应用比较[J]. 中国科技论文, 2022, 17(1): 9-14.
[15] 潘军. 基于Piper-层次聚类-灰色关联度的突水水源识别[J]. 煤炭科学技术, 2024, 52(S1): 221-227.
[16] 马雷, 钱家忠, 赵卫东. 基于GIS和水质水温的矿井突水水源快速判别[J]. 煤田地质与勘探, 2014, 42(2): 49-53.
[17] 徐星, 王公忠. BP神经网络在矿井突水水源识别中的应用[J]. 煤炭技术, 2016, 35(7): 144-146.
[18] 刘伟韬, 李蓓蓓, 杜衍辉, 等. 基于改进的SSA-BP神经网络的矿井突水水源识别模型研究[J]. 工矿自动化, 2024, 50(2): 98-105.
[19] 于小鸽, 刘燚菲, 翟培合. 基于PCA-AWOA-ELM模型的矿井突水水源识别[J]. 煤炭科学技术, 2023, 51(3): 182-189.
[20] 孙文洁, 杨恒, 李祥, 等. PCA与ELM模型相结合的矿井突水水源快速识别方法研究[J]. 煤炭工程, 2020, 52(1): 111-115.
[21] 黄敏, 毛岸, 路世昌, 等. 矿井突水水源识别的主成分分析-混沌麻雀搜索-RF模型[J]. 安全与环境学报, 2023, 23(8): 2607-2614.
[22] 肖观红, 鲁海峰. 基于PCA-GA-RF的矿井突水水源快速识别模型[J]. 煤矿安全, 2024, 55(6): 184-191.
[23] 邵良杉, 李相辰. 基于MIV-PSO-SVM模型的矿井突水水源识别[J]. 煤炭科学技术, 2018, 46(8): 183-190.
[24] Wei, Z., Dong, D., Ji, Y., Ding, J. and Yu, L. (2022) Source Discrimination of Mine Water Inrush Using Multiple Combinations of an Improved Support Vector Machine Model. Mine Water and the Environment, 41, 1106-1117. [Google Scholar] [CrossRef
[25] Ji, Y., Dong, D., Mei, A. and Wei, Z. (2022) Study on Key Technology of Identification of Mine Water Inrush Source by PSO-LightGBM. Water Supply, 22, 7416-7429. [Google Scholar] [CrossRef
[26] 季媛. 矿井突水水源快速识别关键技术研究[D]: [博士学位论文]. 徐州: 中国矿业大学, 2022.
[27] 董东林, 张陇强, 张恩雨, 等. 基于PSO-XGBoost的矿井突水水源快速判识模型[J]. 煤炭科学技术, 2023, 51(7): 72-82.
[28] Hinton, G.E. and Salakhutdinov, R.R. (2006) Reducing the Dimensionality of Data with Neural Networks. Science, 313, 504-507. [Google Scholar] [CrossRef] [PubMed]
[29] 徐继伟, 杨云. 集成学习方法: 研究综述[J]. 云南大学学报(自然科学版), 2018, 40(6): 1082-1092.
[30] 孙林华, 桂和荣. 皖北桃源矿深部含水层地下水地球化学数理统计分析[J]. 煤炭学报, 2013, 38(S2): 442-447.