基于SCAD方法的烘丝段水分控制参数筛选及应用
Selection and Application of Moisture Control Parameters in the Drying Section Based on the SCAD Method
DOI: 10.12677/sa.2025.143070, PDF,   
作者: 许 磊:云南贝叶思大数据技术服务有限公司,云南 昆明
关键词: 烘丝段水分控制变量选择模型比较SCAD方法Drying Section Moisture Control Parameter Selection Model Comparison SCAD Method
摘要: 为实现对制丝过程烘丝段出料水分的精准控制,科学、客观地识别并筛选影响水分的关键工艺参数。选取以某牌号卷烟的制丝过程全批次稳态数据为研究对象,在传统逐步回归方法的基础上,引入三种通过添加惩罚项来压缩变量系数的Lasso族方法,即:Lasso方法、适应性Lasso方法和SCAD方法,分别构建以烘丝段出料水分为因变量的变量选择模型,并采用AIC、BIC和MSE三种评价指标对模型进行比较,最后依据最优模型进行关键工艺参数的筛选及重要性排序。结果表明:① Lasso族方法在模型拟合优度、预测精度和复杂度控制方面均显著优于传统逐步回归方法,其中SCAD方法的综合性能表现最优;② 基于SCAD方法确定了烘丝段的4个关键工艺参数,按其重要性排序依次为:II区筒壁温度、膨胀单元蒸汽体积流量、切叶丝含水率和工艺气速度。
Abstract: To achieve precise control over the moisture content of the discharge in the drying section of the tobacco primary processing, it is essential to scientifically and objectively identify and screen the key process parameters affecting moisture. This study selects steady state data from the entire batch of a specific brand of cigarette production process as the research object. Building on the traditional stepwise regression method, three Lasso family methods—Lasso, Adaptive Lasso, and SCAD are introduced, which compress variable coefficients by adding penalty terms. Variable selection models are constructed with the moisture content of the drying section discharge as the dependent variable. The models are compared using three evaluation metrics: AIC, BIC, and MSE. Finally, the optimal model is used to screen and rank the importance of key process parameters. The results show that: ① The Lasso-family methods significantly outperform the traditional stepwise regression method in terms of model goodness-of-fit, prediction accuracy, and complexity control, with the SCAD method demonstrating the best overall performance; ② Based on the SCAD method, four key process parameters for the drying section are identified, ranked in order of importance as follows: Zone II wall temperature, expansion unit steam volumetric flow rate, cut tobacco moisture content, and process air velocity.
文章引用:许磊. 基于SCAD方法的烘丝段水分控制参数筛选及应用[J]. 统计学与应用, 2025, 14(3): 183-193. https://doi.org/10.12677/sa.2025.143070

参考文献

[1] 李艳, 卢游, 范翔, 等. 基于半参数回归模型的残差控制图[J]. 数理统计与管理, 2016, 35(4): 700-706.
[2] 钟文焱, 陈晓杜, 马庆文, 等. 基于多因素分析的烘丝机入口含水率预测模型的建立与应用[J]. 烟草科技, 2015, 48(5): 67-73.
[3] 谭超, 吴同, 覃鑫. 偏最小二乘组合后向区间选择在近红外定量建模中的应用[J]. 计算机与应用化学, 2008, 25(4): 509-512.
[4] 卫盼盼, 吴祚友, 安银立, 等. 烟叶物理特性与打叶风分工艺参数的关系[J]. 烟草科技, 2014(8): 5-9.
[5] 王明锋, 焦玉磊, 赵建华, 等. 部分烟气成分对卷烟感官舒适性的影响[J]. 中国烟草学报, 2014, 20(5): 12-18.
[6] 李晋明, 端凯, 汪海, 等. 卷烟香型关键致香物筛选与香型定量判定模式[J]. 烟草科技, 2015, 48(12): 66-71.
[7] Breiman, L. (1996) Heuristics of Instability and Stabilization in Model Selection. The Annals of Statistics, 24, 2350-2383. [Google Scholar] [CrossRef
[8] Fan, J. and Li, R. (2001) Variable Selection via Nonconcave Penalized Likelihood and Its Oracle Properties. Journal of the American Statistical Association, 96, 1348-1360. [Google Scholar] [CrossRef
[9] Tibshirani, R. (1996) Regression Shrinkage and Selection via the Lasso. Journal of the Royal Statistical Society Series B: Statistical Methodology, 58, 267-288. [Google Scholar] [CrossRef
[10] Efron, B., Hastie, T., Johnstone, I. and Tibshirani, R. (2004) Least Angle Regression. The Annals of Statistics, 32, 407-409. [Google Scholar] [CrossRef
[11] Zou, H. (2006) The Adaptive Lasso and Its Oracle Properties. Journal of the American Statistical Association, 101, 1418-1429. [Google Scholar] [CrossRef
[12] 吴喜之. 应用回归及分类——基于R [M]. 北京: 中国人民大学出版社, 2016.
[13] 谢佳春, 李兴绪. 明瑟收入函数模型变量选择方法比较研究[J]. 发展战略, 2016(1): 68-74.
[14] 梁斌, 陈敏, 缪柏其, 等. 基于LARS-Lasso的指数跟踪及其在股指期货套利策略中的应用[J]. 数理统计与管理, 2011, 30(6): 1104-1113.
[15] 吴喜之. 复杂数据统计方法——基于R的应用[M]. 北京: 中国人民大学出版社, 2012.
[16] Daudin, J.J. (1986) Selection of Variables in Mixed-Variable Discriminant Analysis. Biometrics, 42, 473-481. [Google Scholar] [CrossRef
[17] 刘立祥. 线性回归中自变量的选择与逐步回归方法[J]. 统计与决策, 2015(21): 80-82.