基于变量选择的红酒质量分析
Analysis of Red Wine Quality Based on Variable Selection
DOI: 10.12677/pm.2025.1510251, PDF,    科研立项经费支持
作者: 葛明霞, 董翠玲*:新疆师范大学数学科学学院,新疆 乌鲁木齐
关键词: 红酒质量变量选择岭回归LASSO弹性网Wine Quality Variable Selection Ridge Regression LASSO Elastic Net
摘要: 葡萄酒因其特殊的营养价值和较好的保健效果越来越受到消费者的欢迎,对葡萄酒质量的定量与定性分析是消费者关注的焦点。对Kaggle网站公开的红酒质量数据集开展了系统的研究,首先运用描述性统计分析方法,对数据集进行初步探索,直观呈现各变量的基本特征与分布规律;然后采用岭回归、LASSO、弹性网三种变量选择方法对该数据集建立多元线性模型,选出影响红酒质量的关键变量,该研究成果为红酒质量的精准检测提供了科学参考。
Abstract: Due to its special nutritional value and good health-care effects, wine is increasingly welcomed by consumers. The quantitative and qualitative analysis of wine quality has become the focus of consumers’ attention. A systematic study was carried out on the red wine quality dataset publicly available on the Kaggle website. Firstly, descriptive statistical analysis methods were used to conduct a preliminary exploration of the dataset, intuitively presenting the basic characteristics and distribution laws of each variable. Then, three variable selection methods, namely ridge regression, LASSO (Least Absolute Shrinkage and Selection Operator), and elastic net, were adopted to establish multiple linear models for the dataset, so as to select the key variables affecting red wine quality. The research results provide a scientific reference for the precise detection of red wine quality.
文章引用:葛明霞, 董翠玲. 基于变量选择的红酒质量分析[J]. 理论数学, 2025, 15(10): 82-93. https://doi.org/10.12677/pm.2025.1510251

参考文献

[1] 王柏. 基于数据挖掘技术的红酒评分预测模型的设计与分析[J]. 现代商贸工业, 2019, 40(7): 191-193.
[2] 王彬华. 基于葡萄酒质量评价体系实证研究[J]. 理论探索, 2024(4): 155-158.
[3] 王强, 汪丹丹. 基于多元线性回归的葡萄酒质量评价[J]. 渭南师范学院学报, 2013, 28(9): 126-130.
[4] 朱家明. 葡萄与葡萄酒质量的综合评价[J]. 通化师范学院学报, 2013, 35(3): 8-12.
[5] 朱家明. 基于多元统计分析法的葡萄酒品鉴模型[J]. 太原师范学院学报, 2013, 12(2): 8-10.
[6] 朱存斌, 朱家明, 陈岩. 葡萄酒质量的评价与分析[J]. 斯木佳大学学报, 2013, 31(3): 419-424.
[7] 陈欣. 葡萄酒的质量预测模型[J]. 西安文理学院学报, 2013, 16(2): 45-47.
[8] 程相, 陈家旭, 吴文鑫. 应用多元统计分析葡萄、葡萄酒理化指标与葡萄酒质量的相关性[J]. 中外葡萄与葡萄酒, 2013, 4(21): 43-47.
[9] 刘令, 熊奕达, 赵云龙. 影响葡萄酒质量的因子相关分析[J]. 吉林建筑工程学院学报, 2013, 30(5): 72-74.
[10] 董莹, 崔瑞雪. 基于因子分析的红葡萄酒质量评价[J]. 大连民族学院学报, 2014, 16(3): 284-288.
[11] 方壮, 向华艳, 周洪建. 多元非线性回归分析在葡萄酒质量评价中的应用[J]. 湖北民族学院学报2014, 32(4): 426-429.
[12] 刘兵兵, 宋帝. 应用多元回归分析进行葡萄酒质量评价建模[J]. 安庆师范学院学报, 2014, 20(4): 29-35.
[13] 裴文华. 基于机器学习的红酒质量分类研究[J]. 科技和产业, 2022, 22(12): 304-309.
[14] 刘婷. 葡萄酒质量的评价研究[J]. 科技和产业, 2013, 13(8): 114-120.
[15] Hoerl, A.E. and Kennard, R.W. (1970) Ridge Regression: Biased Estimation for Nonorthogonal Problems. Technometrics, 12, 55-67. [Google Scholar] [CrossRef
[16] Tibshirani, R. (1996) Regression Shrinkage and Selection via the Lasso. Journal of the Royal Statistical Society Series B: Statistical Methodology, 58, 267-288. [Google Scholar] [CrossRef
[17] Zou, H. and Hastie, T. (2005) Regularization and Variable Selection via the Elastic Net. Journal of the Royal Statistical Society Series B: Statistical Methodology, 67, 301-320. [Google Scholar] [CrossRef