基于高光谱成像技术的亚麻籽粒品种快速分类
Rapid Classification of Flax Seed Grain Varieties Based on Hyperspectral Imaging Technique
DOI: 10.12677/sea.2024.135071, PDF,    国家自然科学基金支持
作者: 李 怡*, 韩俊英#:甘肃农业大学信息科学技术学院,甘肃 兰州
关键词: 高光谱成像技术亚麻品种分类K近邻算法组合筛选Hyperspectral Imaging Technique Flax Classification of Varieties K-Nearest Neighbor Algorithm Combined Screening
摘要: 亚麻是世界上一种重要的经济作物和油料作物,快速筛选不同品种的亚麻籽对于亚麻育种以及农业种植具有重要意义。市场上亚麻籽品种繁多,如何快速、准确、高效地鉴别亚麻籽的品种,从而实现亚麻品种优育以及高产高收亟待解决。本文提出基于高光谱图像技术的K近邻(KNN)分类模型,以期实现不同品种亚麻籽的快速分类。本研究以甘肃省农业科学院提供的10个不同品种的亚麻籽作为本次试验的样本,每个品种随机选取50粒亚麻籽,利用高光谱成像系统采集870.07~1709.45 nm范围内的亚麻籽光谱图像。将采集到的图像黑白校正之后,以50粒亚麻籽为感兴趣区域,求取该区域内平均光谱作为原始光谱数据。由于原始光谱数据两端噪声较大,有效信息会受到噪声干扰,为了增强信噪比,本研究截取950~1680 nm范围内的亚麻籽的光谱波段作为有效波段进行分析。为避免数据在采集过程中受无关信息的干扰较强从而影响建模效果,因此对去噪后的光谱波段信息进行SG (Savitzky Golay)平滑预处理,并在SG平滑预处理的基础上分别进行最大归一化(MN)和二阶求导预处理(2ndDer)。数据预处理后分别采用竞争性自适应重加权算法(CARS)、连续投影算法(SPA)单一提取和CARS + SPA、CARS-SPA组合筛选方法提取特征波长,利用K近邻算法建立CARS-KNN、SPA-KNN、CARS + SPA-KNN、CARS-SPA-KNN这4种亚麻籽品种鉴别模型。实验结果表明:基于(SG-2ndDer)-CARS-KNN、(SG-2ndDer)-SPA-KNN、(SG-2ndDer)-CARS + SPA-KNN、(SG-2ndDer)-CARS-SPA-KNN这4种分类模型对亚麻籽的分类准确率最高可以达到100%。故利用近红外高光谱成像技术结合KNN算法对亚麻籽品种进行快速无损鉴别是优异和可靠的方案。
Abstract: Flax is an important economic and oil crop globally; rapid screening of different flax seed varieties is essential for flax breeding and agricultural cultivation. There are many flax seed varieties on the market; how to quickly, accurately, and efficiently identify the flax seed varieties to realize the superiority of flax varieties and high-yield yield needs to be resolved. This paper proposes the K-Nearest Neighbor (KNN) classification model based on the hyperspectral image technique to classify flax seed varieties quickly. In this study, ten different varieties of flax seeds provided by the Gansu Provincial Academy of Agricultural Sciences were used as the samples for this experiment, and 50 flax seeds of each variety were randomly selected. The spectral images of flax seeds in the range of 870.07~1709.45 nm were collected by using a hyperspectral imaging system. After the black-and-white correction of the acquired images, the 50 flax seeds were the region of interest, and the average spectrum in the region was the original spectral data. Due to the significant noise at both ends of the raw spectral data, the practical information will be interfered with by noise; in order to enhance the signal-to-noise ratio, this study intercepted the spectral bands of linseed in the range of 950~1680 nm as the influential bands for analysis. In order to avoid the data in the acquisition process by irrelevant information interference is more muscular, thus affecting the modeling effect, the de-noised spectral band information is pre-processed by SG (Savitzky Golay) smoothing, and based on SG smoothing, respectively, the Maximum Normalization (MN) and second-order derivation pre-processing (2ndDer). After data preprocessing, the feature wavelengths were extracted by Competitive Adaptive Re-weighting Algorithm (CARS), Successive Projection Algorithm (SPA) single extraction and CARS + SPA, CARS-SPA combined screening methods, respectively, and the K-Nearest Neighbor algorithm was utilized to establish the identification models for the four flax seed varieties, namely, CARS-KNN, SPA-KNN, CARS + SPA-KNN, and CARS-SPA-KNN. The experimental results show that the classification accuracy of flax seed based on the four classification models (SG-2ndDer)-CARS-KNN, (SG-2ndDer)-SPA-KNN, (SG-2ndDer)-CARS + SPA-KNN, and (SG-2ndDer)-CARS- SPA-KNN) can reach up to 100%. Therefore, using near-infrared hyperspectral imaging technology combined with the KNN algorithm for rapid and non-destructive identification of linseed varieties is an excellent and reliable program.
文章引用:李怡, 韩俊英. 基于高光谱成像技术的亚麻籽粒品种快速分类[J]. 软件工程与应用, 2024, 13(5): 685-703. https://doi.org/10.12677/sea.2024.135071

参考文献

[1] Mazza, G. and Biliaderis, C.G. (1989) Functional Properties of Flax Seed Mucilage. Journal of Food Science, 54, 1302-1305. [Google Scholar] [CrossRef
[2] Dorrell, D.G. (1970) Distribution of Fatty Acids within the Seed of Flax. Canadian Journal of Plant Science, 50, 71-75. [Google Scholar] [CrossRef
[3] Goyal, A., Sharma, V., Upadhyay, N., Gill, S. and Sihag, M. (2014) Flax and Flaxseed Oil: An Ancient Medicine & Modern Functional Food. Journal of Food Science and Technology, 51, 1633-1653. [Google Scholar] [CrossRef] [PubMed]
[4] Kaur, P., Waghmare, R., Kumar, V., Rasane, P., Kaur, S. and Gat, Y. (2018) Recent Advances in Utilization of Flaxseed as Potential Source for Value Addition. OCL, 25, Article No. A304. [Google Scholar] [CrossRef
[5] Zhao, L., Wang, L.M., Zhao, W., Dang, Z., Zhang, J.P. and Dang, Z. H, (2016) The Isolation of Crude Fat Acid Components in a RIL Population of Oil Flax and Their Correlation Analysis. The Crop Journal, 32, 33-37.
[6] Bhatty, R.S. and Cherdkiatgumchai, P. (1990) Compositional Analysis of Laboratory‐Prepared and Commercial Samples of Linseed Meal and of Hull Isolated from Flax. Journal of the American Oil ChemistsSociety, 67, 79-84. [Google Scholar] [CrossRef
[7] 王利华, 霍贵成. ω-3不饱和脂肪酸的生物学作用[J]. 东北农业大学学报, 2001, 32(1): 100-104.
[8] Suri, K., Singh, B., Kaur, A., Yadav, M.P. and Singh, N. (2020) Influence of Microwave Roasting on Chemical Composition, Oxidative Stability and Fatty Acid Composition of Flaxseed (Linum usitatissimum L.) Oil. Food Chemistry, 326, Article ID: 126974. [Google Scholar] [CrossRef] [PubMed]
[9] 张春利. 小麦抗穗发芽分子标记的发掘与验证[D]: [博士学位论文]. 北京: 中国农业科学院, 2008.
[10] 周全, 朱大洲, 王成, 等. 成像光谱技术在农产品/食品检测中的研究进展[J]. 食品科学, 2010, 31(23): 423-428.
[11] Zhu, S., Zhang, J., Chao, M., Xu, X., Song, P., Zhang, J., et al. (2019) A Rapid and Highly Efficient Method for the Identification of Soybean Seed Varieties: Hyperspectral Images Combined with Transfer Learning. Molecules, 25, Article 152. [Google Scholar] [CrossRef] [PubMed]
[12] Moola, W.S., Bijker, W., Belgiu, M. and Li, M. (2021) Vegetable Mapping Using Fuzzy Classification of Dynamic Time Warping Distances from Time Series of Sentinel-1A Images. International Journal of Applied Earth Observation and Geoinformation, 102, Article ID: 102405. [Google Scholar] [CrossRef
[13] 曹宇佳, 孙嘉迎, 孙微, 等. 基于珠海一号高光谱数据的农作物精细化分类研究[J]. 测绘与空间地理信息, 2023, 46(z1): 101-103, 106.
[14] 袁建清, 仇逊超, 贾银江, 等. 高光谱成像的水稻冠层穗颈瘟早期识别[J]. 西南大学学报(自然科学版), 2023, 45(6): 57-65.
[15] Zhang, J., Feng, X., Liu, X. and He, Y. (2018) Identification of Hybrid Okra Seeds Based on Near-Infrared Hyperspectral Imaging Technology. Applied Sciences, 8, Article 1793. [Google Scholar] [CrossRef
[16] 孙婷. 高粱品种的高光谱分类及多品种混合识别研究[D]: [硕士学位论文]. 自贡: 四川轻化工大学, 2021.
[17] Chang, C. and Lin, C. (2011) LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2, 1-27. [Google Scholar] [CrossRef
[18] 马惠玲, 王若琳, 蔡骋, 等. 基于高光谱成像的苹果品种快速鉴别[J]. 农业机械学报, 2017, 48(4): 305-312.
[19] 叶佳丽, 江海霞, 郭栋良, 等. 亚麻籽蛋白质、亚麻酸和木酚素含量近红外预测模型建立与应用[J]. 中国油料作物学报, 2021, 43(2): 353-360.
[20] 胡建. 基于FT-IR的鳜鱼鲜味强度检测方法研究[D]: [硕士学位论文]. 武汉: 华中农业大学, 2023.
[21] Li, H., Liang, Y., Xu, Q. and Cao, D. (2009) Key Wavelengths Screening Using Competitive Adaptive Reweighted Sampling Method for Multivariate Calibration. Analytica Chimica Acta, 648, 77-84. [Google Scholar] [CrossRef
[22] Araújo, M.C.U., Saldanha, T.C.B., Galvão, R.K.H., Yoneyama, T., Chame, H.C. and Visani, V. (2001) The Successive Projections Algorithm for Variable Selection in Spectroscopic Multicomponent Analysis. Chemometrics and Intelligent Laboratory Systems, 57, 65-73. [Google Scholar] [CrossRef