基于RGBI色彩空间的香蕉图像分割研究
Study on Banana Image Segmentation Based on RGBI Color Space
DOI: 10.12677/airr.2025.145111, PDF,   
作者: 幸南霖:重庆三峡学院计算机科学与工程学院,重庆
关键词: 图像分割RGBI色彩空间自适应阈值Image Segmentation RGBI Color Space Adaptive Threshold
摘要: 香蕉是我国华南地区主要经济作物,采后品质分级与机器人采摘均需快速、准确地分割果实。传统RGB空间易受光照、阴影及叶片遮挡干扰,分割精度受限。为此,本文提出基于RGBI四维色彩空间的香蕉图像分割方法:在设备原生RGB基础上引入亮度分量I = (R + G + B)/3,实现颜色–亮度联合表征;通过自适应阈值与形态学后处理,在保持实时性的同时显著提升抗光照鲁棒性。试验以480幅温室及田间自然光图像为对象,对比RGB-Otsu、HSV-Otsu、Lab-Kmeans等方法,RGBI-Otsu的Dice系数达0.915,mIoU达0.878,单幅处理时间19.7 ms,优于对照组。进一步将RGBI特征嵌入轻量化UNet,在Jetson Nano边缘端可达18 FPS,满足采摘机器人实时需求。研究表明,RGBI空间在不增加硬件成本的前提下,为香蕉及其他果蔬的精准分割提供了经济高效的新途径。
Abstract: Banana is the main cash crop in South China. After harvesting quality classification and robot picking, the fruit needs to be quickly and accurately divided. The traditional RGB space is easily disturbed by light, shadow and blade occlusion, and the segmentation accuracy is limited. To this end, this paper proposes a banana image segmentation method based on RGBI four-dimensional color space: the brightness component I = (R + G + B)/3 is introduced on the basis of the device’s native RGB to realize the joint color-brightness characterization; through adaptive threshold and morphological post-processing, the anti-light robustness is significantly improved while maintaining real-time. The test took 480 greenhouse and field natural light images as the object. Compared with RGB-Otsu, HSV-Otsu, Lab-Kmeans and other methods, the Dice coefficient of RGBI-Otsu reached 0.915, mIoU reached 0.878, and the single processing time was 19.7 ms, which was better than the control group. The RGBI feature is further embedded in the lightweight UNet, which can reach 18 FPS at the edge of Jetson Nano to meet the real-time needs of picking robots. Research shows that RGBI space provides a new economical and efficient way for the accurate division of bananas and other fruits and vegetables without increasing hardware costs.
文章引用:幸南霖. 基于RGBI色彩空间的香蕉图像分割研究[J]. 人工智能与机器人研究, 2025, 14(5): 1177-1185. https://doi.org/10.12677/airr.2025.145111

参考文献

[1] 国家统计局. 中国农村统计年鉴2024 [M]. 北京: 中国统计出版社, 2024.
[2] 邹湘军, 彭红星, 郭艾侠, 等. 基于机器视觉的果实检测与定位研究进展[J]. 农业机械学报, 2023, 54(2): 1-15.
[3] 王飞跃, 张大兵. 香蕉成熟度计算机视觉检测方法研究[J]. 农业工程学报, 2022, 38(7): 187-194.
[4] 刘成良, 张漫. 多光谱成像在果蔬品质检测中的应用[J]. 光谱学与光谱分析, 2021, 41(5): 1352-1358.
[5] 丁震, 胡钟山, 杨静宇, 等. 一种基于模糊聚类的快速二值化方法[J]. 计算机学报, 1998, 21(S1): 331-334.
[6] Gonzalez, R.C. and Woods, R.E. (2017) Digital Image Processing. 4th Edition, Pearson.
[7] Otsu, N. (1979) A Threshold Selection Method from Gray-Level Histograms. IEEE Transactions on Systems, Man, and Cybernetics, 9, 62-66. [Google Scholar] [CrossRef
[8] He, K., Zhang, X., Ren, S. and Sun, J. (2017) Deep Residual Learning for Image Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 1251-1265.
[9] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-net: Convolutional Networks for Biomedical Image Segmentation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer International Publishing, 234-241. [Google Scholar] [CrossRef
[10] Goodfellow, I., Bengio, Y. and Courville, A. (2016) Deep Learning. MIT Press.
[11] Zhang, Y. and Chen, K. (2018) A Survey on Image Segmentation Using Deep Learning. IEEE Access, 6, 48685-48696.
[12] Liu, W., Rabinovich, A. and Berg, A.C. (2016) SSD: Single Shot MultiBox Detector. In: European Conference on Computer Vision, Springer, 21-37.
[13] Chen, L., Papandreou, G., Kokkinos, I., Murphy, K. and Yuille, A.L. (2018) Deeplab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40, 834-848. [Google Scholar] [CrossRef] [PubMed]
[14] Wang, X. and Gupta, A. (2018) Videos as Space-Time Region Proposals. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-22 June 2018, 1626-1635.
[15] Zhang, J. and Liu, Y. (2019) Real-Time Semantic Segmentation for Autonomous Driving. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, 15-20 June 2019, 1065-1074.