基于光学神经网络的语义分割
Semantic Segmentation Based on Optical Neural Networkn
DOI: 10.12677/mos.2024.133350, PDF,   
作者: 葛泽宇, 张雨超*:上海理工大学光子芯片研究院,上海;上海理工大学人工智能纳米光子学中心光电信息与计算机工程学院,上海
关键词: 光学神经网络语义分割光学技术Optical Neural Network Semantic Segmentation Optical Technology
摘要: 语义分割是计算机视觉领域的关键挑战之一。目前,语义分割主要是基于卷积神经网络(Convolutional neural network, CNN)的各种深度学习算法来实现。然而,受限于传统的冯诺依曼架构,随着深度学习算法复杂度的不断增加,基于电子计算机的神经网络面临着算力,数据吞吐率和能耗的诸多限制,由于加工工艺已经接近了后摩尔时代的极限,进一步提升性能变得非常困难。本文提出了一种基于光学神经网络(Optical neural network, ONN)的图像语义分割方法。光学技术具有信息计算传输速度快、信息携带能力强、抗干扰性好等特点,在光神经网络中,矩阵乘法以光速进行并行计算,并且仅需极低的能量消耗,有效解决人工神经网络中面临的复杂的矩阵乘法问题。此外,基于存算一体架构,光神经网络的调制层既可以存储学习的参数,又参与了光学计算,从而避免了计算机面临的数据读取瓶颈。我们所设计的光学神经网络在Portrait 2000数据集上进行训练和测试,并在Person Correlation Coefficient (PCC)指标中达到0.8,有着良好的语义分割的效果。
Abstract: Semantic segmentation is one of the key challenges in the field of computer vision. At present, various deep learning algorithms based on convolutional neural network (CNN) are employed in this task. However, owing to the serial nature of von Neumann architectures, the electronic hardware platforms now confront significant challenges of computing speed, data throughput, and energy consumption, and the performance growth has become unsustainable due to the manufacturing process of electronic transistors approaches its physical limit. In this paper, an image semantic segmentation method based on optical neural network (ONN) is proposed. ONN has the characteristics of fast computing speed, low computing latency, and strong information carrying capacity. For the optical neural networks, matrix multiplication can be parallelly calculated at the speed of light with more energy-efficient, which effectively solves the complex matrix multiplication problem in artificial neural networks. Furthermore, as a computing in memory architecture, the modulation layers of ONN not only store the learned parameters but also participate in the optical calculation. The network is trained and tested on the Portrait 2000 dataset and achieved 0.8 in the Person Correlation Coefficient (PCC) index, showing good segmentation effect.
文章引用:葛泽宇, 张雨超. 基于光学神经网络的语义分割[J]. 建模与仿真, 2024, 13(3): 3842-3850. https://doi.org/10.12677/mos.2024.133350

参考文献

[1] 高荣芳, 吉春旭, 程国建. 从浅层网络到深度网络的图像分类研究综述[J]. 计算机与数字工程, 2016, 44(10): 2020-2024.
[2] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, 5-9 October 2015, 234-241. [Google Scholar] [CrossRef
[3] Li, Y., Liu, Y. and Sun, Q. (2021) Real-Time Semantic Segmentation via Region and Pixel Context Network. Proceedings of the 2020 25th International Conference on Pattern Recognition (ICPR), Milan, 10-15 January 2021, 7043-7049. [Google Scholar] [CrossRef
[4] Sourav, G., Niko, S., Feras, D., et al. (2020) Semantics for Robotic Mapping, Perception and Interaction: A Survey. [Google Scholar] [CrossRef
[5] Adam, J.M., Liu, W., Zang, Y., et al. (2023) Deep Learning-Based Semantic Segmentation of Urban-Scale 3D Meshes in Remote Sensing: A Survey. International Journal of Applied Earth Observation and Geoinformation, 121, Article ID: 103365. [Google Scholar] [CrossRef
[6] Shen, Y., Harris, N.C., Skirlo, S., et al. (2017) Deep Learning with Coherent Nanophotonic Circuits. Nature Photonics, 11, 441-446. [Google Scholar] [CrossRef
[7] Xu, X., Tan, M., Corcoran, B., et al. (2021) 11 TOPS Photonic Convolutional Accelerator for Optical Neural Networks. Nature, 589, 44-51. [Google Scholar] [CrossRef] [PubMed]
[8] Lin, X., Rivenson, Y., Yardimci, N.T., et al. (2018) All-Optical Machine Learning Using Diffractive Deep Neural Networks. Science, 361, 1004-1008. [Google Scholar] [CrossRef] [PubMed]
[9] 刘硕. 阈值分割技术发展现状综述[J]. 科技创新与应用, 2020(24): 129-130.
[10] 李改梅, 杨润玲, 周军妮. 基于二维直方图与FCM相结合的图像快速分割方法[J]. 现代电子技术, 2007, 30(15): 176-178.
[11] 陈方昕. 基于区域生长法的图像分割技术[J]. 科技信息, 2008(15): 58-59.
[12] 李飞, 薛彬, 黄亚楼. 初始中心优化的K-Means聚类算法[J]. 计算机科学, 2002, 29(7): 94-96.
[13] 赵梦琦. 基于分水岭算法的图像分割算法研究[D]: [硕士学位论文]. 西安: 西安邮电大学, 2020.
[14] Shelhamer, E., Long, J. and Darrell, T. (2017) Fully Convolutional Networks for Semantic Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 640-651. [Google Scholar] [CrossRef
[15] Zhou, T., Fang, L., Yan, T., et al. (2020) In situ Optical Backpropagation Training of Diffractive Optical Neural Networks. Photonics Research, 8, Article 940. [Google Scholar] [CrossRef
[16] Krizhevsky, A., Sutskever, I. and Hinton, G. (2012) ImageNet Classification with Deep Convolutional Neural Networks. Communications of the ACM, 60, 84-90. [Google Scholar] [CrossRef
[17] Shen, X., Tao, X., Gao, H., et al. (2016) Deep Automatic Portrait Matting. Computer VisionECCV 2016, Amsterdam, 11-14 October 2016, 92-107. [Google Scholar] [CrossRef
[18] Chen, H., Feng, J., Jiang, M., et al. (2021) Diffractive Deep Neural Networks at Visible Wavelengths. Engineering, 7, 1483-1491. [Google Scholar] [CrossRef