基于动态Kolmogorov-Arnold网络与自适应非线性建模的医学图像分割
Medical Image Segmentation Based on Dynamic Kolmogorov-Arnold Network and Adaptive Nonlinear Modeling
DOI: 10.12677/csa.2026.163100, PDF,    科研立项经费支持
作者: 吴彦霏, 林 玲*:伊犁师范大学网络安全与信息技术学院,新疆 伊宁;敬长桦, 拓家凤:伊犁河谷智能计算研究与应用重点实验室,新疆 伊宁
关键词: 非线性适应单元U-KAN医学图像分割Nonlinear Adaptive Unit U-KAN Medical Image Segmentation
摘要: 针对医学图像分割中复杂非线性模式的精确捕获、模型可解释性以及传统固定非线性激活函数局限性的挑战,本文提出了一种新型的动态Kolmogorov-Arnold网络U-Net (Dynamic KAN UNet, DKAN-UNet)。该网络在经典的U-Net编码器–解码器架构基础上,引入了创新的动态KAN模块(DKAN Block),旨在实现特征依赖的自适应非线性建模。具体而言,DKAN Block包含一个轻量级的非线性适应单元(Non-linearity Adaptation Unit, NAU),该单元能够根据输入特征的局部特性,动态地调整KAN层中可学习激活函数的参数(如B-spline基函数的网格点或系数),从而为不同区域和语义信息提供定制化的非线性变换。此外,我们设计了多尺度动态KAN融合策略,在编码器和解码器中部署DKAN Block,并通过跳跃连接实现多尺度特征的自适应非线性交互。实验结果表明,DKAN-UNet在多个医学图像分割数据集上取得了显著优于现有U-KAN等方法的性能,同时显著增强了模型的非线性建模能力和可解释性。
Abstract: To address the challenges of accurately capturing complex nonlinear patterns, model interpretability, and the limitations of traditional fixed nonlinear activation functions in medical image segmentation, this paper proposes a novel Dynamic Kolmogorov-Arnold Network U-Net (Dynamic KAN UNet, DKAN-UNet). This network builds upon the classic U-Net encoder-decoder architecture by introducing an innovative Dynamic KAN module (DKAN Block), aiming to achieve feature-dependent adaptive nonlinear modeling. Specifically, the DKAN Block incorporates a lightweight Non-linearity Adaptation Unit (NAU), which can dynamically adjust the parameters of the learnable activation function in the KAN layer (such as the grid points or coefficients of the B-spline basis functions) based on the local characteristics of the input features, thereby providing customized nonlinear transformations for different regions and semantic information. Additionally, we have designed a multi-scale dynamic KAN fusion strategy, deploying DKAN Blocks in both the encoder and decoder and achieving adaptive nonlinear interaction of multi-scale features through skip connections. Experimental results demonstrate that DKAN-UNet achieves significantly superior performance on multiple medical image segmentation datasets compared to existing methods such as U-KAN, while significantly enhancing the model’s nonlinear modeling capability and interpretability.
文章引用:吴彦霏, 敬长桦, 拓家凤, 林玲. 基于动态Kolmogorov-Arnold网络与自适应非线性建模的医学图像分割[J]. 计算机科学与应用, 2026, 16(3): 214-224. https://doi.org/10.12677/csa.2026.163100

参考文献

[1] Ronneberger, O., Fischer, P. and Brox, T. (2015) U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Navab, N., Hornegger, J., Wells, W. and Frangi, A., Eds., Medical Image Computing and Computer-Assisted InterventionMICCAI 2015, Springer, 234-241. [Google Scholar] [CrossRef
[2] Zhou, Z., Rahman Siddiquee, M.M., Tajbakhsh, N. and Liang, J. (2018) Unet++: A Nested U-Net Architecture for Medical Image Segmentation. In: Stoyanov, D., et al., Eds., Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, Springer, 3-11. [Google Scholar] [CrossRef] [PubMed]
[3] Oktay, O., Schlemper, J., Folgoc, L.L., et al. (2018) Attention U-Net: Learning Where to Look for the Pancreas. arXiv: 1804.03999.
[4] Liu, Z., Wang, Y., Vaidya, S., et al. (2024) KAN: Kolmogorov-Arnold Networks. arXiv: 2404.19756.
[5] Wang, J., Li, H., Zhang, Y., et al. (2024) U-KAN: A U-Shaped KAN for Medical Image Segmentation. arXiv: 2405.01211.
[6] Çiçek, Ö., Abdulkadir, A., Lienkamp, S.S., Brox, T. and Ronneberger, O. (2016) 3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation. In: Ourselin, S., Joskowicz, L., Sabuncu, M., Unal, G. and Wells, W., Eds., Medical Image Computing and Computer-Assisted InterventionMICCAI 2016, Springer, 424-432. [Google Scholar] [CrossRef
[7] Penkin, M. and Krylov, A. (2025) FunKAN: Functional Kolmogorov-Arnold Network for Medical Image Enhancement and Segmentation. arXiv: 2509.13508.
https://arxiv.org/abs/2509.13508
[8] Han, S., Mao, H. and Dally, W.J. (2015) Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding. arXiv: 1510.00149.
https://arxiv.org/abs/1510.00149
[9] Shazeer, N., Mirhoseini, A., Maziarz, K., Davis, A., Le, Q., Hinton, G. and Dean, J. (2017) Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer. arXiv: 1701.06538.
https://arxiv.org/abs/1701.06538
[10] Ramachandran, P., Zoph, B. and Le, Q.V. (2017) Swish: A Self-Gated Activation Function. arXiv: 1710.05941.
https://arxiv.org/abs/1710.05941
[11] Misra, D. (2020) Mish: A Self Regularized Non-Monotonic Activation Function. arXiv: 1908.08681.
https://arxiv.org/abs/1908.08681
[12] Gulcehre, C., Moczulski, M., Denil, M., Pascanu, R. and Bengio, Y. (2016) Dynamic Activation Functions. arXiv: 1603.00936.
https://arxiv.org/abs/1603.00936
[13] Milletari, F., Navab, N. and Ahmadi, S. (2016) V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation. 2016 Fourth International Conference on 3D Vision (3DV), Stanford, 25-28 October 2016, 565-571. [Google Scholar] [CrossRef
[14] Mao, A., Mohri, M. and Zhong, Y. (2023) Cross-Entropy Loss Functions: Theoretical Analysis and Applications. International Conference on Machine Learning, 23803-23828.
[15] Codella, N., Rotemberg, V., Tschandl, P., Celebi, M. E., Dusza, S., Gutman, D., et al. (2019) Skin Lesion Analysis Toward Melanoma Detection 2018: A Challenge Hosted by the International Skin Imaging Collaboration (ISIC). arXiv: 1902.03368.
https://arxiv.org/abs/1902.03368
[16] Landman, B., Xu, Z., Igelsias, J., Styner, M., Langerak, T. and Klein, A. (2015) MICCAI Multi-Atlas Labeling Beyond the Cranial Vault—Workshop and Challenge.
https://www.synapse.org/Synapse:syn3193805
[17] Bernard, O., Lalande, A., Zotti, C., Cervenansky, F., Yang, X., Heng, P., et al. (2018) Deep Learning Techniques for Automatic MRI Cardiac Multi-Structures Segmentation and Diagnosis: Is the Problem Solved? IEEE Transactions on Medical Imaging, 37, 2514-2525. [Google Scholar] [CrossRef] [PubMed]
[18] Dice, L.R. (1945) Measures of the Amount of Ecologic Association between Species. Ecology, 26, 297-302. [Google Scholar] [CrossRef
[19] Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J. and Zisserman, A. (2009) The Pascal Visual Object Classes (VOC) Challenge. International Journal of Computer Vision, 88, 303-338. [Google Scholar] [CrossRef
[20] Karimi, D. and Salcudean, S.E. (2020) Reducing the Hausdorff Distance in Medical Image Segmentation with Convolutional Neural Networks. IEEE Transactions on Medical Imaging, 39, 499-513. [Google Scholar] [CrossRef] [PubMed]
[21] Heimann, T., van Ginneken, B., Styner, M.A., Arzhaeva, Y., Aurich, V., Bauer, C., et al. (2009) Comparison and Evaluation of Methods for Liver Segmentation from CT Datasets. IEEE Transactions on Medical Imaging, 28, 1251-1265. [Google Scholar] [CrossRef] [PubMed]