基于语义相似性的联邦对比学习
Federated Contrastive Learning Based on Semantic Similarity
DOI: 10.12677/mos.2025.145437, PDF,   
作者: 徐阿龙:上海理工大学光电信息与计算机工程学院,上海
关键词: 联邦学习原型对比学习知识蒸馏个性化Federated Learning Prototype Contrastive Learning Knowledge Distillation Personalization
摘要: 联邦学习在数据异构场景中面临着数据分布偏斜以及知识积累效率低下的双重挑战,尽管研究者们已提出诸如原型对比学习和知识蒸馏等方法来应对这些挑战,但这些方法未能充分考虑语义相似性的影响,从而导致语义相似的类别难以区分、全局原型质量欠佳以及知识积累效率低下。为解决这些问题,文章提出了一种基于语义相似性的联邦对比学习框架。该框架通过结合原型相似性和数据量聚合全局原型,为后续训练提供高质量的基础。然后利用全局语义关联矩阵指导知识蒸馏,高效地积累共性知识。最后,使用全局语义关联矩阵动态筛选困难负原型进行对比学习,以精细化决策边界。实验结果表明,与现有算法相比,本文提出的算法在准确率上提升了6.7%至7.7%,罕见类别的召回率和F1值提升了20%,提高了系统在数据异构场景下的泛化性能。
Abstract: Federated learning faces the dual challenges of data distribution skew and low knowledge accumulation efficiency in heterogeneous data scenarios. Although researchers have proposed methods such as prototype contrastive learning and knowledge distillation to address these challenges, these methods fail to fully consider the impact of semantic similarity. As a result, semantically similar categories are difficult to distinguish, the quality of global prototypes is suboptimal, and knowledge accumulation efficiency remains low. To address these issues, this paper proposes a federated contrastive learning framework based on semantic similarity. The framework combines prototype similarity and data volume to aggregate global prototypes, providing a high-quality foundation for subsequent training. It then utilizes the global semantic association matrix to guide knowledge distillation, efficiently accumulating common knowledge. Finally, it dynamically screens difficult negative prototypes for contrastive learning using the global semantic association matrix to refine decision boundaries. Experimental results show that compared with existing algorithms, the proposed algorithm in this paper improves accuracy by 6.7% to 7.7%, and the recall and F1 values of rare categories are increased by 20%, thereby enhancing the system’s generalization performance in heterogeneous data scenarios.
文章引用:徐阿龙. 基于语义相似性的联邦对比学习[J]. 建模与仿真, 2025, 14(5): 829-842. https://doi.org/10.12677/mos.2025.145437

参考文献

[1] Mothukuri, V., Parizi, R.M., Pouriyeh, S., Huang, Y., Dehghantanha, A. and Srivastava, G. (2021) A Survey on Security and Privacy of Federated Learning. Future Generation Computer Systems, 115, 619-640. [Google Scholar] [CrossRef
[2] Wang, S., Yan, Z., Zhang, D., Wei, H., Li, Z. and Li, R. (2023) Prototype Knowledge Distillation for Medical Segmentation with Missing Modality. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, 4-10 June 2023, 1-5. [Google Scholar] [CrossRef
[3] Tan, Y., Long, G., Liu, L., Zhou, T., Lu, Q., Jiang, J., et al. (2022) FedProto: Federated Prototype Learning across Heterogeneous Clients. Proceedings of the AAAI Conference on Artificial Intelligence, 36, 8432-8440. [Google Scholar] [CrossRef
[4] Xu, J., Tong, X. and Huang, S.L. (2023) Personalized Federated Learning with Feature Alignment and Classifier Collaboration. The 11th International Conference on Learning Representations, Kigali, 1-5 May 2023.
[5] Zhou, Y., Qu, X., You, C., Zhou, J., Tang, J., Zheng, X., et al. (2025) FedSA: A Unified Representation Learning via Semantic Anchors for Prototype-Based Federated Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 39, 23009-23017. [Google Scholar] [CrossRef
[6] Zhang, J., Liu, Y., Hua, Y. and Cao, J. (2024) FedTGP: Trainable Global Prototypes with Adaptive-Margin-Enhanced Contrastive Learning for Data and Model Heterogeneity in Federated Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 38, 16768-16776. [Google Scholar] [CrossRef
[7] Li, Q., He, B. and Song, D. (2021) Model-Contrastive Federated Learning. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 10708-10717. [Google Scholar] [CrossRef
[8] Mu, X., Shen, Y., Cheng, K., Geng, X., Fu, J., Zhang, T., et al. (2023) FedProc: Prototypical Contrastive Federated Learning on Non-IID Data. Future Generation Computer Systems, 143, 93-104. [Google Scholar] [CrossRef
[9] Tahir, A., Chen, Y. and Nilayam, P. (2022) FedSS: Federated Learning with Smart Selection of Clients.
[10] Liu, Q., Sun, S., Liang, Y., Xue, J. and Liu, M. (2024) Personalized Federated Learning for Spatio-Temporal Forecasting: A Dual Semantic Alignment-Based Contrastive Approach.
[11] Wu, C., Wu, F., Lyu, L., Huang, Y. and Xie, X. (2022) Communication-Efficient Federated Learning via Knowledge Distillation. Nature Communications, 13, Article No. 2032. [Google Scholar] [CrossRef] [PubMed]
[12] Yao, D., Pan, W., Dai, Y., Wan, Y., Ding, X., Yu, C., et al. (2024) FedGKD: Toward Heterogeneous Federated Learning via Global Knowledge Distillation. IEEE Transactions on Computers, 73, 3-17. [Google Scholar] [CrossRef
[13] Jeong, E., Oh, S., Kim, H., Park, J., Bennis, M. and Kim, S.L. (2018) Communication-Efficient On-Device Machine Learning: Federated Distillation and Augmentation under Non-IID Private Data.
[14] Itahara, S., Nishio, T., Koda, Y., Morikura, M. and Yamamoto, K. (2023) Distillation-Based Semi-Supervised Federated Learning for Communication-Efficient Collaborative Training with Non-IID Private Data. IEEE Transactions on Mobile Computing, 22, 191-205. [Google Scholar] [CrossRef
[15] Zhu, Z., Hong, J. and Zhou, J. (2021) Data-Free Knowledge Distillation for Heterogeneous Federated Learning. International Conference on Machine Learning, Online, 18-24 July 2021, 12878-12889.