基于分组蒸馏的可验证隐私保护公平个性化联邦学习方法
A Verifiable Privacy-Preserving Fair Personalized Federated Learning Method Based on Group Distillation
DOI: 10.12677/csa.2026.165186, PDF,   
作者: 王江源:青海大学计算机技术与应用学院,青海 西宁
关键词: 可验证个性化联邦学习隐私保护Verifiable Personalized Federated Learning Privacy Protection
摘要: 个性化联邦学习(Personalized Federated Learning, PFL)允许每个用户在共享全局模型的基础上训练个性化模型,显著提升了模型在异构数据环境下的性能。然而也面临数据隐私泄露、客户端之间模型性能失衡以及聚合结果可信性等方面的挑战。为了解决上述问题,文章设计了基于用户数据相似性的层次聚类分组机制与跨组知识蒸馏机制,在不泄露原始梯度的前提下实现相似数据用户的精准聚类,将数据特征相近的用户归为同一组,共同开展联邦训练,实现数据层面的个性化;并缩小用户之间的模型性能差距,保障了系统的公平性。在隐私保护方面,设计了基于掩码的安全聚合机制,确保各用户上传的梯度参数在传输与聚合过程中始终处于隐私保护状态。同时,设计了验证机制,使诚实用户在接收聚合结果后能够独立校验其正确性。实验结果表明,本方案满足了高级别隐私保护要求,确保了用户之间的公平性,并且模型准确率优于现有方案。
Abstract: Personalized Federated Learning (PFL) allows each user to train a personalized model based on a shared global model, significantly improving model performance in heterogeneous data environments. However, it also faces challenges such as data privacy leakage, imbalance in model performance among clients, and the trustworthiness of aggregation results. To address these issues, a hierarchical clustering grouping mechanism based on user data similarity and a cross-group knowledge distillation mechanism were designed. This enables precise clustering of users with similar data without revealing original gradients, allowing users with similar data to train together in the same group for federated training, achieving data-level personalization; it also reduces the performance gap between users, ensuring system fairness. In terms of privacy protection, a mask-based secure aggregation mechanism was designed to ensure that the gradient parameters uploaded by users remain protected throughout transmission and aggregation. At the same time, a verification mechanism was designed so that honest users can independently verify the correctness of the aggregated results once received. Experimental results show that this solution meets high-level privacy protection requirements, ensures fairness among users, and achieves model accuracy superior to existing solutions.
文章引用:王江源. 基于分组蒸馏的可验证隐私保护公平个性化联邦学习方法[J]. 计算机科学与应用, 2026, 16(5): 312-326. https://doi.org/10.12677/csa.2026.165186

参考文献

[1] Mcmahan, B., Moore, E., Ramage, D., et al. (2017) Communication-Efficient Learning of Deep Networks from Decentralized Data. Artificial Intelligence and Statistics, 54, 1273-1282.
[2] Sheller, M.J., Edwards, B., Reina, G.A., Martin, J., Pati, S., Kotrotsou, A., et al. (2020) Federated Learning in Medicine: Facilitating Multi-Institutional Collaborations without Sharing Patient Data. Scientific Reports, 10, Article No. 12598. [Google Scholar] [CrossRef] [PubMed]
[3] Abdul Salam, M., Fouad, K.M., Elbably, D.L. and Elsayed, S.M. (2024) Federated Learning Model for Credit Card Fraud Detection with Data Balancing Techniques. Neural Computing and Applications, 36, 6231-6256. [Google Scholar] [CrossRef
[4] Kairouz, P. and McMahan, H.B. (2021) Advances and Open Problems in Federated Learning. Foundations and Trends® in Machine Learning, 14, 1-210. [Google Scholar] [CrossRef
[5] Pei, J., Liu, W., Li, J., Wang, L. and Liu, C. (2024) A Review of Federated Learning Methods in Heterogeneous Scenarios. IEEE Transactions on Consumer Electronics, 70, 5983-5999. [Google Scholar] [CrossRef
[6] Dembani, R., Karvelas, I., Akbar, N.A., Rizou, S., Tegolo, D. and Fountas, S. (2025) Agricultural Data Privacy and Federated Learning: A Review of Challenges and Opportunities. Computers and Electronics in Agriculture, 232, Article 110048. [Google Scholar] [CrossRef
[7] 郭倩, 赵津, 过弋. 基于分层聚类的个性化联邦学习隐私保护框架[J]. 信息网络安全, 2024, 24(8): 1196-1209.
[8] Sabah, F., Chen, Y., Yang, Z., Raheem, A., Azam, M., Ahmad, N., et al. (2025) FairDPFL-SCS: Fair Dynamic Personalized Federated Learning with Strategic Client Selection for Improved Accuracy and Fairness. Information Fusion, 115, Article 102756. [Google Scholar] [CrossRef
[9] Tan, A.Z., Yu, H., Cui, L. and Yang, Q. (2023) Towards Personalized Federated Learning. IEEE Transactions on Neural Networks and Learning Systems, 34, 9587-9603. [Google Scholar] [CrossRef] [PubMed]
[10] Li, X., Jiang, M., Zhang, X., et al. (2021) FedBN: Federated learning on Non-IID Features via Local Batch Nor-Malization. arXiv:2102.07623.
[11] Wang, H., Kaplan, Z., Niu, D. and Li, B. (2020) Optimizing Federated Learning on Non-IID Data with Reinforcement Learning. IEEE INFOCOM 2020-IEEE Conference on Computer Communications, Toronto, 6-9 July 2020, 1698-1707. [Google Scholar] [CrossRef
[12] Chen, Y., Qin, X., Wang, J., Yu, C. and Gao, W. (2020) FedHealth: A Federated Transfer Learning Framework for Wearable Healthcare. IEEE Intelligent Systems, 35, 83-93. [Google Scholar] [CrossRef
[13] Yang, H., He, H., Zhang, W. and Cao, X. (2021) FedSteg: A Federated Transfer Learning Framework for Secure Image Steganalysis. IEEE Transactions on Network Science and Engineering, 8, 1084-1094. [Google Scholar] [CrossRef
[14] Fallah, A., Mokhtari, A. and Ozdaglar, A. (2020) Personalized Federated Learning with Theoretical Guarantees: A Model-Agnostic Meta-Learning Approach. Advances in Neural Information Processing Systems, 33, 3557-3568.
[15] Jiang, Y., Konečný, J., Rush, K., et al. (2019) Improving Federated Learning Personalization via Model Agnostic Meta Learning. arXiv:1909.12488.
[16] Zhang, J., Guo, S., Ma, X., et al. (2021) Parameterized Knowledge Transfer for Personalized Federated Learning. Advances in Neural Information Processing Systems, 34, 10092-10104.
[17] Li, D. and Wang, J. (2019) FedMD: Heterogenous Federated Learning via Model Distillation. arXiv:1910.03581.
[18] Zhu, L., Liu, Z. and Han, S. (2019) Deep Leakage from Gradients. arXiv:1906.08935.
[19] Geiping, J., Bauermeister, H., Dröge, H., et al. (2020) Inverting Gradients-How Easy Is It to Break Privacy in Federated Learning? Advances in Neural Information Processing Systems, 33, 16937-16947.
[20] Mohassel, P. and Zhang, Y. (2017) SecureML: A System for Scalable Privacy-Preserving Machine Learning. 2017 IEEE Symposium on Security and Privacy (SP), San Jose, 22-26 May 2017, 19-38. [Google Scholar] [CrossRef
[21] Byali, M., Chaudhari, H., Patra, A. and Suresh, A. (2020) FLASH: Fast and Robust Framework for Privacy-Preserving Machine Learning. Proceedings on Privacy Enhancing Technologies, 2020, 459-480. [Google Scholar] [CrossRef
[22] Xu, R., Baracaldo, N., Zhou, Y., Anwar, A. and Ludwig, H. (2019) HybridAlpha: An Efficient Approach for Privacy-Preserving Federated Learning. Proceedings of the 12th ACM Workshop on Artificial Intelligence and Security, London, 15 November 2019, 13-23. [Google Scholar] [CrossRef
[23] 徐茹枝, 仝雨蒙, 戴理朋. 基于异构数据的联邦学习自适应差分隐私方法研究[J]. 信息网络安全, 2025, 25(1): 63-77.
[24] Zhang, L., Xu, J., Vijayakumar, P., Sharma, P.K. and Ghosh, U. (2023) Homomorphic Encryption-Based Privacy-Preserving Federated Learning in IoT-Enabled Healthcare System. IEEE Transactions on Network Science and Engineering, 10, 2864-2880. [Google Scholar] [CrossRef
[25] Park, J., Yu, N.Y. and Lim, H. (2022) Privacy-Preserving Federated Learning Using Homomorphic Encryption with Different Encryption Keys. 2022 13th International Conference on Information and Communication Technology Convergence (ICTC), Jeju Island, 19-21 October 2022, 1869-1871. [Google Scholar] [CrossRef
[26] Liao, J., Chen, Z. and Larsson, E.G. (2022) Over-the-Air Federated Learning with Privacy Protection via Correlated Additive Perturbations. 2022 58th Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, 27-30 September 2022, 1-8. [Google Scholar] [CrossRef
[27] Wei, K., Li, J., Ding, M., Ma, C., Yang, H.H., Farokhi, F., et al. (2020) Federated Learning with Differential Privacy: Algorithms and Performance Analysis. IEEE Transactions on Information Forensics and Security, 15, 3454-3469. [Google Scholar] [CrossRef
[28] Bonawitz, K., Ivanov, V., Kreuter, B., Marcedone, A., McMahan, H.B., Patel, S., et al. (2017) Practical Secure Aggregation for Privacy-Preserving Machine Learning. Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, Dallas, 30 October 2017-3 November 2017, 1175-1191. [Google Scholar] [CrossRef
[29] Liu, Z., Guo, J., Lam, K. and Zhao, J. (2023) Efficient Dropout-Resilient Aggregation for Privacy-Preserving Machine Learning. IEEE Transactions on Information Forensics and Security, 18, 1839-1854. [Google Scholar] [CrossRef
[30] Pan, Y., Su, Z., Ni, J., Wang, Y. and Zhou, J. (2024) Privacy-Preserving Heterogeneous Personalized Federated Learning with Knowledge. IEEE Transactions on Network Science and Engineering, 11, 5969-5982. [Google Scholar] [CrossRef
[31] Guo, X., Liu, Z., Li, J., Gao, J., Hou, B., Dong, C., et al. (2021) VeriFL: Communication-Efficient and Fast Verifiable Aggregation for Federated Learning. IEEE Transactions on Information Forensics and Security, 16, 1736-1751. [Google Scholar] [CrossRef
[32] Gao, S., Luo, J., Zhu, J., Dong, X. and Shi, W. (2023) VCD-FL: Verifiable, Collusion-Resistant, and Dynamic Federated Learning. IEEE Transactions on Information Forensics and Security, 18, 3760-3773. [Google Scholar] [CrossRef