基于生成式数据增强的联邦多标签弱监督图神经网络及其应用研究
Generative Data Augmentation Based Federated Multi-Label Weakly Supervised Graph Neural Networks and Their Applications
DOI: 10.12677/csa.2025.1511313, PDF,    国家科技经费支持
作者: 钟 磊, 徐佳隆, 姜雪娇, 梁树华, 吴 民:中国南方电网海南电网有限责任公司,海南 海口;李 胜, 艾 渊*, 杨景旭:南方电网数字电网集团有限公司,广东 广州
关键词: 图神经网络类不平衡噪声多标签分类联邦学习Graph Neural Network Class Imbalance Noisy Multi-Label Classification Federated Learning
摘要: 隐私敏感领域的图像分类问题面临诸多挑战,尤其设备磨损等因素会在多标签电力场景分类任务中引入标签噪声,而区域差异会导致不同区域的本地数据集上出现类别不平衡问题。尽管联邦学习(Federated Learning, FL)是一种有效的隐私保护解决方案,但很少有联邦学习框架专门针对多标签电力场景分类中的标签噪声和类别不平衡问题来研究有效的解决方案。在此背景下,本文提出了一种基于生成式数据增强的联邦多标签弱监督图神经网络(Federated Multi-label Graph Neural Network with Generative Data Augmentation for Noisy Electricity Image Classification, FedMGNN),用于监督的标签信息包含噪声的电力图像分类。该网络在本地和全局模型构建过程中融入两方面的关键创新,旨在从本地模型和全局模型两方面协同解决标签噪声和类别不平衡问题。首先,设计了一种客户端的本地识别模型,将基于生成模型的数据增强算法与多标签对比学习框架相融合,从而同时应对两个关键挑战:(1) 通过合成代表性样本弥补少数类别的数据不足问题,从数据上缓解类别不平衡;(2) 利用本地模型和接收的全局模型构建对比学习框架,通过对比损失和样本筛选策略来过滤含噪声标签的样本,减轻标签噪声的影响。其次,设计了一种全局聚合策略,采用类别不平衡感知加权方法,将分布式本地模型整合为统一的多标签分类框架。最后,通过在不同标签噪声比例的联邦场景图像数据集上的实验验证了所提出的FedMGNN模型的有效性。
Abstract: The image classification problems in privacy-sensitive domains faces numerous challenges, particularly as factors like equipment wear and tear introduce label noise in multi-label electricity scene classification tasks, and regional differences lead to class imbalance in local datasets from different areas. While Federated Learning (FL) is an effective privacy-preserving solution, few FL frameworks specifically address the issues of label noise and class imbalance in multi-label electricity scene classification tasks. In this context, this paper proposes a Federated Multi-label Graph Neural Network with Generative Data Augmentation for Noisy Electricity Image Classification (FedMGNN), which is designed to handle noisy label information in supervised electricity image classification. The network integrates two key innovations in both local and global model construction to collaboratively address label noise and class imbalance from both the local and global model perspectives. First, we design a local recognition model for the client, combining data augmentation based on generative models with a multi-label contrastive learning framework. This approach addresses two key challenges simultaneously: (1) by synthesizing representative samples, it alleviates the data scarcity problem of minority classes, thus mitigating class imbalance; (2) by leveraging both the local model and the received global model to construct the contrastive learning framework, it filters noisy label samples through contrastive loss and sample selection criteria, thereby reducing the impact of label noise. Secondly, we propose a global aggregation strategy that uses a class imbalance-aware weighting method to integrate distributed local models into a unified multi-label classification framework. Finally, experiments on a federated scene image dataset with varying label noise ratios validate the effectiveness of the proposed FedMGNN model.
文章引用:钟磊, 徐佳隆, 姜雪娇, 李胜, 艾渊, 梁树华, 吴民, 杨景旭. 基于生成式数据增强的联邦多标签弱监督图神经网络及其应用研究[J]. 计算机科学与应用, 2025, 15(11): 378-392. https://doi.org/10.12677/csa.2025.1511313

参考文献

[1] McMahan, B., Moore, E., Ramage, D., et al. (2017) Communication-Efficient Learning of Deep Networks from Decentralized Data. Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, Vol. 54, 1273-1282.
[2] Büyüktaş, B., Weitzel, K., Völkers, S., Zailskas, F. and Demir, B. (2024) Transformer-Based Federated Learning for Multi-Label Remote Sensing Image Classification. 2024 IEEE International Geoscience and Remote Sensing Symposium, Athens, 7-12 July 2024, 8726-8730. [Google Scholar] [CrossRef
[3] Zhang, M. and Zhou, Z. (2014) A Review on Multi-Label Learning Algorithms. IEEE Transactions on Knowledge and Data Engineering, 26, 1819-1837. [Google Scholar] [CrossRef
[4] Zhang, J., Wei, T. and Zhang, M. (2024) Label-Specific Time-Frequency Energy-Based Neural Network for Instrument Recognition. IEEE Transactions on Cybernetics, 54, 7080-7093. [Google Scholar] [CrossRef] [PubMed]
[5] Hang, J. and Zhang, M. (2022) Collaborative Learning of Label Semantics and Deep Label-Specific Features for Multi-Label Classification. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, 9860-9871. [Google Scholar] [CrossRef] [PubMed]
[6] Lanchantin, J., Wang, T., Ordonez, V. and Qi, Y. (2021) General Multi-Label Image Classification with Transformers. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 16478-16488. [Google Scholar] [CrossRef
[7] Cui, J., Li, Y., Zhang, Q., He, Z. and Zhao, S. (2025) A Federated Learning Framework Using FedProx Algorithm for Privacy-Preserving Palmprint Recognition. In: Yu, S.Q., et al., Eds., Biometric Recognition, Springer, 187-196. [Google Scholar] [CrossRef
[8] Wang, J., Liu, Q., Liang, H., Gauri, J. and Poor, H.V. (2021) A Novel Framework for the Analysis and Design of Heterogeneous Federated Learning. IEEE Transactions on Signal Processing, 69, 5234-5249. [Google Scholar] [CrossRef
[9] Niu, X. and Wei, E. (2023) FedHybrid: A Hybrid Federated Optimization Method for Heterogeneous Clients. IEEE Transactions on Signal Processing, 71, 150-163. [Google Scholar] [CrossRef
[10] Huang, X., Li, P. and Li, X. (2024) Stochastic Controlled Averaging for Federated Learning with Communication Compression.
[11] Gao, L., Fu, H., Li, L., Chen, Y., Xu, M. and Xu, C. (2022) FedDC: Federated Learning with Non-IID Data via Local Drift Decoupling and Correction. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, 18-24 June 2022, 10112-10121. [Google Scholar] [CrossRef
[12] Li, J., Zhang, C., Zhou, J.T., Fu, H., Xia, S. and Hu, Q. (2022) Deep-LIFT: Deep Label-Specific Feature Learning for Image Annotation. IEEE Transactions on Cybernetics, 52, 7732-7741. [Google Scholar] [CrossRef] [PubMed]
[13] Yu, Z.B. and Zhang, M.L. (2022) Multi-Label Classification with Label-Specific Feature Generation: A Wrapped Approach. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, 5199-5210.
[14] Jia, B. and Zhang, M. (2023) Multi-Dimensional Multi-Label Classification: Towards Encompassing Heterogeneous Label Spaces and Multi-Label Annotations. Pattern Recognition, 138, Article ID: 109357. [Google Scholar] [CrossRef
[15] Wu, X., Chen, Q., Li, W., Xiao, Y. and Hu, B. (2020) AdaHGNN: Adaptive Hypergraph Neural Networks for Multi-Label Image Classification. Proceedings of the 28th ACM International Conference on Multimedia, Seattle, 12-16 October 2020, 284-293. [Google Scholar] [CrossRef
[16] Cho, H., Mathur, A. and Kawsar, F. (2022) Flame: Federated Learning across Multi-Device Environments. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 6, 1-29. [Google Scholar] [CrossRef
[17] Bei, Y., Chen, W., Chen, H., Zhou, S., Yang, C., Fan, J., et al. (2025) Correlation-Aware Graph Convolutional Networks for Multi-Label Node Classification. Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Vol. 1, 37-48. [Google Scholar] [CrossRef
[18] Chen, J.Y., Li, S.Y., Huang, S.J., et al. (2024) UNM: A Universal Approach for Noisy Multi-Label Learning. IEEE Transactions on Knowledge and Data Engineering, 36, 4968-4980. [Google Scholar] [CrossRef
[19] Liang, S., Huang, J., Hong, J., et al. (2023) FedNoisy: Federated Noisy Label Learning Benchmark.
[20] Gao, P., Geng, S., Zhang, R., Ma, T., Fang, R., Zhang, Y., et al. (2023) Clip-Adapter: Better Vision-Language Models with Feature Adapters. International Journal of Computer Vision, 132, 581-595. [Google Scholar] [CrossRef
[21] Wei, H., Feng, L., Chen, X. and An, B. (2020) Combating Noisy Labels by Agreement: A Joint Training Method with Co-Regularization. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, 13-19 June 2020, 13726-13735. [Google Scholar] [CrossRef
[22] Song, C.S., Granqvist, F. and Talwar, K. (2022) FLAIR: Federated Learning Annotated Image Repository.
[23] Liu, I., Lin, C., Yang, F. and Wang, Y.F. (2024) Language-Guided Transformer for Federated Multi-Label Classification. Proceedings of the AAAI Conference on Artificial Intelligence, 38, 13882-13890. [Google Scholar] [CrossRef
[24] Chen, J., Yan, H., Liu, Z., et al. (2024) When Federated Learning Meets Privacy-Preserving Computation. ACM Computing Surveys, 56, 1-36.
[25] Li, Q., He, B. and Song, D. (2021) Model-Contrastive Federated Learning. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 10713-10722. [Google Scholar] [CrossRef
[26] Radford, A. (2021) Learning Transferable Visual Models from Natural Language Supervision. Proceedings of the 38th International Conference on Machine Learning, 18-24 July 2021, 8748-8763.