基于相对熵最优传输的长尾分类分布对齐方法
Relative Entropy Optimal Transport for Distribution Alignment in Long-Tailed Classification
DOI: 10.12677/sa.2025.149273, PDF,   
作者: 庄雅萍:福建师范大学数学与统计学院,福建 福州
关键词: 长尾分类最优传输神经崩溃Long-Tailed Classification Optimal Transport Neural Collapse
摘要: 现实世界中的多分类任务普遍面临严重的类别不平衡问题,尤其是在长尾分布下的传统深度学习方法往往容易在多数类上出现过拟合,却对少数类的识别能力不足,导致整体分类性能下降。为缓解这一问题,本文在分布对齐优化方法的基础上,提出一种基于相对熵最优传输的分布对齐优化方法。该方法通过引入相对熵正则化项,结合指定先验分布,使得分布匹配更具泛化能力。与原始方法对比,新方法能够根据数据集的类别分布特性自适应调整匹配策略,并促进神经崩溃现象的形成,增强特征的判别性。将新方法作为正则化项能与多种监督损失函数进行结合,在多个长尾数据集上进行评估时,结果表明新方法可促进分类模型在不同评价指标上的性能提升,验证了该方法在长尾分类场景中的有效性。
Abstract: In real-world multi-class classification tasks, severe class imbalance is a common challenge, especially under long-tailed distributions. Traditional deep learning methods tend to overfit majority classes while exhibiting poor recognition performance on minority classes, leading to an overall decline in classification performance. To mitigate this issue, this paper proposes a distribution alignment optimization method based on relative entropy optimal transport, extending conventional distribution alignment approaches. By introducing a relative entropy regularization term and incorporating a specified prior distribution, the proposed method improves the generalization ability of distribution matching. Compared with the original method, it adaptively adjusts the matching strategy according to the dataset’s class distribution characteristics, effectively promoting the formation of neural collapse, thereby enhancing feature discriminability. When integrated as a regularization term into various supervised loss functions and evaluated on multiple long-tailed datasets, the proposed method consistently improves classification performance across different evaluation metrics, validating its effectiveness in long-tailed classification scenarios.
文章引用:庄雅萍. 基于相对熵最优传输的长尾分类分布对齐方法[J]. 统计学与应用, 2025, 14(9): 249-261. https://doi.org/10.12677/sa.2025.149273

参考文献

[1] Aggarwal, U., Popescu, A. and Hudelot, C. (2020) Active Learning for Imbalanced Datasets. Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, Snowmass, 1-5 March 2020, 1428-1437. [Google Scholar] [CrossRef
[2] Rangamani, A., Lindegaard, M., Galanti, T. and Poggio, T.A. (2023) Feature Learning in Deep Classifiers through Intermediate Neural Collapse. International Conference on Machine Learning, Honolulu, 23-29 July 2023, 28729-28745.
[3] Fang, C., He, H., Long, Q. and Su, W.J. (2021) Exploring Deep Neural Networks via Layer-Peeled Model: Minority Collapse in Imbalanced Training. Proceedings of the National Academy of Sciences, 118, e2103091118. [Google Scholar] [CrossRef] [PubMed]
[4] Yang, Y., Chen, S., Li, X., Xie, L., Lin, Z. and Tao, D. (2022) Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnable Classifier at the End of Deep Neural Network? Proceedings of the 36th International Conference on Neural Information Processing Systems, New Orleans, 28 November-9 December 2022, 37991-38002.
[5] Gao, J., Zhao, H., dan Guo, D. and Zha, H. (2024) Distribution Alignment Optimization through Neural Collapse for Long-Tailed Classification. Proceedings of the 41st International Conference on Machine Learning, Vienna, 21-27 July 2024, 14969-14987.
[6] Wang, L., Han, M., Li, X., Zhang, N. and Cheng, H. (2021) Review of Classification Methods on Unbalanced Data Sets. IEEE Access, 9, 64606-64628. [Google Scholar] [CrossRef
[7] Goodfellow, I.J., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A. and Bengio, Y. (2014) Generative Adversarial Networks. Communications of the ACM, 63, 139-144.
[8] Lin, T., Goyal, P., Girshick, R., He, K. and Dollar, P. (2017) Focal Loss for Dense Object Detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, 22-29 October 2017, 2980-2988. [Google Scholar] [CrossRef
[9] Cao, K., Wei, C., Gaidon, A., Arechiga, N. and Ma, T. (2019) Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss. Proceedings of the 33rd International Conference on Neural Information Processing Systems, Vancouver, 8-14 December 2019, 1567-1578.
[10] Yan, H., Qian, Y., Peng, F., Luo, J. and Li, F. (2024) Neural Collapse to Multiple Centers for Imbalanced Data. Advances in Neural Information Processing Systems, 37, 65583-65617.
[11] Zhang, E., Li, C., Geng, C. and Chen, S. (2025) All-Around Neural Collapse for Imbalanced Classification. IEEE Transactions on Knowledge and Data Engineering, 37, 4460-4470. [Google Scholar] [CrossRef
[12] Xie, L., Yang, Y., Cai, D. and He, X. (2023) Neural Collapse Inspired Attraction-Repulsion-Balanced Loss for Imbalanced Learning. Neurocomputing, 527, 60-70. [Google Scholar] [CrossRef
[13] Cuturi, M. (2013) Lightspeed Computation of Optimal Transportation Distances. Advances in Neural Information Processing Systems, 26, 2292-2300.
[14] Yan, Y., Tan, M., Xu, Y., Cao, J., Ng, M., Min, H., et al. (2019) Oversampling for Imbalanced Data via Optimal Transport. Proceedings of the AAAI Conference on Artificial Intelligence, 33, 5605-5612. [Google Scholar] [CrossRef
[15] Salimans, T., Zhang, H., Radford, A. and Metaxas, D. (2018) Improving GANs Using Optimal Transport. International Conference on Learning Representations, Vancouver, 30 April-3 May 2018, 1-13.
[16] Guo, D., Li, Z., Zhao, H., Zhou, M. and Zha, H. (2022) Learning to Re-Weight Examples with Optimal Transport for Imbalanced Classification. Proceedings of the 36th International Conference on Neural Information Processing Systems, New Orleans, 28 November-9 December 2022, 25517-25530.
[17] Shi, L., Zhen, H., Zhang, G. and Yan, J. (2023) Relative Entropic Optimal Transport: A (Prior-Aware) Matching Perspective to (Unbalanced) Classification. Proceedings of the 37th International Conference on Neural Information Processing Systems, New Orleans, 10-16 December 2023, 22085-22098.
[18] Krizhevsky, A. and Hinton, G. (2009) Learning Multiple Layers of Features from Tiny Images.
[19] Zhang, H., Cisse, M., Dauphin, Y.N. and Lopez-Paz, D. (2018) mixup: Beyond Empirical Risk Minimization. International Conference on Learning Representations, Vancouver, 30 April-3 May 2018, 1-13.