基于多任务学习的时空混合注意力脑电信号情绪识别
Spatio-Temporal Hybrid Attention-Based EEG Emotion Recognition via Multi-Task Learning
摘要: 针对传统脑电情绪识别方法中存在的特征表征单维化、时序依赖建模不充分及任务间关联性被忽视等局限,本文提出一种面向多任务学习的时空混合注意力脑电情绪识别模型(MT-STCNN)。该方法首先在特征层面融合微分熵(DE)与功率谱密度(PSD)特征,从信号复杂度与频域能量分布两个维度全面表征情绪状态;在网络架构中引入轻量级Transformer编码器,强化对长时序脑电信号的全局依赖建模能力;进一步构建多任务协同学习框架,联合优化效价与唤醒度两项情感维度识别任务,通过特征共享机制与跨任务注意力模块实现任务间的知识迁移与互补,提升模型的识别效率。在DEAP数据集上的实验结果显示,本文方法在效价与唤醒度识别上的平均准确率分别达到98.26%和98.67%,性能显著优于当前主流模型,充分验证了所提方法在脑电情绪识别任务中的有效性与先进性。
Abstract: Aiming at the issues of single feature modality, inadequate temporal dependency modeling, and neglected task correlations in traditional EEG-based emotion recognition, this paper proposes a spatio-temporal hybrid attention EEG emotion recognition method based on multi-task learning (MT-STCNN). At the feature level, the method integrates Differential Entropy (DE) and Power Spectral Density (PSD) features to comprehensively characterize emotional states from both information complexity and energy distribution perspectives. In the network architecture, a lightweight Transformer encoder is introduced to enhance global modeling capabilities for long-term temporal dependencies. Furthermore, a multi-task learning framework is constructed to jointly optimize the recognition of valence and arousal dimensions. Through feature sharing and a cross-task attention mechanism, the model’s recognition efficiency is improved. Experiments on the DEAP dataset demonstrate that the proposed model achieves average recognition accuracies of 98.26% for valence and 98.67% for arousal, significantly outperforming existing mainstream methods, thereby validating the effectiveness and advancement of the proposed approach in EEG-based emotion recognition.
文章引用:陈爽, 李佳艳, 于欣琪, 刘甲辉, 张丽艳. 基于多任务学习的时空混合注意力脑电信号情绪识别[J]. 人工智能与机器人研究, 2026, 15(2): 548-558. https://doi.org/10.12677/airr.2026.152053

参考文献

[1] Özerdem, M.S. and Polat, H. (2017) Emotion Recognition Based on EEG Features in Movie Clips with Channel Selection. Brain Informatics, 4, 241-252. [Google Scholar] [CrossRef] [PubMed]
[2] Liu, J., Meng, H., Li, M. and Zhang, F. (2020) Emotion Recognition from Multiband EEG Signals Using CapsNet. Sensors, 20, Article 2702.
[3] Zhang, Y. and Wu, D. (2021) Multi-Task Learning for EEG-Based Emotion Recognition with Heterogeneous Feature Fusion. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 29, 1234-1243.
[4] Jia, J., Zhang, B., Lv, H., Xu, Z., Hu, S. and Li, H. (2022) CR-GCN: Channel-Relationships-Based Graph Convolutional Network for EEG Emotion Recognition. Brain Sciences, 12, Article 987. [Google Scholar] [CrossRef] [PubMed]
[5] 张晓龙, 许钢, 高峰, 等. 基于脑电图多特征融合的情绪识别研究[J/OL]. 重庆工商大学学报(自然科学版), 1-10.
https://link.cnki.net/urlid/50.1155.N.20250423.1150.004, 2026-02-11.
[6] 刘宝洲. 周期图法功率谱估计及其改进算法的研究[J]. 电子测量技术, 2020, 43(5): 76-79.
[7] 姚文俊. 自相关法和Burg法在AR模型功率谱估计中的仿真研究[J]. 计算机与数字工程, 2007, 35(10): 32-34, 187.
[8] Alam, R., Zhao, H., Goodwin, A., Kavehei, O. and McEwan, A. (2020) Differences in Power Spectral Densities and Phase Quantities Due to Processing of EEG Signals. Sensors, 20, Article 6285. [Google Scholar] [CrossRef] [PubMed]
[9] Hou, Q., Zhou, D. and Feng, J. (2021) Coordinate Attention for Efficient Mobile Network Design. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 13713-13722. [Google Scholar] [CrossRef
[10] Zhao, B., Li, X., Lu, X. and Wang, Z. (2023) TST: A Transformer-Based Spatial-Temporal Network for EEG Emotion Recognition. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 31, 1234-1243.
[11] Gao, Y., Chen, Z., Liu, W. and Zhang, T. (2024) Spatio-Temporal Hierarchical Transformer with Cross-Domain Adaptation for EEG-Based Emotion Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46, 3057-3071.
[12] Zhang, Y. and Yang, Q. (2022) A Survey on Multi-Task Learning: Recent Progress and Challenges. IEEE Transactions on Neural Networks and Learning Systems, 33, 2809-2824.
[13] Liu, J., Meng, H., Li, M., et al. (2022) Neural Correlates of Valence and Arousal in EEG-Based Emotion Recognition: A Multimodal Study. IEEE Transactions on Affective Computing, 13, 123-135.
[14] Wang, S., Wang, H., Cheng, C. and Shen, L. (2023) Multi-Task Learning with Dynamic Feature Alignment for EEG-Based Valence-Arousal Classification. IEEE Transactions on Affective Computing, 14, 987-999.
[15] Zheng, W.L., Zhu, J.Y. and Lu, B.L. (2023) EEG-Based Emotion Recognition: A Multimodal and Multi-Task Learning Perspective. IEEE Reviews in Biomedical Engineering, 16, 45-61.
[16] Yang, Y., Wu, Q., Qiu, M., Wang, Y. and Chen, X. (2018) Emotion Recognition from Multi-Channel EEG through Parallel Convolutional Recurrent Neural Network. 2018 International Joint Conference on Neural Networks (IJCNN), Rio de Janeiro, 8-13 July 2018, 1-7. [Google Scholar] [CrossRef
[17] Wang, T., Huang, X., Xiao, Z., Cai, W. and Tai, Y. (2024) EEG Emotion Recognition Based on Differential Entropy Feature Matrix through 2D-CNN-LSTM Network. EURASIP Journal on Advances in Signal Processing, 2024, Article No. 49. [Google Scholar] [CrossRef
[18] Tao, W., Li, C., Song, R., Cheng, J., Liu, Y., Wan, F., et al. (2023) EEG-Based Emotion Recognition via Channel-Wise Attention and Self Attention. IEEE Transactions on Affective Computing, 14, 382-393. [Google Scholar] [CrossRef
[19] Rudakov, E., Laurent, L., Cousin, V., Roshdi, A., Fournier, R., Nait-ali, A., et al. (2021) Multi-Task CNN Model for Emotion Recognition from EEG Brain Maps. 2021 4th International Conference on Bio-Engineering for Smart Technologies (BioSMART), Paris, 8-10 December 2021, 1-4. [Google Scholar] [CrossRef