基于MAML的少样本跨被试事件相关电位分类快速适应方法
MAML-Based Rapid Adaptation for Few-Shot Cross-Subject ERP Classification
DOI: 10.12677/csa.2026.165189, PDF,   
作者: 陈朝琛, 季洪飞, 李 洁*:同济大学计算机科学与技术学院,上海;白钟飞*:上海市养志康复医院,上海
关键词: 事件相关电位跨被试分类少样本学习模型无关元学习Event-Related Potential Cross-Subject Classification Few-Shot Learning Model-Agnostic Meta-Learning
摘要: 针对跨被试事件相关电位(ERP)解码中个体间差异大、新用户校准样本极少导致传统方法泛化性能严重下降的问题,文章提出一种基于模型无关元学习(MAML)的少样本快速适应方法DF-MAML。该方法以预训练的域不变特征提取器为初始参数,通过元学习框架在源被试构建的大量模拟任务上进行双层优化:内循环利用支持集模拟快速校准,外循环优化元参数使其具备“易于快速适应”的特性。在此基础上,进一步提出基于任务难度的动态分类器冻结策略,根据支持集与查询集的分布偏移自适应地决定是否冻结分类器梯度,从而稳定特征学习并提升元参数质量。在公开ERN数据集与自采中文语义句法违反数据集上的实验表明,在5-shot设定下,本方法的平均AUC分别达到0.7563和0.6705,显著优于EEGNet、原型网络及标准MAML等基线方法;消融实验验证了动态冻结策略的有效性;少样本性能曲线显示,提供5个校准样本可使性能较零样本提升约9%。本方法为低校准负担、高鲁棒性的脑机接口系统提供了关键技术支撑。
Abstract: To address the challenge in cross-subject event-related potential (ERP) decoding, where large inter-subject variability and extremely limited calibration samples for new users severely degrade the generalization performance of conventional methods, this paper proposes a MAML-based few-shot rapid adaptation approach. The proposed method initializes with a pre-trained domain-invariant feature extractor and performs bi-level optimization over a large number of simulated tasks constructed from source subjects within a meta-learning framework. Specifically, the inner loop leverages the support set to emulate rapid calibration, while the outer loop optimizes meta-parameters to facilitate fast adaptation. Furthermore, a task-difficulty-aware dynamic classifier freezing strategy is introduced, which adaptively decides whether to freeze classifier updates based on the distribution shift between the support and query sets, thereby stabilizing feature learning and improving the quality of the meta-learned parameters. Experiments conducted on a public ERN dataset and a self-collected Chinese semantic-syntax violation dataset demonstrate that, under the 5-shot setting, the proposed method achieves average AUC scores of 0.7563 and 0.6705, respectively, significantly outperforming baseline methods including EEGNet, Prototypical Networks, and standard MAML. Ablation studies further validate the effectiveness of the proposed dynamic freezing strategy. The few-shot performance analysis shows that using only five calibration samples yields an improvement of approximately 9% over the zero-shot setting. The proposed method offers a promising solution for developing brain-computer interface systems with low calibration cost and high robustness.
文章引用:陈朝琛, 季洪飞, 白钟飞, 李洁. 基于MAML的少样本跨被试事件相关电位分类快速适应方法[J]. 计算机科学与应用, 2026, 16(5): 350-364. https://doi.org/10.12677/csa.2026.165189

参考文献

[1] Sur, S. and Sinha, V. (2009) Event-Related Potential: An Overview. Industrial Psychiatry Journal, 18, 70-73. [Google Scholar] [CrossRef] [PubMed]
[2] Woldorff, M.G. (1993) Distortion of ERP Averages Due to Overlap from Temporally Adjacent ERPs: Analysis and Correction. Psychophysiology, 30, 98-119. [Google Scholar] [CrossRef] [PubMed]
[3] Mouraux, A. and Iannetti, G.D. (2008) Across-Trial Averaging of Event-Related EEG Responses and Beyond. Magnetic Resonance Imaging, 26, 1041-1054. [Google Scholar] [CrossRef] [PubMed]
[4] Allen, J. (1977) Short Term Spectral Analysis, Synthesis, and Modification by Discrete Fourier Transform. IEEE Transactions on Acoustics, Speech, and Signal Processing, 25, 235-238. [Google Scholar] [CrossRef
[5] Samar, V.J., Swartz, K.P. and Raghuveer, M.R. (1995) Multiresolution Analysis of Event-Related Potentials by Wavelet Decomposition. Brain and Cognition, 27, 398-438. [Google Scholar] [CrossRef] [PubMed]
[6] Huang, N.E., Shen, Z., Long, S.R., Wu, M.C., Shih, H.H., Zheng, Q., et al. (1998) The Empirical Mode Decomposition and the Hilbert Spectrum for Nonlinear and Non-Stationary Time Series Analysis. Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences, 454, 903-995. [Google Scholar] [CrossRef
[7] Müller, K., Mika, S., Tsuda, K. and Schölkopf, K. (2018) An Introduction to Kernel-Based Learning Algorithms. In: Yu Hu, H. and Hwang, J.N., Eds., Handbook of Neural Network Signal Processing, CRC Press, 4-1-4-40. [Google Scholar] [CrossRef
[8] Lotte, F., Congedo, M., Lécuyer, A., Lamarche, F. and Arnaldi, B. (2007) A Review of Classification Algorithms for EEG-Based Brain-Computer Interfaces. Journal of Neural Engineering, 4, R1-R13. [Google Scholar] [CrossRef] [PubMed]
[9] Krusienski, D.J., Sellers, E.W., McFarland, D.J., Vaughan, T.M. and Wolpaw, J.R. (2008) Toward Enhanced P300 Speller Performance. Journal of Neuroscience Methods, 167, 15-21. [Google Scholar] [CrossRef] [PubMed]
[10] Barachant, A., Bonnet, S., Congedo, M. and Jutten, C. (2012) Multiclass Brain-Computer Interface Classification by Riemannian Geometry. IEEE Transactions on Biomedical Engineering, 59, 920-928. [Google Scholar] [CrossRef] [PubMed]
[11] Rivet, B., Souloumiac, A., Attina, V. and Gibert, G. (2009) xDAWN Algorithm to Enhance Evoked Potentials: Application to Brain-Computer Interface. IEEE Transactions on Biomedical Engineering, 56, 2035-2043. [Google Scholar] [CrossRef] [PubMed]
[12] Congedo, M., Barachant, A. and Bhatia, R. (2017) Riemannian Geometry for EEG-Based Brain-Computer Interfaces; a Primer and a Review. Brain-Computer Interfaces, 4, 155-174. [Google Scholar] [CrossRef
[13] Xiao, X., Xu, M., Jin, J., Wang, Y., Jung, T. and Ming, D. (2020) Discriminative Canonical Pattern Matching for Single-Trial Classification of ERP Components. IEEE Transactions on Biomedical Engineering, 67, 2266-2275. [Google Scholar] [CrossRef] [PubMed]
[14] LeCun, Y., Bengio, Y. and Hinton, G. (2015) Deep learning. Nature, 521, 436-444. [Google Scholar] [CrossRef] [PubMed]
[15] Craik, A., He, Y. and Contreras-Vidal, J.L. (2019) Deep Learning for Electroencephalogram (EEG) Classification Tasks: A Review. Journal of Neural Engineering, 16, Article ID: 031001. [Google Scholar] [CrossRef] [PubMed]
[16] Schirrmeister, R.T., Springenberg, J.T., Fiederer, L.D.J., Glasstetter, M., Eggensperger, K., Tangermann, M., et al. (2017) Deep Learning with Convolutional Neural Networks for EEG Decoding and Visualization. Human Brain Mapping, 38, 5391-5420. [Google Scholar] [CrossRef] [PubMed]
[17] Lawhern, V.J., Solon, A.J., Waytowich, N.R., Gordon, S.M., Hung, C.P. and Lance, B.J. (2018) EEGNet: A Compact Convolutional Neural Network for EEG-Based Brain-Computer Interfaces. Journal of Neural Engineering, 15, Article ID: 056013. [Google Scholar] [CrossRef] [PubMed]
[18] Santamaría-Vázquez, E., Martínez-Cagigal, V., Vaquerizo-Villar, F. and Hornero, R. (2020) EEG-Inception: A Novel Deep Convolutional Neural Network for Assistive ERP-Based Brain-Computer Interfaces. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 28, 2773-2782. [Google Scholar] [CrossRef] [PubMed]
[19] Song, Y., Zheng, Q., Liu, B. and Gao, X. (2023) EEG Conformer: Convolutional Transformer for EEG Decoding and Visualization. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 31, 710-719. [Google Scholar] [CrossRef] [PubMed]
[20] Li, X., Song, D., Zhang, P., Zhang, Y., Hou, Y. and Hu, B. (2018) Exploring EEG Features in Cross-Subject Emotion Recognition. Frontiers in Neuroscience, 12, Article 162. [Google Scholar] [CrossRef] [PubMed]
[21] Li, H., Pan, S.J., Wang, S. and Kot, A.C. (2018) Domain Generalization with Adversarial Feature Learning. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 5400-5409. [Google Scholar] [CrossRef
[22] Sun, B. and Saenko, K. (2016) Deep CORAL: Correlation Alignment for Deep Domain Adaptation. In: Hua, G. and Jégou, H., Eds., Computer VisionECCV 2016 Workshops, Springer, 443-450. [Google Scholar] [CrossRef
[23] Zhang, H., Cisse, M., Dauphin, Y.N., et al. (2017) mixup: Beyond Empirical Risk Minimization. arXiv: 1710.09412.
[24] Arjovsky, M., Bottou, L., Gulrajani, I., et al. (2019) Invariant Risk Minimization. arXiv: 1907.02893.
[25] Ganin, Y., Ustinova, E., Ajakan, H., et al. (2016) Domain-Adversarial Training of Neural Networks. Journal of Machine Learning Research, 17, 1-35.
[26] Shen, X., Liu, X., Hu, X., Zhang, D. and Song, S. (2023) Contrastive Learning of Subject-Invariant EEG Representations for Cross-Subject Emotion Recognition. IEEE Transactions on Affective Computing, 14, 2496-2511. [Google Scholar] [CrossRef
[27] Zhou, K., Liu, Z., Qiao, Y., Xiang, T. and Loy, C.C. (2022) Domain Generalization: A Survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45, 4396-4415. [Google Scholar] [CrossRef] [PubMed]
[28] Li, Y., Zheng, W., Zong, Y., Cui, Z., Zhang, T. and Zhou, X. (2021) A Bi-Hemisphere Domain Adversarial Neural Network Model for EEG Emotion Recognition. IEEE Transactions on Affective Computing, 12, 494-504. [Google Scholar] [CrossRef
[29] Zhong, P., Wang, D. and Miao, C. (2022) EEG-Based Emotion Recognition Using Regularized Graph Neural Networks. IEEE Transactions on Affective Computing, 13, 1290-1301. [Google Scholar] [CrossRef
[30] Chen, P., Gao, Z., Yin, M., Wu, J., Ma, K. and Grebogi, C. (2022) Multiattention Adaptation Network for Motor Imagery Recognition. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 52, 5127-5139. [Google Scholar] [CrossRef
[31] Wang, H., Chen, P., Zhang, M., Zhang, J., Sun, X., Li, M., et al. (2024) EEG-Based Motor Imagery Recognition Framework via Multisubject Dynamic Transfer and Iterative Self-Training. IEEE Transactions on Neural Networks and Learning Systems, 35, 10698-10712. [Google Scholar] [CrossRef] [PubMed]
[32] Li, J., Wang, F., Huang, H., Qi, F. and Pan, J. (2023) A Novel Semi-Supervised Meta Learning Method for Subject-Transfer Brain-Computer Interface. Neural Networks, 163, 195-204. [Google Scholar] [CrossRef] [PubMed]
[33] Hospedales, T.M., Antoniou, A., Micaelli, P. and Storkey, A.J. (2021) Meta-Learning in Neural Networks: A Survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, 5149-5169. [Google Scholar] [CrossRef] [PubMed]
[34] Finn, C., Abbeel, P. and Levine, S. (2017) Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. International Conference on Machine Learning. PMLR 2017, Sydney, 6-11 August 2017, 1126-1135.
[35] Nichol, A., Achiam, J. and Schulman, J. (2018) On First-Order Meta-Learning Algorithms. arXiv: 1803.02999.
[36] Snell, J., Swersky, K. and Zemel, R. (2017) Prototypical Networks for Few-Shot Learning. arXiv: 1703.05175.
[37] Ning, X., Wang, J., Lin, Y., Cai, X., Chen, H., Gou, H., et al. (2024) Metaemotionnet: Spatial-Spectral-Temporal-Based Attention 3-D Dense Network with Meta-Learning for EEG Emotion Recognition. IEEE Transactions on Instrumentation and Measurement, 73, 1-13. [Google Scholar] [CrossRef
[38] Ng, H.W. and Guan, C. (2024) Subject-Independent Meta-Learning Framework Towards Optimal Training of EEG-Based Classifiers. Neural Networks, 172, Article ID: 106108. [Google Scholar] [CrossRef] [PubMed]
[39] Han, J., Bak, S., Kim, J., Choi, W., Shin, D., Son, Y., et al. (2024) META-EEG: Meta-Learning-Based Class-Relevant EEG Representation Learning for Zero-Calibration Brain-Computer Interfaces. Expert Systems with Applications, 238, Article ID: 121986. [Google Scholar] [CrossRef
[40] Pati, A., Mewada, D. and Samanta, D. (2023) Meta-Learning for Subject Adaptation in Low-Data Environments for EEG-Based Motor Imagery Brain-Computer Interfaces. ICLR 2023, Kigali, 1-5 May 2023, 1-6.
[41] Li, S., Wu, H., Ding, L. and Wu, D. (2022) Meta-Learning for Fast and Privacy-Preserving Source Knowledge Transfer of EEG-Based BCIs. IEEE Computational Intelligence Magazine, 17, 16-26. [Google Scholar] [CrossRef
[42] Margaux, P., Emmanuel, M., Sébastien, D., Olivier, B. and Jérémie, M. (2012) Objective and Subjective Evaluation of Online Error Correction during P300-Based Spelling. Advances in Human-Computer Interaction, 2012, Article ID: 578295. [Google Scholar] [CrossRef