[1] Li, M. (2024) Research on Speech Emotion Recognition Methods Based on Deep Learning and Feature Fusion. Master's Thesis, Qilu University of Technology, Jinan.
[2] Jin, Q., Li, C., Chen, S., et al. (2015) Speech Emotion Recognition with Acoustic and Lexical Features. 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), South Brisbane, 19-24 April 2015, 4749-4753.
[3] Zhang, S., Zhang, S., Huang, T. and Gao, W. (2018) Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching. IEEE Transactions on Multimedia, 20, 1576-1590.
[4] Tao, J., Chen, J. and Li, Y. (2023) A Survey of Speech Emotion Recognition. Journal of Signal Processing, 39(4), 571-587.
[5] Cheng, S., Luo, X., Li, D., et al. (2022) A Speech Emotion Recognition Model Based on Bidirectional LSTM. Changjiang Information & Communications, 35(7), 19-22.
[6] Mohan, M., Dhanalakshmi, P. and Kumar, R.S. (2023) Speech Emotion Classification Using Ensemble Models with MFCC. Procedia Computer Science, 218, 1857-1868.
[7] Hochreiter, S. and Schmidhuber, J. (1997) Long Short-Term Memory. Neural Computation, 9, 1735-1780.
[8] Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., et al. (2017) Attention Is All You Need. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, 4-9 December 2017, 6000-6010.
[9] Wagner, J., Triantafyllopoulos, A., Wierstorf, H., Schmitt, M., Burkhardt, F., Eyben, F., et al. (2023) Dawn of the Transformer Era in Speech Emotion Recognition: Closing the Valence Gap. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45, 10745-10759.
[10] Tang, X., Lin, Y., Dang, T., Zhang, Y. and Cheng, J. (2024) Speech Emotion Recognition via CNN-Transformer and Multidimensional Attention Mechanism. arXiv:2403.04743.