|
[1]
|
Reynolds, D.A., Quatieri, T.F. and Unn, D.R. (2000) Speaker Verification Using Adapted Gaussian Mixture Models. Digital Signal Processing, 10, 19-41. [Google Scholar] [CrossRef]
|
|
[2]
|
Trabelsi, I., Ayed, D.B. and Ellouze, N. (2016) Comparison between GMM-SVM Sequence Kernel and GMM: Application to Speech Emotion Recognition. Journal of Engineering Science and Technology, 11, 1221-1233.
|
|
[3]
|
Kanagasundaram, A., Vogt, R., Dean, D., et al. (2011) i-Vector Based Speaker Recognition on Short Utterances. INTERSPEECH, Florence, 27-31 August 2011, 2341-2344. [Google Scholar] [CrossRef]
|
|
[4]
|
Dehak, N., Kenny, P.J., Dehak, R., et al. (2011) Front-End Factor Analysis for Speaker Verification. IEEE Transactions on Audio Speech and Language Processing, 19, 788-798. [Google Scholar] [CrossRef]
|
|
[5]
|
Snyder, D., Garcia-Romero, D., Povey, D., et al. (2017) Deep Neural Network Embeddings for Text-Independent Speaker Verification. INTERSPEECH, Stockholm, 20-24 August 2017, 999-1003. [Google Scholar] [CrossRef]
|
|
[6]
|
Snyder, D., Garcia-Romero, D., Sell, G., et al. (2018) X-Vectors: Robust DNN Embeddings for Speaker Recognition. Proc. ICASSP, Calgary, 15-20 April 2018, 5329-5333. [Google Scholar] [CrossRef]
|
|
[7]
|
Variani, E., Lei, X., Mcdermott, E., et al. (2014) Deep Neural Networks for Small Footprint Text-Dependent Speaker Verification. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Florence, 4-9 May 2014, 4052-4056. [Google Scholar] [CrossRef]
|
|
[8]
|
Heigold, G., Moreno, I., Bengio, S., et al. (2016) End-To end Text-Dependent Speaker Verification. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, 20-25 March 2016, 5115-5119. [Google Scholar] [CrossRef]
|
|
[9]
|
Liu, W., Wen, Y., Yu, Z., et al. (2016) Large-Margin Softmax Loss for Convolutional Neural Networks. The 33rd International Conference on Machine Learning (ICML 2016), New York, 19-24 June 2016, 507-516.
|
|
[10]
|
Liu, W., Wen, Y., Yu, Z., et al. (2017) Sphereface: Deep Hypersphere Embedding for Face Recognition. The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 21-26 July 2017, 212-220. [Google Scholar] [CrossRef]
|
|
[11]
|
Wang, F., Xiang, X., Cheng, J., et al. (2017) NormFace: L2 Hypersphere Embedding for Face Verification. Proceedings of the 25th ACM International Conference on Multimedia, Mountain View, 23-27 October 2017, 1041-1049. [Google Scholar] [CrossRef]
|
|
[12]
|
Wang, H., Wang, Y., Zhou, Z., et al. (2018) CosFace: Large Margin Cosine Loss for Deep Face Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, 18-23 June 2018, 5265-5274. [Google Scholar] [CrossRef]
|
|
[13]
|
Wang, F., Liu, W., Liu, H., et al. (2018) Additive Margin Softmax for Face Verification. IEEE Signal Processing Letters, 25, 926-930. [Google Scholar] [CrossRef]
|
|
[14]
|
Huang, Z., Wang, S. and Yu, K. (2018) Angular Softmax for Short Duration Text-Independent Speaker Verification. INTERSPEECH, Salt Lake City, 18-23 June 2018, 3623-3627. [Google Scholar] [CrossRef]
|
|
[15]
|
Yu, Y.Q., Fan, L. and Li, W.J. (2019) Ensemble Additive Margin Softmax for Speaker Verification. 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, 12-17 May 2019, 6046-6050. [Google Scholar] [CrossRef]
|
|
[16]
|
Li, Y., Gao, F., Ou, Z., et al. (2019) Angular Softmax Loss for End-to-End Speaker Verification. 2018 11th International Symposium on Chinese Spoken Language Processing (ISCSLP), Taipei, 26-29 November 2018, 190-194. [Google Scholar] [CrossRef]
|
|
[17]
|
Bredin, H. (2017) Tristounet: Triplet Loss for Speaker Turn Embedding. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, 5-9 March 2017, 5430-5434. [Google Scholar] [CrossRef]
|
|
[18]
|
He, K., Zhang, X., Ren, S., et al. (2016) Deep Residual Learning for Image Recognition. IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, 27-30 June 2016, 770-778. [Google Scholar] [CrossRef]
|
|
[19]
|
Bu, H., Du, J., Na, X., et al. (2017) AISHELL-1: An Open-Source Mandarin Speech Corpus and a Speech Recognition Baseline. O-COCOSDA, Seoul, 1-3 November 2017, 1-5. [Google Scholar] [CrossRef]
|