标题:
与文本无关的单训练样本特征点提取研究The Research of Text-Independent Feature Extraction Based on Single Training Sample
作者:
郭建敏
关键字:
特征提取, 线性预测编码, Mel频率倒谱系数, 局部归一化倒谱系数, 小波包变换Feature Extraction, Linear Predictive Coding Cepstral, Mel-Frequency Cepstral Coefficients, LNCC, WPT
期刊名称:
《Computer Science and Application》, Vol.6 No.6, 2016-06-30
摘要:
现有的说话人识别是基于语音的线性预测编码(LPCC)、Mel频率倒谱系数(MFCC)、局部归一化倒谱系数和小波包变换等特征,这些特征对环境噪声都比较敏感。针对上述问题,本文提出了一种与文本无关的单训练样本的特征提取方法。该方法提取的语音特征能够充分反映说话人的基本发声特性,可以很好的将不同的说话者区分开。本文列出了以上四种特征提取方法在但语音训练样本上对于不同说话者的识别效果,也将其与本文的方法进行了比较。对英文与汉语语音数据库的仿真实验表明,该特征提取方法可以实现单训练样本下的说话人识别中对于特征的提取,而且在单样本识别中会有相对好的效果。
The existing speaker identification are based on Linear Predictive Coding Cepstral (LPCC) coeffi-cients, Mel-Frequency Cepstral Coefficients (MFCC), local normalized cepstral coefficients (LNCC) and wavelet packet transform (WPT) method; these features are sensitive to noisy and environmental sounds. This paper describes a novel robust text-independent feature extraction method using single training sample. In the proposed method, the features can reflect a person’s basic phonation characteristic and distinguish different speakers. This paper introduces the four methods in single training sample and compares them with the proposed method. Experimental results on speech databases in English and Chinese demonstrate that the proposed approach can implement feature extraction in speaker identification based on single training sample, and yields a better performance in single training sample.