基于无语音概率的语音增强算法
Speech Enhancement Algorithm Combining Speech Absence Probability
摘要: 本文的研究工作主要是在幅度平方谱最小均方估计器的基础上提出了一种新的算法。由于带噪语音的统计模型中语音存在不确定性,统一对语音信号进行处理必然会造成语音成分的丢失,从而影响语音增强的性能,因此,本文主要研究和估计出每个信号频点的无语音概率,然后与幅度平方谱最小均方误差算法的增益函数相结合,推导出一个全新的增益函数。最后通过实验仿真可以看出,本文提出的算法能够明显的改善语音质量,提高语音的可懂度。
Abstract: The research work of this paper is mainly on the basis of the amplitude squared spectrum least mean square estimator and proposes a new algorithm. Due to the uncertainty of the speech in the statistical model of noisy speech, the unified processing of speech signals will inevitably result in the loss of speech components, which will affect the performance of speech enhancement. Therefore, this paper mainly studies and estimates the frequency of each signal. The speech probability is then combined with the gain function of the squared spectrum least mean square error algorithm to derive a new gain function. Finally, we can see through the experimental simulation, the algorithm proposed in this paper can significantly improve the voice quality and improve the intelligibility of the voice.
文章引用:韩蕊蕊, 高颖, 陈晨. 基于无语音概率的语音增强算法[J]. 无线通信, 2018, 8(4): 141-147. https://doi.org/10.12677/HJWC.2018.84016

参考文献

[1] Lu, Y. and Loizou, P.C. (2011) Estimators of the Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty. IEEE Transactions on Audio, Speech, and Language Processing, 19, 1123-1137.
[Google Scholar] [CrossRef
[2] Diethom, E.J. (2000) Subband Noise Reduction Methods for Speech Enhancement. Acoustic Signal Processing for Telecommunication. Springer US, New York, 155-178.
[3] Xia, B. and Bao, C. (2014) Wiener Filtering Based Speech Enhancement with Weighted Denoising Auto-Encoder and Noise Classification. Speech Communication, 60, 13-29.
[Google Scholar] [CrossRef
[4] 卢志强. 基于新型先验信噪比估计的语音增强算法的对比研究[D]: [硕士学位论文]. 长沙: 湖南大学.
[5] Papoulis, A. and Pillai, S.U. (2002) Probability, Random Variables and Stochastic Processes with Errata Sheet. McGraw-Hill Education, New York, 31.
[6] Huan, Z. (2014) A New Soft Masking Method for Speech Enhancement in the Frequency Domain. Elektronika ir Elektrotechnika, 20, 1392-1215.
[7] Cohen, I. (2003) Noise Spectrum Estimation in Adverse Environments: Improved Minima Contorlled Recursive Averraging. IEEE Transaction on Speech and Audio Processing, 11, 466-475.
[Google Scholar] [CrossRef
[8] Ephraim, Y. and Malah, D. (1984) Speech Enhancement Using a Minimum-Mean Squared Error Short-Time Spectral Amplitude Estimator. IEEE Transactions on Acoustics, Speech and Signal Processing, 32, 1109-1121.
[Google Scholar] [CrossRef
[9] Taal, C.H., Hendriks, R.C., Heusdens, R., et al. (2011) An Algorithm for Intelligibility Prediction of Time-Frequency Weighted Noisy Speech. IEEE Transactions on Audio Speech & Language Processing, 19, 2125-2136.
[Google Scholar] [CrossRef
[10] Abramson, A. and Cohen, I. (2010) Simultaneous Detection and Estimation Approach for Speech Enhancement. IEEE Transactions on Audio Speech & Language Processing, 15, 2348-2359.
[Google Scholar] [CrossRef