Speech enhancement based on modified a priori SNR estimation

Yu FANG, Gang LIU, Jun GUO

PDF(116 KB)
PDF(116 KB)
Front. Electr. Electron. Eng. ›› 2011, Vol. 6 ›› Issue (4) : 542-546. DOI: 10.1007/s11460-011-0181-8
RESEARCH ARTICLE
RESEARCH ARTICLE

Speech enhancement based on modified a priori SNR estimation

Author information +
History +

Abstract

To solve the frame delay problem and match the previous frame, Plapous et al. [IEEE Transactions on Audio, Speech, and Language Processing, 2006, 14(6): 2098–2108] introduced a novel approach called two-step noise reduction (TSNR) technique to improve the performance of the speech enhancement system. However, TSNR approach results in spectral peaks of short duration and the broken spectral outlier, which degrade the spectral characteristics of the speech. To solve this problem, a cepstral smoothing step is added in order to remove these spectral peaks brought by TSNR approach. Theory analysis shows that the proposed approach can effectively smooth the spectral peaks and keep the spectral outlier so as to protect the speech characteristics. Experiment results also show that the proposed approach can bring significant improvement compared to decision-directed (DD) and TSNR approaches, especially in non-stationary noisy environments.

Keywords

speech enhancement / decision-directed (DD) / two-step noise reduction (TSNR) / signal-to-noise ratio (SNR) estimation

Cite this article

Download citation ▾
Yu FANG, Gang LIU, Jun GUO. Speech enhancement based on modified a priori SNR estimation. Front Elect Electr Eng Chin, 2011, 6(4): 542‒546 https://doi.org/10.1007/s11460-011-0181-8

References

[1]
Ephraim Y, Malah D. Speech enhancement using a minimum mean square error short time spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1984, 32(6): 1109-1121
CrossRef Google scholar
[2]
Boll S F. Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1979, 27(2): 113-120
CrossRef Google scholar
[3]
Cohen I. Relaxed statistical model for speech enhancement and a priori SNR estimation. IEEE Transactions on Speech and Audio Processing, 2005, 13(5): 870-881
CrossRef Google scholar
[4]
Cohen I. Speech enhancement using a noncausal a priori SNR estimator. IEEE Signal Processing Letters, 2004, 11(9): 725-728
CrossRef Google scholar
[5]
Plapous C, Marro C, Scalart P. Improved signal-to-noise ratio estimation for speech enhancement. IEEE Transactions on Audio, Speech, and Language Processing, 2006, 14(6): 2098-2108
[6]
Mauler D, Gerkmann T, Martin R. An analysis of quefrency selective temporal smoothing of the cepstrum in speech enhancement. In: Proceedings of the 11th International Workshop on Acoustic Echo and Noise Control. 2008, 1-4
[7]
Noll A M. Cepstrum pitch estimation. Journal of the Acoustical Society of America, 1967, 41(2): 293-309
CrossRef Pubmed Google scholar
[8]
Cappe O. Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor. IEEE Transactions on Speech and Audio Processing, 1994, 2(2): 345-349
CrossRef Google scholar
[9]
Garofolo J S, Lamel L F, Fisher W M, Fiscus J G, Pallett D S, Dahlgren N L, Zue V. DARPA TIMIT Acoustic-phonetic continuous speech corpus. NIST Speech Disc1-1.1, 1993
[10]
Varga A, Steeneken H J M, Tomlinson M, Jones D. The NOISEX-92 study on the effect of additive noise on automatic speech recognition. The NOISEX-92 CD-ROMs, 1992
[11]
Deller J R Jr, Hansen J H L, Proakis J G. Discrete-Time Processing of Speech Signals. 2nd ed. New York: IEEE Press, 2000

Acknowledgements

This work was partially supported by the National Natural Science Foundation of China (Grant Nos. 61005004, 61175011, and 61171193), the Next-Generation Broadband Wireless Mobile Communications Network Technology Key Project (No. 2011ZX03002-005-01), the 111 project (No. B08004), and Scientific Research Foundation for the Returned Overseas Chinese Scholars, State Education Ministry.

RIGHTS & PERMISSIONS

2014 Higher Education Press and Springer-Verlag Berlin Heidelberg
PDF(116 KB)

Accesses

Citations

Detail

Sections
Recommended

/