Speech enhancement through voice activity detection using speech absence probability based on Teager energy
Yun-sik Park , Sang-min Lee
Journal of Central South University ›› 2013, Vol. 20 ›› Issue (2) : 424 -432.
Speech enhancement through voice activity detection using speech absence probability based on Teager energy
In this work, a novel voice activity detection (VAD) algorithm that uses speech absence probability (SAP) based on Teager energy (TE) was proposed for speech enhancement. The proposed method employs local SAP (LSAP) based on the TE of noisy speech as a feature parameter for voice activity detection (VAD) in each frequency subband, rather than conventional LSAP. Results show that the TE operator can enhance the ability to discriminate speech and noise and further suppress noise components. Therefore, TE-based LSAP provides a better representation of LSAP, resulting in improved VAD for estimating noise power in a speech enhancement algorithm. In addition, the presented method utilizes TE-based global SAP (GSAP) derived in each frame as the weighting parameter for modifying the adopted TE operator and improving its performance. The proposed algorithm was evaluated by objective and subjective quality tests under various environments, and was shown to produce better results than the conventional method.
speech enhancement / Teager energy / speech absence probability / voice activity detection
| [1] |
TIA/EIA/IS-127.Enhanced variable rate codec, speech service option 3 for wideband spread spectrum digital systems [R], 1996EqglewoodTIA |
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
KARRAY L, MOKBEL C, MONNE J. Solutions for robust speech/non-speech detection in wireless environment [C]// Proceedings of IVTTA, Torino, 1988: 166–170. |
| [7] |
RABINER L R, SAMBUR M R. Voiced-unvoiced-silence detection using the Itakura LPC distance measure [C]// Proc IEEE Int Conf Acoust Speech Signal Process, Hartford, 1977: 323–326. |
| [8] |
|
| [9] |
SOHN J, SUNG W. A voice activity detector employing soft decision based noise spectrum adaptation [C]// Proc. IEEE Int Conf Acoustics, Speech, and Signal Processing, Seattle, 1998: 365–368. |
| [10] |
|
| [11] |
WANG K C, TSAI Y H. Voice activity detection algorithm with low signal-to-noise ratios based on spectrum entropy [C]// Second International Symposium on Universal Communication 2008, Osaka, 2008: 423–428. |
| [12] |
|
| [13] |
|
| [14] |
|
/
| 〈 |
|
〉 |