State Key Laboratory on Microwave and Digital Communications, Department of Electronic Engineering, Tsinghua University
Show less
History+
Published Online
2008-06-05
PDF
(185KB)
Abstract
The description precision of an excitation signal greatly influences the quality of reconstructed speech in low bit rate vocoders. To improve the reconstruction quality, the DCT_M model is proposed to express the excitation spectral parameter, which transforms the variable length vector to fixed dimension vector through DCT transformation. It then quantizes the fixed length vector using multi-stage vector quantization. Tests show that the proposed method can keep the shape of the entire spectral envelope and reduce model error thus greatly improve the description precision. Test results in the sine excitation linear prediction (SELP) vocoder show that the DCT_M model can improve the naturalness of reconstructed speech, with subjective test score of 65%.
DANG Xiaoyan, TANG Kun.
DCT_M model for excitation parameter in low bit
rate vocoder.
Front. Electr. Electron. Eng., 2008, 3(2): 204-207 DOI:10.1007/s11460-008-0043-1