Blind separation of speech signals based on wavelet transform and independent component analysis
Xiao Wu, Jingjing He, Shijiu Jin, Antao Xu, Weikui Wang
Transactions of Tianjin University, 2010, Vol. 16, Issue 2: 123-128.
Speech signals were separated in the frequency domain using the discrete wavelet transform (DWT) and independent component analysis (ICA). First, the mixed speech signals were decomposed into frequency subbands by DWT, and the subband signals were separated by ICA in each wavelet domain. Then, the permutation and scaling problems of frequency-domain blind source separation (BSS) were solved by exploiting the correlation between adjacent bins of the speech signals. Finally, the source signals were reconstructed from single branches. Experiments were carried out with 2 sources and 6 microphones, using speech signals sampled at 40 kHz. The microphones were aligned, with the 2 sources in front of them on the left and right. The separation of speech from one male and one female speaker lasted 2.5 s. The experiments show that the new method outperforms ICA alone, improving the signal-to-noise ratio by approximately 1 dB.
wavelet transform / independent component analysis / blind source separation
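The permutation step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact criterion used, so the following assumes a common form of the idea: ICA outputs in one band are reordered so that their amplitude envelopes correlate best with the already aligned outputs of the adjacent band. The function names (`envelope_correlation`, `best_permutation`, `align_bands`) and the synthetic envelopes are hypothetical and are not taken from the paper.

```python
import itertools
import numpy as np

def envelope_correlation(a, b):
    """Pearson correlation between two amplitude envelopes (1-D arrays)."""
    a = a - a.mean()
    b = b - b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0

def best_permutation(ref_env, cur_env):
    """Ordering of cur_env rows that best matches the rows of ref_env."""
    n = ref_env.shape[0]
    def score(perm):
        return sum(envelope_correlation(ref_env[i], cur_env[p])
                   for i, p in enumerate(perm))
    # Exhaustive search; cheap for a small number of sources (2 in the abstract).
    return max(itertools.permutations(range(n)), key=score)

def align_bands(envelopes):
    """envelopes has shape (bands, sources, frames).

    Returns one permutation per band, chained from band to adjacent band.
    """
    perms = [tuple(range(envelopes.shape[1]))]
    ref = envelopes[0]
    for band in envelopes[1:]:
        perm = best_permutation(ref, band)
        perms.append(perm)
        ref = band[list(perm)]  # the aligned band is the next reference
    return perms

# Synthetic data (an assumption for illustration): two source envelopes
# observed in four bands with small perturbations, band 2 swapped on purpose.
rng = np.random.default_rng(0)
src = np.abs(rng.standard_normal((2, 200)))
bands = np.stack([src + 0.1 * np.abs(rng.standard_normal((2, 200)))
                  for _ in range(4)])
bands[2] = bands[2][::-1]

perms = align_bands(bands)
print(perms)
```

In this toy setup the swapped band is the only one assigned a non-identity permutation. Chaining the reference from band to band follows the "adjacent bins" wording of the abstract; a real system would also need to handle the scaling ambiguity, which this sketch leaves out.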