Latent source-specific generative factor learning formonaural speech separation using weighted-factor autoencoder

Jing-jing CHEN , Qi-rong MAO , You-cai QIN , Shuang-qing QIAN , Zhi-shen ZHENG

Front. Inform. Technol. Electron. Eng ›› 2020, Vol. 21 ›› Issue (11) : 1639 -1650.

PDF (1459KB)
Front. Inform. Technol. Electron. Eng ›› 2020, Vol. 21 ›› Issue (11) : 1639 -1650. DOI: 10.1631/FITEE.2000019
Orginal Article
Orginal Article

Latent source-specific generative factor learning formonaural speech separation using weighted-factor autoencoder

Author information +
History +
PDF (1459KB)

Abstract

Much recent progress in monaural speech separation (MSS) has been achieved through a series of deep learning architectures based on autoencoders, which use an encoder to condense the input signal into compressed features and then feed these features into a decoder to construct a specific audio source of interest. However, these approaches can neither learn generative factors of the original input for MSS nor construct each audio source in mixed speech. In this study, we propose a novel weighted-factor autoencoder (WFAE) model for MSS, which introduces a regularization loss in the objective function to isolate one source without containing other sources. By incorporating a latent attention mechanism and a supervised source constructor in the separation layer, WFAE can learn source-specific generative factors and a set of discriminative features for each source, leading to MSS performance improvement. Experiments on benchmark datasets show that our approach outperforms the existing methods. In terms of three important metrics, WFAE has great success on a relatively challenging MSS case, i.e., speaker-independent MSS.

Keywords

Speech separation / Generative factors / Autoencoder / Deep learning

Cite this article

Download citation ▾
Jing-jing CHEN, Qi-rong MAO, You-cai QIN, Shuang-qing QIAN, Zhi-shen ZHENG. Latent source-specific generative factor learning formonaural speech separation using weighted-factor autoencoder. Front. Inform. Technol. Electron. Eng, 2020, 21(11): 1639-1650 DOI:10.1631/FITEE.2000019

登录浏览全文

4963

注册一个新账户 忘记密码

References

RIGHTS & PERMISSIONS

Zhejiang University and Springer-Verlag GmbH Germany, part of Springer Nature

AI Summary AI Mindmap
PDF (1459KB)

Supplementary files

FITEE-1639-20007-JJC_suppl_1

FITEE-1639-20007-JJC_suppl_2

717

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/