Latent discriminative representation learning for speaker recognition

Duolin HUANG; Qirong MAO; Zhongchen MA; Zhishen ZHENG; Sidheswar ROUTRYAR; Elias-Nii-Noi OCQUAYE

doi:10.1631/FITEE.1900690

PDF(721 KB)

Front. Inform. Technol. Electron. Eng ›› 2021, Vol. 22 ›› Issue (5) : 697-708. DOI: 10.1631/FITEE.1900690

Orginal Article

Latent discriminative representation learning for speaker recognition

Author information +

History +

Abstract

Extracting discriminative speaker-specific representations from speech signals and transforming them into fixed length vectors are key steps in speaker identification and verification systems. In this study, we propose a latent discriminative representation learning method for speaker recognition. We mean that the learned representations in this study are not only discriminative but also relevant. Specifically, we introduce an additional speaker embedded lookup table to explore the relevance between different utterances from the same speaker. Moreover, a reconstruction constraint intended to learn a linear mapping matrix is introduced to make representation discriminative. Experimental results demonstrate that the proposed method outperforms state-of-the-art methods based on the Apollo dataset used in the Fearless Steps Challenge in INTERSPEECH2019 and the TIMIT dataset.

Keywords

Speaker recognition / Latent discriminative representation learning / Speaker embedding lookup table / Linear mapping matrix

Cite this article

EndNote

Ris (Procite)

Bibtex

Download citation ▾

Duolin HUANG, Qirong MAO, Zhongchen MA, Zhishen ZHENG, Sidheswar ROUTRYAR, Elias-Nii-Noi OCQUAYE. Latent discriminative representation learning for speaker recognition. Front. Inform. Technol. Electron. Eng, 2021, 22(5): 697‒708 https://doi.org/10.1631/FITEE.1900690