Duration-Distribution-Based HMM for Speech Recognition
WANG Zuo-ying, XIAO Xi
Department of Electronic Engineering, Tsinghua University, Beijing 100084, China;
Published: 05 Mar 2006
Issue Date: 05 Mar 2006
Abstract
To overcome the shortcomings of duration modeling in the homogeneous hidden Markov model (HMM) for speech recognition, this paper proposes a duration-distribution-based HMM (DDBHMM) built on a formal definition of a left-to-right inhomogeneous Markov model. The model is shown to be defined equivalently by either the state duration or the state transition probability. Speaker-independent continuous speech recognition experiments show that modeling only the state duration in DDBHMM yields a significant improvement (a 17.8% error rate reduction) over the classical HMM. These properties make DDBHMM promising for many aspects of speech modeling, including state duration, speed variation, speech discontinuity, and interframe correlation.
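The equivalence between a duration description and a transition description can be illustrated with the standard hazard-rate relation for a single left-to-right state. The sketch below is not taken from the paper: the function names and the toy distribution are hypothetical, and it assumes a finite duration distribution over 1, 2, ..., T frames. It converts a duration distribution into duration-dependent "stay" probabilities and back, and shows that a constant stay probability (the homogeneous case) forces a geometric duration distribution.

```python
def duration_to_stay_probs(duration_pmf):
    """Convert P(D = k), k = 1..T, into stay probabilities.

    stay[k-1] is the probability of remaining in the state given that
    k frames have already been spent there (1 minus the hazard rate).
    """
    stay = []
    survival = sum(duration_pmf)  # P(D >= 1)
    for p in duration_pmf:
        if survival <= 0.0:
            stay.append(0.0)
        else:
            # Clamp to guard against tiny negative values from rounding.
            stay.append(max(0.0, 1.0 - p / survival))
        survival -= p  # now P(D >= k + 1)
    return stay


def stay_probs_to_duration(stay):
    """Inverse mapping: P(D = k) = (1 - stay_k) * prod_{j<k} stay_j."""
    pmf = []
    reach = 1.0  # probability of having stayed for all earlier frames
    for a in stay:
        pmf.append(reach * (1.0 - a))
        reach *= a
    return pmf


# Hypothetical toy duration distribution over 1..4 frames.
toy_pmf = [0.1, 0.2, 0.3, 0.4]
toy_stay = duration_to_stay_probs(toy_pmf)
round_trip = stay_probs_to_duration(toy_stay)

# A constant stay probability, as in a homogeneous HMM, yields a
# geometric duration distribution (truncated here to three frames).
geometric = stay_probs_to_duration([0.5, 0.5, 0.5])
```

In this sketch the round trip recovers the toy distribution up to floating-point error, which is the sense in which the two descriptions carry the same information for a single state.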
WANG Zuo-ying, XIAO Xi. Duration-Distribution-Based HMM for Speech Recognition. Front. Electr. Electron. Eng., 2006, 1(1): 26‒30 https://doi.org/10.1007/s11460-005-0010-z