Duration-Distribution-Based HMM for Speech Recognition

Front. Electr. Electron. Eng., 2006, Vol. 1, Issue 1: 26-30. DOI: 10.1007/s11460-005-0010-z

Abstract

To overcome the shortcomings of duration modeling in the homogeneous hidden Markov model (HMM) for speech recognition, this paper proposes a duration-distribution-based HMM (DDBHMM), built on a formal definition of a left-to-right inhomogeneous Markov model. Such a model is shown to be defined equivalently by either the state duration or the state transition probability. Speaker-independent continuous speech recognition experiments show that modeling only the state duration in DDBHMM yields a significant improvement (a 17.8% error rate reduction) over the classical HMM. These properties make DDBHMM promising for many aspects of speech modeling, such as state duration, speed variation, speech discontinuity, and interframe correlation.
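The equivalence between a state duration description and a state transition description can be illustrated with a small sketch. It is not taken from the paper: it uses the standard hazard-function relation for a single left-to-right state, and the function names (`duration_to_stay_probs`, `stay_probs_to_duration`) are hypothetical. It assumes a duration distribution P(d) over d = 1..D frames that sums to 1, and a stay probability a(tau) that depends on the number of frames tau already spent in the state.

```python
def duration_to_stay_probs(duration_pmf):
    """Convert P(d), d = 1..D, into stay probabilities a(tau), tau = 1..D.

    a(tau) is the probability of remaining in the state for another frame,
    given that tau frames have already been spent there (assumed model).
    """
    stay = []
    survival = 1.0  # P(duration >= tau)
    for p in duration_pmf:
        # Chance of leaving after exactly tau frames, given tau was reached.
        leave = min(1.0, p / survival) if survival > 0 else 1.0
        stay.append(1.0 - leave)
        survival -= p
    return stay


def stay_probs_to_duration(stay_probs):
    """Inverse mapping: recover P(d) from the stay probabilities a(tau)."""
    pmf = []
    reach = 1.0  # probability of having stayed long enough to reach tau frames
    for a in stay_probs:
        pmf.append(reach * (1.0 - a))
        reach *= a
    return pmf


if __name__ == "__main__":
    pmf = [0.1, 0.6, 0.3]  # illustrative values only, not from the paper
    stay = duration_to_stay_probs(pmf)
    print(stay)
    print(stay_probs_to_duration(stay))
```

In this sketch, a constant stay probability (the homogeneous case) can only produce a geometric duration distribution, P(d) = a^(d-1)(1 - a), whereas a stay probability that varies with tau can reproduce an arbitrary P(d). That is one way to read the limitation of the classical HMM that the abstract refers to.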

Keywords

duration, speech recognition, DDBHMM
