Adaptive bands filter bank optimized by genetic algorithm for robust speech recognition system

Li-xia Huang , G. Evangelista , Xue-ying Zhang

Journal of Central South University ›› 2011, Vol. 18 ›› Issue (5) : 1595 -1601.

PDF
Journal of Central South University ›› 2011, Vol. 18 ›› Issue (5) : 1595 -1601. DOI: 10.1007/s11771-011-0877-1
Article

Adaptive bands filter bank optimized by genetic algorithm for robust speech recognition system

Author information +
History +
PDF

Abstract

Perceptual auditory filter banks such as Bark-scale filter bank are widely used as front-end processing in speech recognition systems. However, the problem of the design of optimized filter banks that provide higher accuracy in recognition tasks is still open. Owing to spectral analysis in feature extraction, an adaptive bands filter bank (ABFB) is presented. The design adopts flexible bandwidths and center frequencies for the frequency responses of the filters and utilizes genetic algorithm (GA) to optimize the design parameters. The optimization process is realized by combining the front-end filter bank with the back-end recognition network in the performance evaluation loop. The deployment of ABFB together with zero-crossing peak amplitude (ZCPA) feature as a front process for radial basis function (RBF) system shows significant improvement in robustness compared with the Bark-scale filter bank. In ABFB, several sub-bands are still more concentrated toward lower frequency but their exact locations are determined by the performance rather than the perceptual criteria. For the ease of optimization, only symmetrical bands are considered here, which still provide satisfactory results.

Keywords

perceptual filter banks / bark scale / speaker independent speech recognition systems / zero-crossing peak amplitude / genetic algorithm

Cite this article

Download citation ▾
Li-xia Huang, G. Evangelista, Xue-ying Zhang. Adaptive bands filter bank optimized by genetic algorithm for robust speech recognition system. Journal of Central South University, 2011, 18(5): 1595-1601 DOI:10.1007/s11771-011-0877-1

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

AtalB. S.. Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification [J]. Journal of the Acoustical Society of America, 1974, 55(6): 1304-1312

[2]

DavisS., MermelsteinP.. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences [J]. IEEE Transaction on Acoustics, Speech and Signal Processing, 1980, 28(4): 357-366

[3]

KimD. S., LeeS. Y., KilR. M.. Auditory processing of speech signal for robust speech recognition in real-world noisy environments [J]. IEEE Transaction on Speech and Audio Processing, 1999, 7(1): 55-69

[4]

JuangB. H., RabinerL. R.. Hidden Markov models for speech recognition [J]. Technometrics, 1991, 33(3): 251-272

[5]

BroomheadD. S., LoweD.. Multivariable functional interpolation and adaptive networks [J]. Complex Systems, 1988, 2(3): 321-355

[6]

SayoudH., OuamourS.. Speaker clustering of stereo audio documents based on sequential gathering process [J]. Journal of Information Hiding and Multimedia Signal Processing, 2010, 1(4): 344-360

[7]

HandelS.Listening: An introduction to the perception of auditory events [M], 1993, Massachusetts, MIT Press: 461-546

[8]

StropeB., AlwanA.. A model of dynamic auditory perception and its application to robust word recognition [J]. IEEE Transaction on Speech and Audio Processing, 1997, 5(5): 451-464

[9]

HolmbergM., GelbartD., HemmertW.. Automatic speech recognition with an adaptation model motivated by auditory processing [J]. IEEE Transaction on Audio, Speech, Language Processing, 2006, 14(1): 44-49

[10]

ZhangX.-y., HuangL.-x., EvangelistaG.. Warped filter banks used in noisy speech recognition [C]. Proceedings of Innovative Computing, Information and Control, 2009, Kaohsiung, IEEE: 1385-1388

[11]

HuangL.-x., ZhangX.-y., EvangelistaG.. Speaker independent recognition on OLLO French corpus by using different features [C]. Proceedings of Pervasive Computing, Signal Processing and Applications, 2010, Harbin, IEEE: 332-335

[12]

HuangH.-c., PanJ.-s., LuZ.-m., SunS.-h., HangH.-ming.. Vector quantization based on genetic simulated annealing [J]. Signal Processing, 2001, 81(7): 1513-1523

[13]

LiX., CaoG.-y., ZhuX.-j., WeiDong.. Identification and analysis based on genetic algorithm for proton exchange membrane fuel cell stack [J]. Journal of Central South University, 2006, 13(4): 428-431

[14]

YuS.-y., KuangS.-qiong.. Fuzzy adaptive genetic algorithm based on auto-regulating fuzzy rules [J]. Journal of Central South University, 2010, 17(1): 123-128

[15]

GosselinL., Tye-GingrasM., Mathieu-PotvinF.. Review of utilization of genetic algorithms in heat transfer problems [J]. International Journal of Heat and Mass Transfer, 2009, 52(9/10): 2169-2188

[16]

PrakotpolD., SrinophakunT.. GAPinch: genetic algorithm toolbox for water pinch technology [J]. Chemical Engineering and Processing, 2004, 43(2): 203-217

AI Summary AI Mindmap
PDF

95

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/