MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology

Ran Zhou, Shuai Zhao, Mingming Luo, Xin Meng, Jie Ma, Jianfei Liu

Optoelectronics Letters ›› 2024, Vol. 20 ›› Issue (4) : 222-227. DOI: 10.1007/s11801-024-3167-5
Article

MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology

Author information +
History +

Abstract

The distributed acoustic sensing technology was used for real-time speech reproduction and recognition, in which the voiceprint can be extracted by the Mel frequency cepstral coefficient (MFCC) method. A classic ancient Chinese poem “You Zi Yin”, also called “A Traveler’s Song”, was analyzed both in time and frequency domains, where its real-time reproduction was achieved with a 116.91 ms time delay. The smaller scaled MFCC0 at 1/12 of MFCC matrix was taken as a feature vector of each line against the ambient noise, which provides a recognition method via cross-correlation among the 6 original and recovered verse pairs. The averaged cross-correlation coefficient of the matching pairs is calculated to be 0.580 6 higher than 0.188 3 of the nonmatched pairs, promising an accurate and fast method for real-time speech reproduction and recognition over a passive optical fiber.

Cite this article

Download citation ▾
Ran Zhou, Shuai Zhao, Mingming Luo, Xin Meng, Jie Ma, Jianfei Liu. MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology. Optoelectronics Letters, 2024, 20(4): 222‒227 https://doi.org/10.1007/s11801-024-3167-5

References

[[1]]
Wittje R. The electrical imagination: sound analogies, equivalent circuits, and the rise of electroacoustics, 1863–1939[J]. Osiris, 2013, 28(1): 40-63,
CrossRef Google scholar
[[2]]
Huang Y, Zhou X. Non-reciprocal sound transmission in electro-acoustic systems with time-modulated circuits[J]. Acta mechanica solida sinica, 2022, 35(6): 940-948,
CrossRef Google scholar
[[3]]
Sherif M M, Khakimova E M, Tanks J, et al.. Cyclic flexural behavior of hybrid SMA/steel fiber reinforced concrete analyzed by optical and acoustic techniques[J]. Composite structures, 2018, 201: 248-260,
CrossRef Google scholar
[[4]]
Joe H E, Yun H, Jo S H, et al.. A review on optical fiber sensors for environmental monitoring[J]. International journal of precision engineering and manufacturing-green technology, 2018, 5(1): 173-191,
CrossRef Google scholar
[[5]]
Hubbard P G, Xu J, Zhang S, et al.. Dynamic structural health monitoring of a model wind turbine tower using distributed acoustic sensing (DAS)[J]. Journal of civil structural health monitoring, 2021, 11(3): 833-849,
CrossRef Google scholar
[[6]]
FERNANDZE-RUIZ M R, SOTO M A, WILLIAMS E F, et al. Distributed acoustic sensing for seismic activity monitoring[J]. APL photonics, 2020, 5(3).
[[7]]
Moccia M, Pisco M, Cutolo A, et al.. Op-to-acoustic behavior of coated fiber Bragg gratings[J]. Optics express, 2011, 19(20): 18842-18860,
CrossRef Pubmed Google scholar
[[8]]
Macia-Sanahuja C, Lamela H, Garcia-Souto J A. Fiber optic interferometric sensor for acoustic detection of partial discharges[J]. Journal of optical technology, 2007, 74(2): 122-126,
CrossRef Google scholar
[[9]]
Xiong J, Wang Z, Jiang J, et al.. High sensitivity and large measurable range distributed acoustic sensing with Rayleigh-enhanced fiber[J]. Optics letters, 2021, 46(11): 2569-2572,
CrossRef Pubmed Google scholar
[[10]]
Fang G, Xu T, Fenf S, et al.. Phase-sensitive optical time domain reflectometer based on phase-generated carrier algorithm[J]. Journal of lightwave technology, 2015, 33(13): 2811-2816,
CrossRef Google scholar
[[11]]
Wang Z, Zhang L, Wang S, et al.. Coherent Φ-OTDR based on I/Q demodulation and homodyne detection[J]. Optics express, 2016, 24(2): 853-858,
CrossRef Pubmed Google scholar
[[12]]
Tu G, Zhang X, Zhang Y, et al.. The development of an Φ-OTDR system for quantitative vibration measurement[J]. IEEE photonics technology letters, 2015, 27(12): 1349-1352,
CrossRef Google scholar
[[13]]
Wang S, Jiang J, Wang S, et al.. GPU-based fast processing for a distributed acoustic sensor using an LFM pulse[J]. Applied optics, 2020, 59(35): 11098-11103,
CrossRef Pubmed Google scholar
[[14]]
Zhu K, Zhou B, Wu H, et al.. Multipath distributed acoustic sensing system based on phase-sensitive optical time-domain reflectometry with frequency division multiplexing technique[J]. Optics and lasers in engineering, 2021, 142: 106593,
CrossRef Google scholar
[[15]]
Zhang X, Qiao W, Sun Z, et al.. A distributed optical fiber sensing system for synchronous vibration and loss measurement[J]. Optoelectronics letters, 2016, 12(5): 375-378,
CrossRef Google scholar
[[16]]
Lu Y, Zhu T, Chen L, et al.. Distributed vibration sensor based on coherent detection of phase-OTDR[J]. Journal of lightwave technology, 2010, 28(22): 3243-3249
[[17]]
Dong Y, Chen X, Liu E, et al.. Quantitative measurement of dynamic nanostrain based on a phase-sensitive optical time domain reflectometer[J]. Applied optics, 2016, 55(28): 7810-7815,
CrossRef Pubmed Google scholar
[[18]]
Masoudi A, Belal M, Newson T P. Distributed optical fiber audible frequency sensor[C]. 23rd International Conference on Optical Fiber Sensors, June 2–6, 2014, Santander, Spain, 2014 Washington SPIE 537-540
[[19]]
Franciscangelis C, Margulis W, Kjellberg L, et al.. Real-time distributed fiber microphone based on phase-OTDR[J]. Optics express, 2016, 24(26): 29597-29602,
CrossRef Pubmed Google scholar
[[20]]
Wu Y, Gan J, Li Q, et al.. Distributed fiber voice sensor based on phase-sensitive optical time-domain reflectometry[J]. IEEE photonics journal, 2015, 7(6): 1-10,
CrossRef Google scholar
[[21]]
Zhang P, Venketeswaran A, Wright R, et al.. Feature extraction for pipeline defects inspection based upon distributed acoustic fiber optic sensing data[C]. Fiber Optic Sensors and Applications XVIII, April 3–June 12, 2022, Virtual, 2022 Washington SPIE 14-29
[[22]]
Tabjula J, Sharma J. Feature extraction techniques for noisy distributed acoustic sensor data acquired in a wellbore[J]. Applied optics, 2023, 62(16): E51-E61,
CrossRef Pubmed Google scholar
[[23]]
Ning F, Cheng Z, Meng D, et al.. A framework combining acoustic features extraction method and random forest algorithm for gas pipeline leak detection and classification[J]. Applied acoustics, 2021, 182: 108255,
CrossRef Google scholar
[[24]]
Wu H, Wang C, Liu X, et al.. Intelligent target recognition for distributed acoustic sensors by using both manual and deep features[J]. Applied optics, 2021, 60(23): 6878-6887,
CrossRef Pubmed Google scholar
[[25]]
Jiang F, Li H, Zhang Z, et al.. An event recognition method for fiber distributed acoustic sensing systems based on the combination of MFCC and CNN[C]. 2017 International Conference on Optical Instruments and Technology: Advanced Optical Sensors and Applications, October 28–30, 2017, Beijing, China, 2018 Washington SPIE 15-21
[[26]]
Shi Y, Liu X, Wei C. An event recognition method based on MFCC, superposition algorithm and deep learning for buried distributed optical fiber sensors[J]. Optics communications, 2022, 522: 128647,
CrossRef Google scholar
[[27]]
Shang Y, Yang J, Chen W, et al.. Speech signal enhancement based on deep learning in distributed acoustic sensing[J]. Optics express, 2023, 31(3): 4067-4079,
CrossRef Pubmed Google scholar
[[28]]
Bencharif B A E, Bayar S, Özkan E. Parallel implementation of distributed acoustic sensor acquired signals: detection, processing, and classification[J]. Journal of applied remote sensing, 2022, 16(2): 024504-024504,
CrossRef Google scholar
[[29]]
AYVAZ U, GURULER H, KHAN F, et al. Automatic speaker recognition using mel-frequency cepstral coefficients through machine learning[J]. CMC-computers materials & continua, 2022, 71(3).
[[30]]
Arpitha Y, Mashumathi G L, Balaji N. Spectrogram analysis of ECG signal and classification efficiency using MFCC feature extraction technique[J]. Journal of ambient intelligence and humanized computing, 2022, 13(2): 757-767,
CrossRef Google scholar
[[31]]
GANCHEV T, FAKOTAKIS N, KOKKINAKIS G. Comparative evaluation of various MFCC implementations on the speaker verification task[C]//Proceedings of the SPECOM, October 17–19, 2005, Patras, Greece. Moscow, 2005: 191–194.
[[32]]
Blotekjaer K. Fundamental noise sources that limit the ultimate resolution of fiber optic sensors[C]. Optical and Fiber Optic Sensor Systems, September 16–19, 1998, Beijing, China, 1998 Washington SPIE 1-12

Accesses

Citations

Detail

Sections
Recommended

/