MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology

Ran Zhou; Shuai Zhao; Mingming Luo; Xin Meng; Jie Ma; Jianfei Liu

Optoelectronics Letters, 2024, Vol. 20, Issue 4: 222-227. DOI: 10.1007/s11801-024-3167-5

Abstract

Distributed acoustic sensing technology was used for real-time speech reproduction and recognition, with the voiceprint extracted by the Mel frequency cepstral coefficient (MFCC) method. A classic ancient Chinese poem, “You Zi Yin” (also called “A Traveler’s Song”), was analyzed in both the time and frequency domains, and its real-time reproduction was achieved with a time delay of 116.91 ms. The smaller-scale MFCC₀, at 1/12 of the MFCC matrix, was taken as the feature vector of each line against the ambient noise, which provides a recognition method based on cross-correlation among the 6 pairs of original and recovered verses. The averaged cross-correlation coefficient of the matching pairs is calculated to be 0.5806, higher than the 0.1883 of the nonmatched pairs, promising an accurate and fast method for real-time speech reproduction and recognition over a passive optical fiber.
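The recognition step described in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that are not stated in the abstract: MFCC₀ is read as the zeroth-coefficient row of a 12-row MFCC matrix, the cross-correlation coefficient is taken as the zero-lag normalised (Pearson) coefficient, and the six "verses" are synthetic noise bursts with random loudness contours rather than speech. The sample rate, frame length, gain, and noise level are likewise illustrative values, and all function names are hypothetical.

```python
import cmath
import math
import random


def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters evenly spaced on the mel scale, one weight list per filter."""
    top = hz_to_mel(sample_rate / 2.0)
    edges = [mel_to_hz(top * i / (n_filters + 1)) * n_fft / sample_rate
             for i in range(n_filters + 2)]  # fractional FFT-bin positions
    bank = []
    for j in range(1, n_filters + 1):
        lo, mid, hi = edges[j - 1], edges[j], edges[j + 1]
        bank.append([(k - lo) / (mid - lo) if lo < k <= mid
                     else (hi - k) / (hi - mid) if mid < k < hi
                     else 0.0
                     for k in range(n_fft // 2 + 1)])
    return bank


def mfcc0(signal, sample_rate=8000, frame_len=128, hop=64, n_filters=12):
    """Zeroth cepstral coefficient per frame (assumed meaning of MFCC0)."""
    window = [0.54 - 0.46 * math.cos(2 * math.pi * t / (frame_len - 1))
              for t in range(frame_len)]  # Hamming window
    twiddle = [cmath.exp(-2j * math.pi * i / frame_len) for i in range(frame_len)]
    bank = mel_filterbank(n_filters, frame_len, sample_rate)
    feats = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + t] * window[t] for t in range(frame_len)]
        # Naive DFT power spectrum; adequate for a short sketch.
        power = [abs(sum(frame[t] * twiddle[(k * t) % frame_len]
                         for t in range(frame_len))) ** 2
                 for k in range(frame_len // 2 + 1)]
        log_mel = [math.log(sum(w * p for w, p in zip(row, power)) + 1e-12)
                   for row in bank]
        feats.append(sum(log_mel))  # unnormalised DCT-II, index 0
    return feats


def corr_coeff(a, b):
    """Zero-lag normalised cross-correlation (Pearson coefficient)."""
    n = min(len(a), len(b))
    a, b = a[:n], b[:n]
    ma, mb = sum(a) / n, sum(b) / n
    num = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    den = math.sqrt(sum((x - ma) ** 2 for x in a) * sum((y - mb) ** 2 for y in b))
    return num / den if den else 0.0


def synth_verse(rng, n_segments=12, seg_len=160):
    """Hypothetical stand-in for one spoken line: noise with a random loudness contour."""
    out = []
    for _ in range(n_segments):
        amp = 10.0 ** rng.uniform(-1.3, 0.0)
        out.extend(amp * rng.gauss(0.0, 1.0) for _ in range(seg_len))
    return out


rng = random.Random(0)
originals = [synth_verse(rng) for _ in range(6)]
# "Recovered" copies: attenuated originals plus additive noise (made-up values).
recovered = [[0.5 * x + rng.gauss(0.0, 0.01) for x in verse] for verse in originals]

feat_orig = [mfcc0(v) for v in originals]
feat_rec = [mfcc0(v) for v in recovered]
corr = [[corr_coeff(a, b) for b in feat_rec] for a in feat_orig]

# Recognition: each original line is assigned to the recovered line it correlates with most.
best_match = [max(range(6), key=lambda j: row[j]) for row in corr]
matched = sum(corr[i][i] for i in range(6)) / 6
nonmatched = sum(corr[i][j] for i in range(6) for j in range(6) if i != j) / 30
print(best_match, round(matched, 3), round(nonmatched, 3))
```

The synthetic signals are far cleaner than fiber-recovered speech, so the coefficients printed here are not comparable to the 0.5806 and 0.1883 reported above; the sketch only shows the shape of the comparison. In practice an FFT-based MFCC routine from a signal-processing library would replace the naive DFT.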

Cite this article

Ran Zhou, Shuai Zhao, Mingming Luo, Xin Meng, Jie Ma, Jianfei Liu. MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology. Optoelectronics Letters, 2024, 20(4): 222-227. https://doi.org/10.1007/s11801-024-3167-5
