MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology
Ran Zhou , Shuai Zhao , Mingming Luo , Xin Meng , Jie Ma , Jianfei Liu
Optoelectronics Letters ›› 2024, Vol. 20 ›› Issue (4) : 222 -227.
MFCC based real-time speech reproduction and recognition using distributed acoustic sensing technology
The distributed acoustic sensing technology was used for real-time speech reproduction and recognition, in which the voiceprint can be extracted by the Mel frequency cepstral coefficient (MFCC) method. A classic ancient Chinese poem “You Zi Yin”, also called “A Traveler’s Song”, was analyzed both in time and frequency domains, where its real-time reproduction was achieved with a 116.91 ms time delay. The smaller scaled MFCC0 at 1/12 of MFCC matrix was taken as a feature vector of each line against the ambient noise, which provides a recognition method via cross-correlation among the 6 original and recovered verse pairs. The averaged cross-correlation coefficient of the matching pairs is calculated to be 0.580 6 higher than 0.188 3 of the nonmatched pairs, promising an accurate and fast method for real-time speech reproduction and recognition over a passive optical fiber.
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
FERNANDZE-RUIZ M R, SOTO M A, WILLIAMS E F, et al. Distributed acoustic sensing for seismic activity monitoring[J]. APL photonics, 2020, 5(3). |
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
| [25] |
|
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
AYVAZ U, GURULER H, KHAN F, et al. Automatic speaker recognition using mel-frequency cepstral coefficients through machine learning[J]. CMC-computers materials & continua, 2022, 71(3). |
| [30] |
|
| [31] |
GANCHEV T, FAKOTAKIS N, KOKKINAKIS G. Comparative evaluation of various MFCC implementations on the speaker verification task[C]//Proceedings of the SPECOM, October 17–19, 2005, Patras, Greece. Moscow, 2005: 191–194. |
| [32] |
|
/
| 〈 |
|
〉 |