Improved spectral dynamic features extracted from audio data for classification of marine vessels
Murillo de Brito Santos , Rogério de Moraes Calazan
Intelligent Marine Technology and Systems ›› 2024, Vol. 2 ›› Issue (1)
Improved spectral dynamic features extracted from audio data for classification of marine vessels
Underwater sound classification presents a unique challenge due to the complex propagation characteristics of sound in water, including absorption, scattering, and refraction. These complexities can distort and alter spectral features, hindering the effectiveness of traditional feature extraction methods for vessel classification. To address this challenge, this study proposes a novel feature extraction method that combines Mel-frequency cepstral coefficients (MFCCs) with a spectral dynamic feature (SDF) vector. MFCCs capture the spectral content of the audio signal, whereas SDF provides information on the temporal dynamics of spectral features. This combined approach aims to achieve a more comprehensive representation of underwater vessel sounds, potentially leading to improved classification accuracy. Validation with real-world underwater audio recordings demonstrated the effectiveness of the proposed method. Results indicated an improvement of up to 94.68% in classification accuracy when combining SDF with several classical extractors evaluated. This finding highlights the potential of SDF in overcoming the challenges associated with underwater sound classification.
Underwater sound classification / Feature extraction / Cepstral domain / Vessel classification / Mel filter bank
| [1] |
Alcaraz Meseguer N (2009) Speech analysis for automatic speech recognition. Master’s thesis, Department of Electronics and Telecommunications |
| [2] |
|
| [3] |
Chen Y, Xu X (2017) The research of underwater target recognition method based on deep learning. In: 2017 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC), Xiamen, pp 1–5. https://doi.org/10.1109/ICSPCC.2017.8242464 |
| [4] |
|
| [5] |
Dixit A, Vidwans A, Sharma P (2016) Improved MFCC and LPC algorithm for Bundelkhandi isolated digit speech recognition. In: 2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT), Chennai, pp 3755–3759 |
| [6] |
|
| [7] |
|
| [8] |
Li XX, Yang S, Yu M (2008) Feature extraction from underwater signals using wavelet packet transform. In: 2008 International Conference on Neural Networks and Signal Processing, Nanjing, pp 400–405. https://doi.org/10.1109/ICNNSP.2008.4590381 |
| [9] |
Lian Z, Xu K, Wan J, Li G (2017) Underwater acoustic target classification based on modified GFCC features. In: 2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), Chongqing, pp 258–262. https://doi.org/10.1109/IAEAC.2017.8054017 |
| [10] |
Liu J, He Y, Liu Z, Xiong Y (2014) Underwater target recognition based on line spectrum and support vector machine. In: Proceedings of the 2014 International Conference on Mechatronics, Control and Electronic Engineering, Shenyang, pp 79–84. https://doi.org/10.2991/mce-14.2014.17 |
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
Sabara R, Soares C, Zabel F, Oliveira J, Jesus S (2020) Automatic acoustic target detection and classification off the coast of portugal. In: Global Oceans 2020: Singapore–U.S. Gulf Coast, Biloxi, pp 1–9. https://doi.org/10.1109/IEEECONF38699.2020.9389067 |
| [16] |
|
| [17] |
Saravanan R, Sujatha P (2018) A state of art techniques on machine learning algorithms: a perspective of supervised learning approaches in data classification. In: 2018 Second International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, pp 945–949 |
| [18] |
|
| [19] |
Tong Y, Zhang X, Ge Y (2020) Classification and recognition of underwater target based on MFCC feature extraction. In: 2020 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC), Macau, pp 1–4. https://doi.org/10.1109/ICSPCC50002.2020.9259457 |
| [20] |
Trang H, Tran L, Nam H (2015) Proposed combination of PCA and MFCC feature extraction in speech recognition system. In: 2014 International Conference on Advanced Technologies for Communications (ATC 2014), Hanoi, pp 697–702. https://doi.org/10.1109/ATC.2014.7043477 |
| [21] |
Winursito A, Hidayat R, Bejo A (2018) Improvement of MFCC feature extraction accuracy using PCA in indonesian speech recognition. In: 2018 International Conference on Information and Communications Technology (ICOIACT), Yogyakarta, pp 379–383 |
| [22] |
|
| [23] |
|
| [24] |
Zhang Y, Jiao L, Hu S (1998) An efficient method of target classification. In: ICSP ’98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344), Beijing, pp 1181–1184. https://doi.org/10.1109/ICOSP.1998.770828 |
/
| 〈 |
|
〉 |