A subband excitation substitute based scheme for narrowband speech watermarking
Wei LIU, Ai-qun HU
A subband excitation substitute based scheme for narrowband speech watermarking
We propose a new narrowband speech watermarking scheme by replacing part of the speech with a scaled and spectrally shaped hidden signal. Theoretically, it is proved that if a small amount of host speech is modified, then not only an ideal channel model for hidden communication can be established, but also high imperceptibility and good intelligibility can be achieved. Furthermore, a practical system implementation is proposed. At the embedder, the power normalization criterion is first imposed on a passband watermark signal by forcing its power level to be the same as the original passband excitation of the cover speech, and a synthesis filter is then used to spectrally shape the scaled watermark signal. At the extractor, a bandpass filter is first used to get rid of the out-of-band signal, and an analysis filter is then employed to compensate for the distortion introduced by the synthesis filter. Experimental results show that the data rate is as high as 400 bits/s with better bandwidth efficiency, and good imperceptibility is achieved. Moreover, this method is robust against various attacks existing in real applications.
Analysis filter / Linear prediction / Narrowband speech watermarking / Passband excitation replacement / Power normalization / Spectral envelope shaping / Synthesis filter
[1] |
Cai, L.B., Tu, R.H., Zhao, J.Y. ,
|
[2] |
Chen, S., Leung, H., 2006. Concurrent data transmission through PSTN by CDMA.IEEE Int. Symp. on Circuits and Systems, p.3001–3004. http://dx.doi.org/10.1109/ISCAS.2006.1693256
|
[3] |
Chen, S., Leung, H., Ding, H., 2007. Telephony speech enhancement by data hiding. IEEE Trans. Instrum. Meas., 56(1):63–74. http://dx.doi.org/10.1109/TIM.2006.887409
|
[4] |
Chen, Z., Zhao, C., Geng, G.,
|
[5] |
Cheng, Q., Sorensen , J., 2001. Spread spectrum signaling for speech watermarking.IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, p.1337–1340. http://dx.doi.org/10.1109/ICASSP.2001.941175
|
[6] |
Eslami, R., Deller, J.R.Jr, Radha, H. , 2006. On the detection of multiplicative watermarks for speech signals in the wavelet and DCT domains.IEEE Int. Conf. on Multimedia and Expo, p.1369–1372. http://dx.doi.org/10.1109/ICME.2006.262793
|
[7] |
Fan, M.Q., Liu, P.P., Wang, H.X. ,
|
[8] |
Faundez-Zanuy, M., Hagmüller , M., Kubin, G. , 2006. Speaker verification security improvement by means of speech watermarking. Speech Commun., 48(12):1608–1619. http://dx.doi.org/10.1016/j.specom.2006.06.010
|
[9] |
Faundez-Zanuy, M., Hagmüller , M., Kubin, G. , 2007. Speaker identification security improvement by means of speech watermarking. Patt. Recogn., 40(11):3027–3034. http://dx.doi.org/10.1016/j.patcog.2007.02.016
|
[10] |
Faundez-Zanuy, M., Lucena-Molina , J.J., Hagmüller, M., 2010. Speech watermarking: an approach for the forensic analysis of digital telephonic recordings.J. Forens. Sci. , 55(4):1080–1087.http://dx.doi.org/10.1111/j.1556-4029.2010.01395.x
|
[11] |
Malepati, H., 2010. Digital Media Processing: DSP Algorithms Using C.Elsevier, Burlington, USA, p.416–431. http://dx.doi.org/10.1016/B978-1-85617-678-1.00008-9
|
[12] |
Hofbauer, K., Hering, H., 2007. Noise robust speech watermarking with bit synchronisation for the aeronautical radio. LNCS, 4567:252–266. http://dx.doi.org/10.1007/978-3-540-77370-2_17
|
[13] |
Hofbauer, K., Kubin, G., 2006. High-rate data embedding in unvoiced speech.INTERSPEECH, p.241–244.
|
[14] |
Hofbauer, K., Hering, H., Kubin, G., 2005. Speech watermarking for the VHF radio channel.EUROCONTROL Innovative Research Workshop and Exhibition: Envisioning the Future, p.215–220.
|
[15] |
Hofbauer, K., Kubin, G., Kleijn, W.B. , 2009. Speech watermarking for analog flat-fading bandpass channels. IEEE Trans. Audio Speech Lang. Process., 17(8):1624–1637. http://dx.doi.org/10.1109/TASL.2009.2021543
|
[16] |
Nematollahi, M.A., Al-Haddad , S.A.R., 2013. An overview of digital speech watermarking. Int. J. Speech Technol., 16(4):471–488. http://dx.doi.org/10.1007/s10772-013-9192-6
|
[17] |
Nematollahi, M.A., Gamboa-Rosales , H., Akhaee, M.A. ,
|
[18] |
Nematollahi, M.A., Akhaee , M.A., Al-Haddad, S.A.R. ,
|
[19] |
Nematollahi, M.A., Vorakulpipat , C., Rosales, H.G. , 2017. Digital Watermarking: Techniques and Trends.Springer, Singapore, p.39–51. http://dx.doi.org/10.1007/978-981-10-2095-7
|
[20] |
Park, C.M., Thapa, D., Wang, G.N., 2007. Speech authentication system using digital watermarking and pattern recovery. Patt. Recogn. Lett., 28(8):931–938. http://dx.doi.org/10.1016/j.patrec.2006.12.010
|
[21] |
Sarreshtedari, S., Akhaee , M.A., Abbasfar, A. , 2015. A watermarking method for digital speech self-recovery. IEEE/ACM Trans. Audio Speech Lang. Process., 23(11): 1917–1925. http://dx.doi.org/10.1109/TASLP.2015.2456431
|
[22] |
Suzuki, J., Hingdi, B., Yashima, H. , 1997. Transmission of data on analog speech channel by spread spectrum modulation.IEEE Pacific Rim Conf. on Communications, Computers and Signal Processing, p.697–700. http://dx.doi.org/10.1109/PACRIM.1997.620355
|
[23] |
Wang, S.B., Unoki, M., 2015. Speech watermarking method based on formant tuning. IEICE Trans. Inform. Syst., E98D(1):29–37. http://dx.doi.org/10.1587/TRANSINF.2014MUP0009
|
[24] |
Yan, B., Guo, Y.J., 2013. Speech authentication by semi-fragile speech watermarking utilizing analysis by synthesis and spectral distortion optimization. Multim. Tools Appl., 67(2):383–405. http://dx.doi.org/10.1007/s11042-011-0861-7
|
[25] |
Zamani, M., Manaf, A.B.A., 2015. Genetic algorithm for fragile audio watermarking. Telecommun. Syst., 59(3): 291–304. http://dx.doi.org/10.1007/s11235-014-9936-x
|
[26] |
Zheng, W.X., 2005. Fast identification of autoregressive signals from noisy observations. IEEE Trans. Circ. Syst. II, 52(1):43–48. http://dx.doi.org/10.1109/TCSII.2004.838435
|
/
〈 | 〉 |