Full Text:   <2155>

Summary:  <1806>

CLC number: TP309.2

On-line Access: 2017-05-24

Received: 2016-08-26

Revision Accepted: 2017-03-03

Crosschecked: 2017-04-27

Cited: 0

Clicked: 5815

Citations:  Bibtex RefMan EndNote GB/T7714

-   Go to

Article info.
Open peer comments

Frontiers of Information Technology & Electronic Engineering  2017 Vol.18 No.5 P.627-643


A subband excitation substitute based scheme for narrowband speech watermarking

Author(s):  Wei Liu, Ai-qun Hu

Affiliation(s):  School of Information Science and Engineering, Southeast University, Nanjing 210096, China

Corresponding email(s):   weiliu@seu.edu.cn, aqhu@seu.edu.cn

Key Words:  Analysis filter, Linear prediction, Narrowband speech watermarking, Passband excitation replacement, Power normalization, Spectral envelope shaping, Synthesis filter

Wei Liu, Ai-qun Hu. A subband excitation substitute based scheme for narrowband speech watermarking[J]. Frontiers of Information Technology & Electronic Engineering, 2017, 18(5): 627-643.

@article{title="A subband excitation substitute based scheme for narrowband speech watermarking",
author="Wei Liu, Ai-qun Hu",
journal="Frontiers of Information Technology & Electronic Engineering",
publisher="Zhejiang University Press & Springer",

%0 Journal Article
%T A subband excitation substitute based scheme for narrowband speech watermarking
%A Wei Liu
%A Ai-qun Hu
%J Frontiers of Information Technology & Electronic Engineering
%V 18
%N 5
%P 627-643
%@ 2095-9184
%D 2017
%I Zhejiang University Press & Springer
%DOI 10.1631/FITEE.1601503

T1 - A subband excitation substitute based scheme for narrowband speech watermarking
A1 - Wei Liu
A1 - Ai-qun Hu
J0 - Frontiers of Information Technology & Electronic Engineering
VL - 18
IS - 5
SP - 627
EP - 643
%@ 2095-9184
Y1 - 2017
PB - Zhejiang University Press & Springer
ER -
DOI - 10.1631/FITEE.1601503

We propose a new narrowband speech watermarking scheme by replacing part of the speech with a scaled and spectrally shaped hidden signal. Theoretically, it is proved that if a small amount of host speech is modified, then not only an ideal channel model for hidden communication can be established, but also high imperceptibility and good intelligibility can be achieved. Furthermore, a practical system implementation is proposed. At the embedder, the power normalization criterion is first imposed on a passband watermark signal by forcing its power level to be the same as the original passband excitation of the cover speech, and a synthesis filter is then used to spectrally shape the scaled watermark signal. At the extractor, a bandpass filter is first used to get rid of the out-of-band signal, and an analysis filter is then employed to compensate for the distortion introduced by the synthesis filter. Experimental results show that the data rate is as high as 400 bits/s with better bandwidth efficiency, and good imperceptibility is achieved. Moreover, this method is robust against various attacks existing in real applications.


概要:本文建议了一种新的窄带语音水印算法,该算法将部分语音信号替换为一个经过幅度调节和频谱成形的隐蔽信号。文中从理论上证明了若仅对一小部分载体语音作修改,则不但可以建立一个用于隐蔽通信的理想信道模型,而且还能确保隐藏信号的不可感知性和带水印信号的可懂度。在此基础上,文中建议了一个实际的系统模型。在嵌入器中,使用了能量规范化准则,使得通带水印信号的能量与原始通带语音激励信号的能量相等,然后使用合成滤波器来对该水印信号进行频谱成形。在提取器中,先用带通滤波器去除带外信号,然后用分析滤波器对失真进行补偿。实验结果显示,本文建议算法的数据速率可达400 bits/s,具有更佳的带宽效率,且有良好的不可感知性。此外,该算法对实际应用中的各种攻击也是顽健的。


Darkslateblue:Affiliate; Royal Blue:Author; Turquoise:Article


[1]Cai, L.B., Tu, R.H., Zhao, J.Y., et al., 2007. Speech quality evaluation: a new application of digital watermarking. IEEE Trans. Instrum. Meas., 56(1):45-55.

[2]Chen, S., Leung, H., 2006. Concurrent data transmission through PSTN by CDMA. IEEE Int. Symp. on Circuits and Systems, p.3001-3004.

[3]Chen, S., Leung, H., Ding, H., 2007. Telephony speech enhancement by data hiding. IEEE Trans. Instrum. Meas., 56(1):63-74.

[4]Chen, Z., Zhao, C., Geng, G., et al., 2013. An audio watermark-based speech bandwidth extension method. EURASIP J. Audio Speech Music Process., 2013(1):1-8.

[5]Cheng, Q., Sorensen, J., 2001. Spread spectrum signaling for speech watermarking. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, p.1337-1340.

[6]Eslami, R., Deller, J.R.Jr, Radha, H., 2006. On the detection of multiplicative watermarks for speech signals in the wavelet and DCT domains. IEEE Int. Conf. on Multimedia and Expo, p.1369-1372.

[7]Fan, M.Q., Liu, P.P., Wang, H.X., et al., 2013. A semi-fragile watermarking scheme for authenticating audio signal based on dual-tree complex wavelet transform and discrete cosine transform. Int. J. Comput. Math., 90(12): 2588-2602.

[8]Faundez-Zanuy, M., Hagmüller, M., Kubin, G., 2006. Speaker verification security improvement by means of speech watermarking. Speech Commun., 48(12):1608-1619.

[9]Faundez-Zanuy, M., Hagmüller, M., Kubin, G., 2007. Speaker identification security improvement by means of speech watermarking. Patt. Recogn., 40(11):3027-3034.

[10]Faundez-Zanuy, M., Lucena-Molina, J.J., Hagmüller, M., 2010. Speech watermarking: an approach for the forensic analysis of digital telephonic recordings. J. Forens. Sci., 55(4):1080-1087.

[11]Malepati, H., 2010. Digital Media Processing: DSP Algorithms Using C. Elsevier, Burlington, USA, p.416-431.

[12]Hofbauer, K., Hering, H., 2007. Noise robust speech watermarking with bit synchronisation for the aeronautical radio. LNCS, 4567:252-266.

[13]Hofbauer, K., Kubin, G., 2006. High-rate data embedding in unvoiced speech. INTERSPEECH, p.241-244.

[14]Hofbauer, K., Hering, H., Kubin, G., 2005. Speech watermarking for the VHF radio channel. EUROCONTROL Innovative Research Workshop and Exhibition: Envisioning the Future, p.215-220.

[15]Hofbauer, K., Kubin, G., Kleijn, W.B., 2009. Speech watermarking for analog flat-fading bandpass channels. IEEE Trans. Audio Speech Lang. Process., 17(8):1624-1637.

[16]Nematollahi, M.A., Al-Haddad, S.A.R., 2013. An overview of digital speech watermarking. Int. J. Speech Technol., 16(4):471-488.

[17]Nematollahi, M.A., Gamboa-Rosales, H., Akhaee, M.A., et al., 2015a. Robust digital speech watermarking for online speaker recognition. Math. Probl. Eng., 2015:372398.

[18]Nematollahi, M.A., Akhaee, M.A., Al-Haddad, S.A.R., et al., 2015b. Semi-fragile digital speech watermarking for online speaker recognition. EURASIP J. Audio Speech Music Process., 2015(1):1-15.

[19]Nematollahi, M.A., Vorakulpipat, C., Rosales, H.G., 2017. Digital Watermarking: Techniques and Trends. Springer, Singapore, p.39-51.

[20]Park, C.M., Thapa, D., Wang, G.N., 2007. Speech authentication system using digital watermarking and pattern recovery. Patt. Recogn. Lett., 28(8):931-938.

[21]Sarreshtedari, S., Akhaee, M.A., Abbasfar, A., 2015. A watermarking method for digital speech self-recovery. IEEE/ACM Trans. Audio Speech Lang. Process., 23(11): 1917-1925.

[22]Suzuki, J., Hingdi, B., Yashima, H., 1997. Transmission of data on analog speech channel by spread spectrum modulation. IEEE Pacific Rim Conf. on Communications, Computers and Signal Processing, p.697-700.

[23]Wang, S.B., Unoki, M., 2015. Speech watermarking method based on formant tuning. IEICE Trans. Inform. Syst., E98D(1):29-37.

[24]Yan, B., Guo, Y.J., 2013. Speech authentication by semi-fragile speech watermarking utilizing analysis by synthesis and spectral distortion optimization. Multim. Tools Appl., 67(2):383-405.

[25]Zamani, M., Manaf, A.B.A., 2015. Genetic algorithm for fragile audio watermarking. Telecommun. Syst., 59(3): 291-304.

[26]Zheng, W.X., 2005. Fast identification of autoregressive signals from noisy observations. IEEE Trans. Circ. Syst. II, 52(1):43-48.

Open peer comments: Debate/Discuss/Question/Opinion


Please provide your name, email address and a comment

Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn
Copyright © 2000 - 2024 Journal of Zhejiang University-SCIENCE