HIGH RATE DATA HIDING IN SPEECH SIGNAL

Ehsan Jahangiri, Shahrokh Ghaemmaghami

2007

Abstract

One of the main issues with data hiding algorithms is capacity of data embedding. Most of data hiding methods suffer from low capacity that could make them inappropriate in certain hiding applications. This paper presents a high capacity data hiding method that uses encryption and the multi-band speech synthesis paradigm. In this method, an encrypted covert message is embedded in the unvoiced bands of the speech signal that leads to a high data hiding capacity of tens of kbps in a typical digital voice file transmission scheme. The proposed method yields a new standpoint in design of data hiding systems in the sense of three major, basically conflicting requirements in steganography, i.e. inaudibility, robustness, and data rate. The procedures to implement the method in both basic speech synthesis systems and in the standard mixed-excitation linear prediction (MELP) vocoder are also given in detail.

References

  1. Agaian, S.S., Akopian, D., Caglayan, O., D'Souza, S.A., 2005. Lossless Adaptive Digital Audio Steganography. Thirty-Ninth Asilomar Conference on Signals, Systems and Computers, October 28 - November 1, On page(s): 903- 906.
  2. Ansari, R., Malik, H., Khokhar, A., 2004. Data-hiding in audio using frequency-selective phase alteration. International Conference on Acoustics, Speech, and Signal Processing (ICASSP 7804), 17-21 May, vol.5 on page(s): V- 389-92.
  3. Beker, H., and Piper, F., 1982. Cipher Systems: The Protection of Communications, John Wiley & Sons.
  4. Bender, W., Gruhl, D., Morimoto, N., Lu, A., 1996. Techniques for data hiding. IBM system Journal, vol.35,nr. 3/4.
  5. Chang, P.C., Yu, H.M., 2002. Dither-like data hiding in multistage vector quantization of MELP and G.729 speech coding. Thirty-Sixth Asilomar Conference on Signals, Systems and Computers, 3-6 November, Volume 2, Page(s):1199 - 1203.
  6. Chiu, K.M., Ching, P.C., 1994. A dual-band excitation LSP codec for very low bit rate transmission. International Symposium on Speech, Image Processing and Neural Networks, 13-16 April, vol.2 on page(s): 479-482.
  7. Chu, W. C., 2003. Speech Coding Algorithms: Foundation and Evolution of Standardized Coders, John Wiley & Sons.
  8. Gopalan, K., 2005. Audio steganography by cepstrum modification. (ICASSP 7805) Volume 5, 18-23 March, v/484 Vol. Page(s):v/481.
  9. Gopalan, K., Wenndt, S., 2006. Audio Steganography for Covert Data Transmission by Imperceptible Tone Insertion. IASTED Con.f. Comm. Systems and Applications Banff, Alberta, Canada July 3-5.
  10. Griffin, D.W., Lim, J.S., 1988. Multi-band excitation vocoder. IEEE Trans. ASSP, 36(8); August, 664-678.
  11. Heys, H.M., 2001. An Analysis of the Statistical SelfSynchronization of Stream Ciphers. Proceedings of INFOCOM, Anchorage, Alaska, Apri,, pp. 897-904.
  12. ITU 1996. Coding of Speech at 8 kbit/s Using ConjugateStructure Algebraic-Code-Excited Linear-Prediction (CS-ACELP)-ITU-T Recommendation G.729.
  13. Kharrazi, M., Sencar, H.T., Memon, N., 2004. Image Steganography: Concepts and Practice. April 22, WSPC/Lecture Note Series. http://www.ims.nus.edu.sg/preprints/2004-25.pdf.
  14. Kondoz, A.M., 1994. Digital Speech: coding for Low Bit Rate Communications Systems, John Wiley & Sons.
  15. Mansour, M.F., Tewfik, A.H., 2001. time-scale invariant audio data embedding. IEEE International conference on Multimedia and Expo, ICME, Japan, August.
  16. Matsuoka H., 2006. Spread Spectrum Audio Steganography Using Sub-band Phase Shifting. Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 7806. on Dec, Page(s):3 - 6.
  17. McCree, A.V., Supplee, L.M., Cohn, R.P., Collura, J.S., 1997. MELP: The New Federal Standard at 2400 bps. IEEE ICASSP, pp. 1591-1594.
  18. Sencar, H., Ramkumar, M., Akansu, A., 2004. Data Hiding Fundamentals and Applications: Content Security in Digital Multimedia, ELSIVIER ACADEMIC PRESS.
Download


Paper Citation


in Harvard Style

Jahangiri E. and Ghaemmaghami S. (2007). HIGH RATE DATA HIDING IN SPEECH SIGNAL . In Proceedings of the Second International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2007) ISBN 978-989-8111-13-5, pages 287-292. DOI: 10.5220/0002137102870292


in Bibtex Style

@conference{sigmap07,
author={Ehsan Jahangiri and Shahrokh Ghaemmaghami},
title={HIGH RATE DATA HIDING IN SPEECH SIGNAL},
booktitle={Proceedings of the Second International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2007)},
year={2007},
pages={287-292},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002137102870292},
isbn={978-989-8111-13-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Second International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2007)
TI - HIGH RATE DATA HIDING IN SPEECH SIGNAL
SN - 978-989-8111-13-5
AU - Jahangiri E.
AU - Ghaemmaghami S.
PY - 2007
SP - 287
EP - 292
DO - 10.5220/0002137102870292