A ROBUST SPEECH COMMAND RECOGNIZER FOR EMBEDDED APPLICATIONS

Alexandre Maciel, Arlindo Veiga, Cláudio Neves, José Lopes, Carla Lopes, Fernando Perdigão, Luís Sá

2008

Abstract

This paper describes a command-based robust speech recognition system for the Portuguese language. Due to an efficient noise reduction algorithm the system can be operated in adverse noise environments such as in cars or factories. The recognizer was trained and tested with a speech database with 250 commands spoken by 345 speakers in clean and noisy conditions. The system incorporates a user friendly application programming interface and was optimized for embedded platforms with limited computational resources. Performance tests for the recognizer are presented.

References

  1. ETSI, 2003. ETSI ES 202 050 v1.1.3. Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-end Feature Extraction Algorithm; Compression Algorithms. Technical Report ETSI ES 202 050, ETSI.
  2. HTK3, 2006. The HTK book (for HTK version 3.4). Technical report, Cambridge University. England. http://htk.eng.cam.ac.uk/.
  3. Li, J.-Y., Liu, B., Wang, R.-H., and Dai L.-R., 2004. A Complexity Reduction of ETSI Advanced Front-end for DSR. In proc. of ICASSP'2004, vol. I, pp. 61-64. Montreal, Canada.
  4. Neves, C., Veiga, A., Sá, L., and Perdigão, F., 2008. Efficient Noise-Robust Speech Recognition Front-end Based on the ETSI Standard. Submitted to INTERSPEECH'2008. Brisbane, Australia.
  5. Peinado, A., and Segura, J., 2006. Speech Recognition over Digital Channels: Robustness and Standards, John Wiley & Sons, Ltd. England.
  6. Tecnovoz, 2008. http://www.tecnovoz.pt/web/home.asp.
  7. Yu, D., Ju, Y., Wang, Y.-Y., and Alex, W., 2006. N-Gram Based Filler Model for Robust Grammar Authoring. In proc. of ICASSP'2006, vol. I, pp. 565-568. Toulouse, France.
Download


Paper Citation


in Harvard Style

Maciel A., Veiga A., Neves C., Lopes J., Lopes C., Perdigão F. and Sá L. (2008). A ROBUST SPEECH COMMAND RECOGNIZER FOR EMBEDDED APPLICATIONS . In Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2008) ISBN 978-989-8111-60-9, pages 92-95. DOI: 10.5220/0001938700920095


in Bibtex Style

@conference{sigmap08,
author={Alexandre Maciel and Arlindo Veiga and Cláudio Neves and José Lopes and Carla Lopes and Fernando Perdigão and Luís Sá},
title={A ROBUST SPEECH COMMAND RECOGNIZER FOR EMBEDDED APPLICATIONS},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2008)},
year={2008},
pages={92-95},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001938700920095},
isbn={978-989-8111-60-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications - Volume 1: SIGMAP, (ICETE 2008)
TI - A ROBUST SPEECH COMMAND RECOGNIZER FOR EMBEDDED APPLICATIONS
SN - 978-989-8111-60-9
AU - Maciel A.
AU - Veiga A.
AU - Neves C.
AU - Lopes J.
AU - Lopes C.
AU - Perdigão F.
AU - Sá L.
PY - 2008
SP - 92
EP - 95
DO - 10.5220/0001938700920095