Authors: Chenguang Yang 1 ; Ghaith Hammouri 2 and Berk Sunar 1

Affiliations: 1 Worcester Polytechnic Institute, United States ; 2 Claveo Software, United States

ISBN: 978-989-8565-24-2

Keyword(s): Voice, Entropy, Mel Frequency Cepstral Coefficients, Gaussian Mixture Model.

Related Ontology Subjects/Areas/Topics: Biometrics Security and Privacy ; Identification, Authentication and Non-Repudiation ; Information and Systems Security

Abstract: We demonstrate an attack on basic voice authentication technologies. Specifically, we show how one member of a voice database can manipulate his voice in order to gain access to resources by impersonating another member in the same database. The attack targets a voice authentication system built around parallel and independent speech recognition and speaker verification modules, and assumes that an adapted Gaussian Mixture Model (GMM) is used to model basic Mel-frequency cepstral coefficient (MFCC) features of speakers. We experimentally verify our attack using the YOHO database. The experiments show that in a database of 138 users an attacker can impersonate anyone in the database with a 98% success probability after at most nine authorization attempts. The attack still succeeds, albeit at lower success rates, if fewer attempts are permitted. The attack is quite practical and highlights the limited amount of entropy that can be extracted from the human voice when using MFCC features.

Paper citation:
Yang, C.; Hammouri, G. and Sunar, B. (2012). Voice Passwords Revisited. In Proceedings of the International Conference on Security and Cryptography - Volume 1: SECRYPT, (ICETE 2012) ISBN 978-989-8565-24-2, pages 163-171. DOI: 10.5220/0004060201630171

