loading
Papers

Research.Publish.Connect.

Paper

Authors: Dijana Petrovska-Delacrétaz 1 and Houssemeddine Khemiri 2

Affiliations: 1 Télécom SudParis, SAMOVAR CNRS and Université Paris-Saclay, France ; 2 PW Consultants, France

ISBN: 978-989-758-222-6

Keyword(s): Unsupervised Data-driven Modeling, Hidden Markov Models, Text-dependent Speaker Verification, Concurrent Scoring.

Related Ontology Subjects/Areas/Topics: Applications ; Biomedical Engineering ; Biomedical Signal Processing ; Biometrics ; Biometrics and Pattern Recognition ; Classification ; Computer Vision, Visualization and Computer Graphics ; Image and Video Analysis ; Multimedia ; Multimedia Signal Processing ; Pattern Recognition ; Software Engineering ; Telecommunications ; Theory and Methods ; Video Analysis

Abstract: We present a text-dependent speaker verification system based on unsupervised data-driven Hidden Markov Models (HMMs) in order to take into account the temporal information of speech data. The originality of our proposal is to train unsupervised HMMs with only raw speech without transcriptions, that provide pseudo phonetic segmentation of speech data. The proposed text-dependent system is composed of the following steps. First, generic unsupervised HMMs are trained. Then the enrollment speech data for each target speaker is segmented with the generic models, and further processing is done in order to obtain speaker and text adapted HMMs, that will represent each speaker. During the test phase, in order to verify the claimed identity of the speaker, the test speech is segmented with the generic and the speaker dependent HMMs. Finally, two approaches based on log-likelihood ratio and concurrent scoring are proposed to compute the score between the test utterance and the speaker’s model. The system is evaluated on Part1 of the RSR2015 database with Equal Error Rate (EER) on the development set, and Half Total Error Rate (HTER) on the evaluation set. An average EER of 1.29% is achieved on the development set, while for the evaluation part the average HTER is equal to 1.32%. (More)

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.91.106.44

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Petrovska-Delacrétaz , D. and Khemiri, H. (2017). Unsupervised Data-driven Hidden Markov Modeling for Text-dependent Speaker Verification.In Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-222-6, pages 199-207. DOI: 10.5220/0006202001990207

@conference{icpram17,
author={Dijana Petrovska{-}Delacrétaz . and Houssemeddine Khemiri.},
title={Unsupervised Data-driven Hidden Markov Modeling for Text-dependent Speaker Verification},
booktitle={Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2017},
pages={199-207},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006202001990207},
isbn={978-989-758-222-6},
}

TY - CONF

JO - Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Unsupervised Data-driven Hidden Markov Modeling for Text-dependent Speaker Verification
SN - 978-989-758-222-6
AU - Petrovska-Delacrétaz , D.
AU - Khemiri, H.
PY - 2017
SP - 199
EP - 207
DO - 10.5220/0006202001990207

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.