loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Saeideh Mirzaei 1 ; Pierrick Milhorat 2 ; Jérôme Boudy 3 ; Gérard Chollet 4 and Mikko Kurimo 1

Affiliations: 1 Aalto University, Finland ; 2 Kyoto University, Japan ; 3 Telecom SudParis, France ; 4 Telecom ParisTech, France

Keyword(s): Speech Recognition, Speaker Adaptation, Linear Regression, Vocal Tract.

Related Ontology Subjects/Areas/Topics: Applications ; Audio and Speech Processing ; Digital Signal Processing ; Incremental Learning ; Multimedia ; Multimedia Signal Processing ; Pattern Recognition ; Software Engineering ; Telecommunications ; Theory and Methods

Abstract: To improve the performance of Automatic Speech Recognition (ASR) systems, the models must be retrained in order to better adjust to the speaker’s voice characteristics, the environmental and channel conditions or the context of the task. In this project we focus on the mismatch between the acoustic features used to train the model and the vocal characteristics of the front-end user of the system. To overcome this mismatch, speaker adaptation techniques have been used. A significant performance improvement has been shown using using constrained Maximum Likelihood Linear Regression (cMLLR) model adaptation methods, while a fast adaptation is guaranteed by using linear Vocal Tract Length Normalization (lVTLN).We have achieved a relative gain of approximately 9.44% in the word error rate with unsupervised cMLLR adaptation. We also compare our ASR system with the Google ASR and show that, using adaptation methods, we exceed its performance.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.236.112.101

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Mirzaei, S.; Milhorat, P.; Boudy, J.; Chollet, G. and Kurimo, M. (2016). Experiments on Adaptation Methods to Improve Acoustic Modeling for French Speech Recognition. In Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-173-1; ISSN 2184-4313, SciTePress, pages 278-282. DOI: 10.5220/0005703702780282

@conference{icpram16,
author={Saeideh Mirzaei. and Pierrick Milhorat. and Jérôme Boudy. and Gérard Chollet. and Mikko Kurimo.},
title={Experiments on Adaptation Methods to Improve Acoustic Modeling for French Speech Recognition},
booktitle={Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2016},
pages={278-282},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005703702780282},
isbn={978-989-758-173-1},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - Experiments on Adaptation Methods to Improve Acoustic Modeling for French Speech Recognition
SN - 978-989-758-173-1
IS - 2184-4313
AU - Mirzaei, S.
AU - Milhorat, P.
AU - Boudy, J.
AU - Chollet, G.
AU - Kurimo, M.
PY - 2016
SP - 278
EP - 282
DO - 10.5220/0005703702780282
PB - SciTePress