loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Jan Vanek ; Josef V. Psutka ; Aleš Pražák and Josef Psutka

Affiliation: West Bohemia University, Czech Republic

Keyword(s): Acoustic models training, discriminative training, clustering, gender-dependent models.

Related Ontology Subjects/Areas/Topics: Applications ; Audio and Speech Processing ; Digital Signal Processing ; Multimedia ; Multimedia Signal Processing ; Pattern Recognition ; Software Engineering ; Telecommunications

Abstract: The paper deals with training of speaker-clustered acoustic models. Various training techniques - Maximum Likelihood, Discriminative Training and two adaptation based on the MAP and Discriminative MAP were tested in order to minimize an impact of speaker changes to the correct function of the recognizer when a response of the automatic cluster detector is delayed or incorrect. Such situation is very frequent e.g. in online subtitling of TV discussions (Parliament meetings). In our experiments the best cluster-dependent training procedure was discriminative adaptation which provided the best trade-off between recognition results with correct and non-correct cluster detector information.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.226.222.12

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Vanek, J.; V. Psutka, J.; Pražák, A. and Psutka, J. (2009). TRAINING OF SPEAKER-CLUSTERED ACOUSTIC MODELS FOR USE IN REAL-TIME RECOGNIZERS. In Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2009) - SIGMAP; ISBN 978-989-674-007-8, SciTePress, pages 131-135. DOI: 10.5220/0002262001310135

@conference{sigmap09,
author={Jan Vanek. and Josef {V. Psutka}. and Aleš Pražák. and Josef Psutka.},
title={TRAINING OF SPEAKER-CLUSTERED ACOUSTIC MODELS FOR USE IN REAL-TIME RECOGNIZERS},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2009) - SIGMAP},
year={2009},
pages={131-135},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002262001310135},
isbn={978-989-674-007-8},
}

TY - CONF

JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2009) - SIGMAP
TI - TRAINING OF SPEAKER-CLUSTERED ACOUSTIC MODELS FOR USE IN REAL-TIME RECOGNIZERS
SN - 978-989-674-007-8
AU - Vanek, J.
AU - V. Psutka, J.
AU - Pražák, A.
AU - Psutka, J.
PY - 2009
SP - 131
EP - 135
DO - 10.5220/0002262001310135
PB - SciTePress