Authors:
Corneliu Octavian Dumitru
1
and
Inge Gavat
2
Affiliations:
1
1Politehnica University Bucharest, Faculty of Electronics Telecommunication and Information Technology; GET/INT 9, France
;
2
Politehnica University Bucharest, Faculty of Electronics Telecommunication and Information Technology, Romania
Keyword(s):
HMM, MFCC, PLP, LPC, context dependent modeling, continuous speech.
Related
Ontology
Subjects/Areas/Topics:
Informatics in Control, Automation and Robotics
;
Signal Processing, Sensors, Systems Modeling and Control
;
Speech Recognition
Abstract:
This paper describes continuous speech recognition experiments for Romanian language, by using HMM (Hidden Markov Models) modeling. The following questions are to be discussed: the realization of a new front-end reconsidering linear prediction, the enhancement of recognition rates by context dependent modeling, the evaluation of training strategies ensuring speaker independence of the recognition process without speaker adaptation procedures, by speaker selection for training. The experiments lead to a development of the initial system with a promising front-end based on PLP (Perceptual Linear Prediction) coefficients, second ranked for the recognition performance obtained, near the first ranked front-end based on mel-frequency cepstral coefficients (MFCC), but far better as the last ranked, based on simple linear prediction. Concerning the implemented algorithm for context dependent modeling, it permits in all situations enhanced recognition rates. The experiments made with gender s
peaker selection enhanced under certain conditions the recognition rate, proving good generalization properties especially by training with the male speakers database.
(More)