loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Giorgio Biagetti ; Paolo Crippa ; Alessandro Curzi ; Simone Orcioni and Claudio Turchetti

Affiliation: Università Politecnica delle Marche, Italy

Keyword(s): Speaker Identification, Speaker Recognition, Classification, Speech, Speech Frames, Short Sequences, DKLT, GMM, EM Algorithm, MFCC, Cepstral Analysis, Feature Extraction, Digitized Voice Samples.

Related Ontology Subjects/Areas/Topics: Applications ; Audio and Speech Processing ; Cardiovascular Imaging and Cardiography ; Cardiovascular Technologies ; Digital Signal Processing ; Health Engineering and Technology Applications ; Multimedia ; Multimedia Signal Processing ; Pattern Recognition ; Signal Processing ; Software Engineering ; Telecommunications

Abstract: In biometric person identification systems, speaker identification plays a crucial role as the voice is the more natural signal to produce and the simplest to acquire. Mel frequency cepstral coefficients (MFCCs) have been widely adopted for decades in speech processing to capture the speech-specific characteristics with a reduced dimensionality. However, although their ability to de-correlate the vocal source and the vocal tract filter make them suitable for speech recognition, they show up some drawbacks in speaker recognition. This paper presents an experimental evaluation showing that reducing the dimension of features by using the discrete Karhunen-Loève transform (DKLT), guarantees better performance with respect to conventional MFCC features. In particular with short sequences of speech frames, that is with utterance duration of less than 1 s, the performance of truncated DKLT representation are always better than MFCC.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.236.112.101

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Biagetti, G.; Crippa, P.; Curzi, A.; Orcioni, S. and Turchetti, C. (2015). Speaker Identification with Short Sequences of Speech Frames. In Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 2: ICPRAM; ISBN 978-989-758-077-2; ISSN 2184-4313, SciTePress, pages 178-185. DOI: 10.5220/0005191701780185

@conference{icpram15,
author={Giorgio Biagetti. and Paolo Crippa. and Alessandro Curzi. and Simone Orcioni. and Claudio Turchetti.},
title={Speaker Identification with Short Sequences of Speech Frames},
booktitle={Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 2: ICPRAM},
year={2015},
pages={178-185},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005191701780185},
isbn={978-989-758-077-2},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the International Conference on Pattern Recognition Applications and Methods - Volume 2: ICPRAM
TI - Speaker Identification with Short Sequences of Speech Frames
SN - 978-989-758-077-2
IS - 2184-4313
AU - Biagetti, G.
AU - Crippa, P.
AU - Curzi, A.
AU - Orcioni, S.
AU - Turchetti, C.
PY - 2015
SP - 178
EP - 185
DO - 10.5220/0005191701780185
PB - SciTePress